[Remove Request] - Large list (VERY) #907
Replies: 3 comments 5 replies
-
That would be great if you could provide that script. We had a contributor that was working on scripts for us but IRL got busy and he has not been able to help as much. I have been learning python so I could take over but, I am a ways off from where I need to be to create it. Thank you for your offer! |
Beta Was this translation helpful? Give feedback.
-
It's the least I could do to give back for even having this awesome resource to begin with. I just need to tidy up the script, make the outputs a little more readable, then add some arguments so settings can be set via shell. I was thinking I could put the code in a gist with an unlicense [and dedicate it to your group] then just put the link here. Does that work with your contribution requirements? |
Beta Was this translation helpful? Give feedback.
-
@DOWRIGHTTV I am using the python script, starting on some smaller lists. I am looking at the output and unless I missed it somewhere I am not sure what I am looking at.
Can you help me understand the |
Beta Was this translation helpful? Give feedback.
-
I came across these lists while trying to find some open-source data to use as signatures for an open-source security appliance project I am working on. ( https://github.com/DOWRIGHTTV/dnxfirewall )
URL you wish to be removed: do you guys currently have a system to validate whether an active domain still resolves to a valid IP address?
Why you believe this to be a false positive: failed to resolve name.
List it is on: working on pornography at the moment.
Other info you think we should know: if you guys don't already have a system, I can provide a python script that can do it for you. the current state takes a while for large lists (I think this is more to being throttled by public resolver though because my process was running less than 5%). I have currently processed 50k of the names and ~16k of them did not return a valid (A) record. I need to do a few more spot checks to make sure there wasn't any failures due to too many retries though. my current settings took 40 minutes to get through the 50k.
anyways, let me know if you would like me to provide you the lists I have so far and how i can get them to you if so. they also have the successfully resolved IP addresses since they could be useful in determining whether hosts can be combined/merged or if any are sub-domains to another domain on the list.
Beta Was this translation helpful? Give feedback.
All reactions