Do they get some kind of real-time feed that tells them “hey this URL popped up in the web today, but it is a tracker, so block it”, or is this exercise is mostly helped by the crowd ?
The easy way would be to make a website sign up for all the ads and see what happens. Subtract your website from the data and there’s the ads.
The real answer will probably end up being that they ARE the ad companies in disguise.
As someone who runs a popular blocklist collection, I’ve come to find that most of the MASSIVE lists are people who collate a whole bunch of lists together and then promote their “one size fits all” solution alongside their donation link. There are very few original high quality ad-blocking lists maintained (where originality is defined as a sizeable amount of unique entries not shared by other lists) and almost all don’t appear to openly discuss the magic sauce behind their lists, outside of the obvious case of user submissions.
Nice try, ad company.
Lol
deleted by creator
And here are uBlockOrigin’s filterlist issue tracker: https://github.com/ublockorigin/uassets/issues
On the issue comment you can see maintainers @-ing each other to add things to upstream lists, so it’s all one big community rather than being extension specific.
deleted by creator
The problem with this approach is that the companies will just change the way ads are shown. DNS blocking is impossible to stop, provides you block every ad website.
DNS blocking is easy to stop, you just host the ads on the same domain instead of putting them on a subdomain. There are plenty of ways to do this already. Only reason it works right now is that lots of them have their own separate ad domain that they host from.
Maybe a general pop-up blocker which is built into modern browsers now, but something that blocks tracking and ads (for example uBlock Origin, AdGuard Home, PiHole…) works off of a list which is kept up to date by crowdsourcing. I’ve never contributed to one of these efforts, but there are lots of people dedicated to the cause.
deleted by creator
Nerds. Hundreds and hundreds of them.
Regular expression magic.
A lot of ad networks have a pattern to the name or the window the advert appears in.
Using regular expression you can find just the adwindow and ignore the actual content.
Now what is regular expression? A wizard language.
ask any programmer about RE after 4 beers and watch the hate wash over their face.The first language I learned was Perl, so regex are very close to my heart. I’m also quite excitable when I drink (I’m a happy drunk), so ask me and I’ll give you a very enthusiastic explanation while not noticing that you aren’t interested in my detailed explanation and examples. Do it. I dare ya.
Decade of c# game design under my belt, but never dealt with web scripting. Am I missing out?
You are missing out on headaches.
I love regex, I’m not even gonna lie. To be fair, my expressions haven’t been 50+ characters long, but still.
Same, I’ve never understood the hate. But then again I memorize based off of patterns and regex in my brain is just a pattern.
I used to play regex games online lol. I once wrote a pattern for work that was ~200 characters. I loved doing it lol.
Regex absolutely has many great uses. The issue is people trying to use for things they shouldn’t. Then it suddenly becomes a nightmare.
Can I parse [X]HTML with regex?
You can parse any plaintext with regex, but I would recommend using XPath for that use case, instead.