It looks like they try to compensate for that, per their FAQ page:
> For the surveys, we count the top 10 million websites according to Alexa and Tranco, see our technology overview for more explanations. We do crawl more sites, but we use the top 10 million to select a representative sample of established sites. We found that including more sites in the sample (e.g. all the sites we know) may easily lead to a bias towards technologies typically used for "throw-away" sites or parked sites or other types of spam domains.
There are content-writing AI's which you almost cannot tell is not written by a human, I would be surprised if an automated crawler would be able to tell the difference when humans barely can.
I've even seen agencies use these tools regularly, which makes it possible to spew out several sites per day.
Yes, it actually is. Web agencies throughout the world build and launch thousands (likely more) of sites a day in Wordpress. They are all legitimate businesses that once had other CMSs or just never had a web presence.
The sites stay around for a year on cheap domains and re-appear in a new suit the year after.
This will continue as long as search engines continue to favor Wordpress.