I'm searching a way to find "all" the sites ending with an given tld. I had several ideas on how to realize that, but I'm not sure what is the best/most effectiv way to realize this. I'm aware that pages that are linked nowhere aren't findable by spiders etc, so fir this example I'll not care about isolated pages. What I want to do, I want to have an TLD as input for my programm, and I which to have a list of sites as output. For example:
# <program> .de
- spiegel.de
- deutsche-bank.de
...
- bild.de
So what is the best way to reach this? Are there tools available to help me, or how would you program this?