I am trying to collect foaf.rdf files for my research project. One possible way I thought of collecting these files was to crawl the web. I was wondering if there is an repository that maintains a list of these files similar to that of CKAN which is a repository for linked data sets. Can someone help with this question?
If there is no such repository, is there any method which is better than crawling?
I don't know of any repositories but if you Google filetype:foaf it will return .foaf files in it's index. You can be more specific so
Will return .foaf files with the word smith in them. I haven't played too much with this, but I'd imagine it could help.
answered 17 Jan, 07:51
I would have pointed you to pingthesemanticweb.com but that service seems to be offline.
You could also look through the contents of the BTC-2012 dataset. This will contain a lot of FOAF data alongside other stuff. You could filter the data you need from the dumps.
You could also try queries against the LOD cache SPARQL endpoint. Here's one idea:
And here's some
You could also try something similar over at Sindice.
I should add that a lot of FOAF data available on the Web comes from a small number of sites like