Hi All, I am trying to generate a list of all the universities in Germany, France and Denmark. I built my query below. However, if you run it in dbpedia, you'll notice that it didn't capture all of the universities, and in the results there were even universities in Italy and other countries that we don't want. What may possibly be the issue here?

To see the result: Here

     ?e rdf:type   <http://schema.org/CollegeOrUniversity> .
    {  ?e <http://dbpedia.org/ontology/country> ?country; <http://dbpedia.org/ontology/country> <http://dbpedia.org/resource/France> .
     ?e <http://dbpedia.org/ontology/country> ?country; <http://dbpedia.org/ontology/country> <http://dbpedia.org/resource/Germany> .
     ?e <http://dbpedia.org/ontology/country> ?country; <http://dbpedia.org/ontology/country> <http://dbpedia.org/resource/Denmark> .

asked 22 Feb '13, 09:58

yonk's gravatar image

accept rate: 0%

edited 22 Feb '13, 11:02

Hi, there's nothing wrong with your query. However, based on the DBpedia data, there can be universities that are located in many countries (international universities). For instance, this university is located in Germany, Italy and France.

permanent link

answered 22 Feb '13, 10:31

fadirra's gravatar image

accept rate: 21%

thanks for the quick response, however, many universities didn't even make it to the result. like university of paderborn in germany http://dbpedia.org/page/University_of_Paderborn

(22 Feb '13, 10:36) yonk yonk's gravatar image

here is another uni in france that didn't make it to the results: http://dbpedia.org/page/Montpellier_2_University

(22 Feb '13, 10:39) yonk yonk's gravatar image

Because you only specified the types to be either http://dbpedia.org/class/yago/University108286163 or http://dbpedia.org/resource/Public_university. If you look at the data of Paderborn, especially in the values of rdf:type property, you'd understand why.

(22 Feb '13, 10:40) fadirra fadirra's gravatar image

do you mind to elaborate? we switched to http://schema.org/CollegeOrUniversity, we get much more results, but still not all of the results (e.g. Paderborn). I have also updated my query above

(22 Feb '13, 11:01) yonk yonk's gravatar image

To add to @fadirra's answer, and to ask the more general underlying question, DBpedia is neither 100% correct nor 100% complete nor 100% "consistent" (in terms of use of vocabulary):

  1. The underlying data source -- Wikipedia -- is itself not 100% correct or complete.
  2. Wikipedia is not designed to have RDF exported from it. DBpedia often fails to correctly and completely extract (even structured) data that Wikipedia has available. DBpedia is a best effort exercise to do so.
  3. Wikipedia editors are many and only agree to de facto conventions, which may mean heterogeneity in how similar things are described (in Wikipedia and in RDF).

In other words, don't assume that queries against DBpedia will always return all and only correct results, and don't assume that a specific structure of query will hit upon all possible heterogeneities in how the target data are described.

If you feel this isn't good enough, I guess keeping an eye on Wikidata developments might be a good idea. ;)

permanent link

answered 22 Feb '13, 12:54

Signified's gravatar image

Signified ♦
accept rate: 37%

edited 22 Feb '13, 12:58

I should add, this is not a criticism of the great ongoing work over at DBpedia. This is just a reflection of how difficult a task it is to automatically extract comprehensive, high quality RDF data from a source like Wikipedia.

(22 Feb '13, 12:55) Signified ♦ Signified's gravatar image
Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here



Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Question tags:


question asked: 22 Feb '13, 09:58

question was seen: 585 times

last updated: 22 Feb '13, 12:58