I have been working on converting the Wikipedia tables into N-Triple custom format. by "custom" I mean I am concerned with the literals of the entries not the URI or ID or anything, so my Triple would be something like < USA > < Captial > < Washington D.C. >

My main concern here is the semantic of the subject, I mean, how am I going to convert the notion of a table into a triple format? tables can be formed in any way, and some table doesn't indicate clearly in their headers what they're describing ....etc

So far I've been able to download the wikipedia dump and extract the tables only in a separate file and I parsed the tables into HTML to be able to deal with it more easily than in the Wiki Markup format.

I want to know if there's some kind of convention for forming tables into triples or something similar.

asked 12 Nov '12, 05:39

sherifkandeel's gravatar image

sherifkandeel
23915
accept rate: 0%

Have you look at dbpedia (http://dbpedia.org/About)? Doesn't make sense to re-invent the wheel.

(12 Nov '12, 08:04) mhgrove mhgrove's gravatar image

DBpedia gets it's information from the Infoboxes only, I wan to implement a triple format for the Tables of Wikipedia

(12 Nov '12, 08:39) sherifkandeel sherifkandeel's gravatar image
Be the first one to answer this question!
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Question tags:

×206
×148
×22
×14

question asked: 12 Nov '12, 05:39

question was seen: 630 times

last updated: 12 Nov '12, 08:39