in our college we are working on a project called Proctor management system.the project is about developing a web based system to build a relationship between staff and sudents.our college website has separate logins for students and staffs.our project's aim is to relate the students and their incharge staff.we have already created an ontology using we need to map the original data into this ontology.I'm a newbie to semantic what should be done next?can anybody please help me

asked 21 Nov '12, 09:31

semanticmania's gravatar image

accept rate: 0%


Thank you for being honest that this is for a course. In any case to help we need a lot more context than you are currently providing e.g. what format is the original data in. In any case don't forget to tell your prof/teacher that you used this site.

(22 Nov '12, 05:29) Jerven Jerven's gravatar image

Sure sir.The data may consists of the student roll number,subject wise marks and attendance and his proctor's name.Therefore if we speak in terms of database(which is known to me well)there will be two table student table and the staff table.Attributes of student table are roll num(primary key),name,marks,attendance,proctor id(foreign key).The attributes of staff table are staff id(primary key) ,name,student id

(22 Nov '12, 06:13) semanticmania semanticmania's gravatar image

@semanticmania, I guess your question could then be rephrased: how to convert relational data into RDF according to a given ontology?

In that case, you should check out this question:

Hope this helps.

(22 Nov '12, 12:08) Signified ♦ Signified's gravatar image

@Jerven @Signified I need not convert RDBMS to RDF. My need is that I should develop a crawler such that it should crawl the web pages of our college website(for eg.An Attendance page of a student which is in xml should be mapped with my student ontology.Is there any way to do it??

(26 Nov '12, 00:27) semanticmania semanticmania's gravatar image

Using an XSLT to convert the source XML into RDF/XMl is probably simplest. see this old tutorial for the general idea you will need to modify this to your exact needs yourself.


answered 26 Nov '12, 09:18

Jerven's gravatar image

accept rate: 34%

Another option if you are uncomfortable with xslt:

1 - What are the Relevant Pages?
You need to be able to get a list of the pages you will need eg. student pages, staff pages etc. Extracting links on an index page is usually helpful for this. If an index page is not available, you might need to crawl pages (ie. follow links) looking for links of a certain pattern

2 - Get the Relevant Pages
ie. Download the contents of the pages - in html / xml etc.

3 - Extract the Data of Interest
Extract relevant information using a parser of your choice. (ie. XPATH / regex / library specific navigation to data of interest - tools such as firebug can be useful for XPATH)

4 - Convert Data of Interest into Triples (using terms from your ontology)
Lots of rdf libraries you can use. Or you could generate the n triples by simply writing to a text file.

Data Cleaning
Certain terms eg. research interests - might appear as strings rather than hyperlinks. When converting to RDF - it is probably desirable to convert things like being interested in "Semantic Web" (the string) to (the concept).

The libraries depend on your langauge eg. in python ElementTree / BeautifulSoup (for scraping), or Scrapy for more extensive web crawling with RDFlib to generate the RDF


answered 26 Nov '12, 14:44

Sweet%20Burlap's gravatar image

Sweet Burlap
accept rate: 18%

@Sweet Burlap Sir,Ur answer is so helpful and i think it is analogous to my need.When i asked for web crawler in a blog they referred this link. whether it is enough to use this jar file and do my jobs??

(29 Nov '12, 09:30) semanticmania semanticmania's gravatar image

I've never used it, but from my understanding - ldspider is designed specifically to crawl data that is already in RDF. (As opposed to crawling HTML and extracting that data into RDF)

(29 Nov '12, 10:06) Sweet Burlap Sweet%20Burlap's gravatar image

You might also be interested in -

doesnt seem to work on current versions of firefox though :-(


answered 26 Nov '12, 15:04

Sweet%20Burlap's gravatar image

Sweet Burlap
accept rate: 18%

edited 26 Nov '12, 15:09

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here



Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text]( "Title")
  • image?![alt text](/path/img.jpg "Title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported



Asked: 21 Nov '12, 09:31

Seen: 923 times

Last updated: 29 Nov '12, 10:06