Jonas Brekle | 13 Mar 2012 13:22
Picon
Gravatar

[ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps

Hi lists,

we are proud to announce that we now host the data we extract from
wiktionary publicly on wiktionary.dbpedia.org.

We offer Linked Data: http://wiktionary.dbpedia.org/resource/word
a SPARQL endpoint: http://wiktionary.dbpedia.org/sparql
and N-Triple Dumps: http://downloads.dbpedia.org/wiktionary/

There is also a wiki explaining some details:
http://wiki.dbpedia.org/Wiktionary/

We currently extracted data from the English and German Wiktionary (28M
triples and 3.7M triples), but plan to extend that to at least the
biggest 5 wiktionaries within the next weeks, as our approach focuses on
extendability. The data for each word is structured hierarchically (as
wiktionary is) and contains information about language, part of speech,
definitions, translations, synonyms, hyperonyms and hyponyms etc.
There might be some quality issues, but we want to release early, so
bear with us and report major problems.

Thanks goes to the wiktionary community which does a great job creating
this dataset, and we hope to enable new use cases and consequently
promote the contribution to the wiktionary project.

Regards,
Jonas Brekle
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org
(Continue reading)

Bernard Vatant | 13 Mar 2012 14:48
Favicon
Gravatar

Re: [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps

Hi Jonas

Great resource! I'm curious, though, about the vocabulary (predicates) used, such as

http://wiktionary.dbpedia.org/terms/hasPoSUsage
http://wiktionary.dbpedia.org/terms/hasMeaning

The above URIs are not dereferencable, at least not to a usable description, formal or not, neither is the namespace http://wiktionary.dbpedia.org/terms/

Will this vocabulary be published at some point? And did you consider reusing existing predicates from existing vocabularies?
such as http://lexvo.org/ontology# or http://linguistics-ontology.org/gold/
... or other listed at http://labs.mondeca.com/dataset/lov/details/vocabularySpace_Vocabularies.html

Best regards

Bernard

Le 13 mars 2012 13:22, Jonas Brekle <jonas.brekle <at> gmail.com> a écrit :
Hi lists,

we are proud to announce that we now host the data we extract from
wiktionary publicly on wiktionary.dbpedia.org.

We offer Linked Data: http://wiktionary.dbpedia.org/resource/word
a SPARQL endpoint: http://wiktionary.dbpedia.org/sparql
and N-Triple Dumps: http://downloads.dbpedia.org/wiktionary/

There is also a wiki explaining some details:
http://wiki.dbpedia.org/Wiktionary/

We currently extracted data from the English and German Wiktionary (28M
triples and 3.7M triples), but plan to extend that to at least the
biggest 5 wiktionaries within the next weeks, as our approach focuses on
extendability. The data for each word is structured hierarchically (as
wiktionary is) and contains information about language, part of speech,
definitions, translations, synonyms, hyperonyms and hyponyms etc.
There might be some quality issues, but we want to release early, so
bear with us and report major problems.

Thanks goes to the wiktionary community which does a great job creating
this dataset, and we hope to enable new use cases and consequently
promote the contribution to the wiktionary project.

Regards,
Jonas Brekle
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org


------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion



--
Bernard Vatant
Vocabularies & Data Engineering
Tel :  + 33 (0)9 71 48 84 59
Skype : bernard.vatant
Linked Open Vocabularies

--------------------------------------------------------
Mondeca                             
3 cité Nollez 75018 Paris, France
Follow us on Twitter : <at> mondecanews

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
Jonas Brekle | 13 Mar 2012 18:57
Picon
Gravatar

Re: [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps

Am Dienstag, den 13.03.2012, 14:48 +0100 schrieb Bernard Vatant:
> Hi Jonas 
> 
> Great resource! I'm curious, though, about the vocabulary (predicates)
> used, such as 
> 
> http://wiktionary.dbpedia.org/terms/hasPoSUsage
> http://wiktionary.dbpedia.org/terms/hasMeaning
> 
> The above URIs are not dereferencable, at least not to a usable
> description, formal or not, neither is the namespace
> http://wiktionary.dbpedia.org/terms/
> 
> Will this vocabulary be published at some point?
yes, we need to fix this soon.
to be honest this is just a dummy vocabulary until we decide what to
reuse.
>  And did you consider reusing existing predicates from existing
> vocabularies?
> such as http://lexvo.org/ontology# or
> http://linguistics-ontology.org/gold/
> ... or other listed at
> http://labs.mondeca.com/dataset/lov/details/vocabularySpace_Vocabularies.html

yes, some of these.

also the schema might change: the data is very hierarchical and we might
(additionally?) transform it to "word" -> "senses".

> Best regards
> 
> Bernard
> 
> Le 13 mars 2012 13:22, Jonas Brekle <jonas.brekle <at> gmail.com> a écrit :
>         Hi lists,
>         
>         we are proud to announce that we now host the data we extract
>         from
>         wiktionary publicly on wiktionary.dbpedia.org.
>         
>         We offer Linked Data:
>         http://wiktionary.dbpedia.org/resource/word
>         a SPARQL endpoint: http://wiktionary.dbpedia.org/sparql
>         and N-Triple Dumps: http://downloads.dbpedia.org/wiktionary/
>         
>         There is also a wiki explaining some details:
>         http://wiki.dbpedia.org/Wiktionary/
>         
>         We currently extracted data from the English and German
>         Wiktionary (28M
>         triples and 3.7M triples), but plan to extend that to at least
>         the
>         biggest 5 wiktionaries within the next weeks, as our approach
>         focuses on
>         extendability. The data for each word is structured
>         hierarchically (as
>         wiktionary is) and contains information about language, part
>         of speech,
>         definitions, translations, synonyms, hyperonyms and hyponyms
>         etc.
>         There might be some quality issues, but we want to release
>         early, so
>         bear with us and report major problems.
>         
>         Thanks goes to the wiktionary community which does a great job
>         creating
>         this dataset, and we hope to enable new use cases and
>         consequently
>         promote the contribution to the wiktionary project.
>         
>         Regards,
>         Jonas Brekle
>         Department of Computer Science, University of Leipzig
>         Research Group: http://aksw.org
>         
>         
>         ------------------------------------------------------------------------------
>         Keep Your Developer Skills Current with LearnDevNow!
>         The most comprehensive online learning library for Microsoft
>         developers
>         is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5,
>         CSS3, MVC3,
>         Metro Style Apps, more. Free future releases when you
>         subscribe now!
>         http://p.sf.net/sfu/learndevnow-d2d
>         _______________________________________________
>         Dbpedia-discussion mailing list
>         Dbpedia-discussion <at> lists.sourceforge.net
>         https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
> 
> 
> 
> -- 
> Bernard Vatant
> Vocabularies & Data Engineering
> Tel :  + 33 (0)9 71 48 84 59
> Skype : bernard.vatant
> Linked Open Vocabularies
> 
> 
> --------------------------------------------------------
> 
> Mondeca                             
> 3 cité Nollez 75018 Paris, France
> www.mondeca.com
> Follow us on Twitter :  <at> mondecanews
> 

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion <at> lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
Bernard Vatant | 13 Mar 2012 19:04
Favicon
Gravatar

Re: [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps

Hi Jonas

Great if this is in your roadmap! I was afraid this was about to be yet-another-great-dataset-without-retrievable-vocabulary :)
Please ping LOV when it's done ;-)

Bernard


Le 13 mars 2012 18:57, Jonas Brekle <jonas.brekle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> a écrit :
Am Dienstag, den 13.03.2012, 14:48 +0100 schrieb Bernard Vatant:
> Hi Jonas
>
> Great resource! I'm curious, though, about the vocabulary (predicates)
> used, such as
>
> http://wiktionary.dbpedia.org/terms/hasPoSUsage
> http://wiktionary.dbpedia.org/terms/hasMeaning
>
> The above URIs are not dereferencable, at least not to a usable
> description, formal or not, neither is the namespace
> http://wiktionary.dbpedia.org/terms/
>
> Will this vocabulary be published at some point?
yes, we need to fix this soon.
to be honest this is just a dummy vocabulary until we decide what to
reuse.
>  And did you consider reusing existing predicates from existing
> vocabularies?
> such as http://lexvo.org/ontology# or
> http://linguistics-ontology.org/gold/
> ... or other listed at
> http://labs.mondeca.com/dataset/lov/details/vocabularySpace_Vocabularies.html

yes, some of these.

also the schema might change: the data is very hierarchical and we might
(additionally?) transform it to "word" -> "senses".

> Best regards
>
> Bernard
>
> Le 13 mars 2012 13:22, Jonas Brekle <jonas.brekle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> a écrit :
>         Hi lists,
>
>         we are proud to announce that we now host the data we extract
>         from
>         wiktionary publicly on wiktionary.dbpedia.org.
>
>         We offer Linked Data:
>         http://wiktionary.dbpedia.org/resource/word
>         a SPARQL endpoint: http://wiktionary.dbpedia.org/sparql
>         and N-Triple Dumps: http://downloads.dbpedia.org/wiktionary/
>
>         There is also a wiki explaining some details:
>         http://wiki.dbpedia.org/Wiktionary/
>
>         We currently extracted data from the English and German
>         Wiktionary (28M
>         triples and 3.7M triples), but plan to extend that to at least
>         the
>         biggest 5 wiktionaries within the next weeks, as our approach
>         focuses on
>         extendability. The data for each word is structured
>         hierarchically (as
>         wiktionary is) and contains information about language, part
>         of speech,
>         definitions, translations, synonyms, hyperonyms and hyponyms
>         etc.
>         There might be some quality issues, but we want to release
>         early, so
>         bear with us and report major problems.
>
>         Thanks goes to the wiktionary community which does a great job
>         creating
>         this dataset, and we hope to enable new use cases and
>         consequently
>         promote the contribution to the wiktionary project.
>
>         Regards,
>         Jonas Brekle
>         Department of Computer Science, University of Leipzig
>         Research Group: http://aksw.org
>
>
>         ------------------------------------------------------------------------------
>         Keep Your Developer Skills Current with LearnDevNow!
>         The most comprehensive online learning library for Microsoft
>         developers
>         is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5,
>         CSS3, MVC3,
>         Metro Style Apps, more. Free future releases when you
>         subscribe now!
>         http://p.sf.net/sfu/learndevnow-d2d
>         _______________________________________________
>         Dbpedia-discussion mailing list
>         Dbpedia-discussion-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
>         https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
>
> --
> Bernard Vatant
> Vocabularies & Data Engineering
> Tel :  + 33 (0)9 71 48 84 59
> Skype : bernard.vatant
> Linked Open Vocabularies
>
>
> --------------------------------------------------------
>
> Mondeca
> 3 cité Nollez 75018 Paris, France
> www.mondeca.com
> Follow us on Twitter : <at> mondecanews
>



------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion



--
Bernard Vatant
Vocabularies & Data Engineering
Tel :  + 33 (0)9 71 48 84 59
Skype : bernard.vatant
Linked Open Vocabularies

--------------------------------------------------------
Mondeca                             
3 cité Nollez 75018 Paris, France
Follow us on Twitter : <at> mondecanews

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
Andy Mabbett | 13 Mar 2012 16:05
Picon
Favicon
Gravatar

Re: [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps

On 13 March 2012 12:22, Jonas Brekle <jonas.brekle@...> wrote:

> Hi lists,

CCs trimmed to the list I'm on

> we are proud to announce that we now host the data we extract from
> wiktionary publicly on wiktionary.dbpedia.org.

That URL redirects to:

  http://wiktionary.dbpedia.org/About

which makes no mention of Wiktionary.

--

-- 
Andy Mabbett
 <at> pigsonthewing
http://pigsonthewing.org.uk

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
Jonas Brekle | 13 Mar 2012 18:49
Picon
Gravatar

Re: [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps

Am Dienstag, den 13.03.2012, 15:05 +0000 schrieb Andy Mabbett:
> On 13 March 2012 12:22, Jonas Brekle <jonas.brekle@...> wrote:
> 
> > Hi lists,
> 
> CCs trimmed to the list I'm on
> 
> > we are proud to announce that we now host the data we extract from
> > wiktionary publicly on wiktionary.dbpedia.org.
> 
> That URL redirects to:
> 
>   http://wiktionary.dbpedia.org/About
> 
> which makes no mention of Wiktionary.

yes we got no project website yet. just the data. i will make one soon.

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
Pablo Mendes | 13 Mar 2012 17:00
Picon
Gravatar

Fwd: [open-linguistics] [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps



---------- Forwarded message ----------
From: Jonas Brekle <jonas.brekle-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date: Tue, Mar 13, 2012 at 1:22 PM
Subject: [open-linguistics] [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps
To: dbpedia-discussion <dbpedia-discussion-5NWGOfrQmnfLDRD5uJR0wg@public.gmane.orgeforge.net>, wiktionary-l <wiktionary-l-RusutVdil2icGmH+5r0DM0B+6BGkLq7r@public.gmane.org>, dbpedia-wiktionary <dbpedia-wiktionary-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org>, wikitext-l-RusutVdil2icGmH+5r0DM0B+6BGkLq7r@public.gmane.org, open-linguistics <open-linguistics-6A+mB+4cr9EkOmmEDKq9/Q@public.gmane.orgorg>


Hi lists,

we are proud to announce that we now host the data we extract from
wiktionary publicly on wiktionary.dbpedia.org.

We offer Linked Data: http://wiktionary.dbpedia.org/resource/word
a SPARQL endpoint: http://wiktionary.dbpedia.org/sparql
and N-Triple Dumps: http://downloads.dbpedia.org/wiktionary/

There is also a wiki explaining some details:
http://wiki.dbpedia.org/Wiktionary/

We currently extracted data from the English and German Wiktionary (28M
triples and 3.7M triples), but plan to extend that to at least the
biggest 5 wiktionaries within the next weeks, as our approach focuses on
extendability. The data for each word is structured hierarchically (as
wiktionary is) and contains information about language, part of speech,
definitions, translations, synonyms, hyperonyms and hyponyms etc.
There might be some quality issues, but we want to release early, so
bear with us and report major problems.

Thanks goes to the wiktionary community which does a great job creating
this dataset, and we hope to enable new use cases and consequently
promote the contribution to the wiktionary project.

Regards,
Jonas Brekle
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org


_______________________________________________
open-linguistics mailing list
open-linguistics <at> lists.okfn.org
http://lists.okfn.org/mailman/listinfo/open-linguistics

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
_______________________________________________
Dbp-spotlight-users mailing list
Dbp-spotlight-users@...
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users
Roberto Mirizzi | 13 Mar 2012 17:06
Picon

Re: [ANN] wiktionary.dbpedia.org online - Linked Data, SPARQL and Dumps

Great idea!
What about extracting also the cateogories via 
dcterms:subject/skosk:broader properties as for the "regular" DBpedia?

regards,
roberto

Il 13/03/2012 13:22, Jonas Brekle ha scritto:
> Hi lists,
>
> we are proud to announce that we now host the data we extract from
> wiktionary publicly on wiktionary.dbpedia.org.
>
> We offer Linked Data: http://wiktionary.dbpedia.org/resource/word
> a SPARQL endpoint: http://wiktionary.dbpedia.org/sparql
> and N-Triple Dumps: http://downloads.dbpedia.org/wiktionary/
>
> There is also a wiki explaining some details:
> http://wiki.dbpedia.org/Wiktionary/
>
> We currently extracted data from the English and German Wiktionary (28M
> triples and 3.7M triples), but plan to extend that to at least the
> biggest 5 wiktionaries within the next weeks, as our approach focuses on
> extendability. The data for each word is structured hierarchically (as
> wiktionary is) and contains information about language, part of speech,
> definitions, translations, synonyms, hyperonyms and hyponyms etc.
> There might be some quality issues, but we want to release early, so
> bear with us and report major problems.
>
> Thanks goes to the wiktionary community which does a great job creating
> this dataset, and we hope to enable new use cases and consequently
> promote the contribution to the wiktionary project.
>
> Regards,
> Jonas Brekle
> Department of Computer Science, University of Leipzig
> Research Group: http://aksw.org
>
>
> ------------------------------------------------------------------------------
> Keep Your Developer Skills Current with LearnDevNow!
> The most comprehensive online learning library for Microsoft developers
> is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
> Metro Style Apps, more. Free future releases when you subscribe now!
> http://p.sf.net/sfu/learndevnow-d2d
> _______________________________________________
> Dbpedia-discussion mailing list
> Dbpedia-discussion@...
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

--

-- 
Roberto Mirizzi
http://sisinflab.poliba.it/mirizzi

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d

Gmane