<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://ceur-ws.bitplan.com/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Tim+Holzheim</id>
	<title>BITPlan ceur-ws Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="http://ceur-ws.bitplan.com/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Tim+Holzheim"/>
	<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php/Special:Contributions/Tim_Holzheim"/>
	<updated>2026-04-03T23:23:21Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.5</generator>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-04-06&amp;diff=1789</id>
		<title>Workdocumentation 2023-04-06</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-04-06&amp;diff=1789"/>
		<updated>2023-04-06T09:33:29Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Participants */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2023-03-30|next=|category=Workdocumentation}}&lt;br /&gt;
= Prototype Feedback =&lt;br /&gt;
== Date &amp;amp; Time ==&lt;br /&gt;
2023-04-06 11:00 - 12:00&lt;br /&gt;
== Participants ==&lt;br /&gt;
* [[User:Wf|Wf]] ([[User talk:Wf|talk]]) 10:55, 6 April 2023 (CEST)&lt;br /&gt;
* Daniel Mietchen&lt;br /&gt;
* [[User:Tim Holzheim|Tim Holzheim]] ([[User talk:Tim Holzheim|talk]])&lt;br /&gt;
&lt;br /&gt;
== Agenda ==&lt;br /&gt;
* Prototypes&lt;br /&gt;
* Possible Queries&lt;br /&gt;
&lt;br /&gt;
== Prototypes ==&lt;br /&gt;
The three main prototypes are:&lt;br /&gt;
&lt;br /&gt;
* Single Point of truth server: &lt;br /&gt;
http://ceurspt.wikidata.dbis.rwth-aachen.de/index.html&lt;br /&gt;
&lt;br /&gt;
* Volume browser: http://cvb.bitplan.com/&lt;br /&gt;
&lt;br /&gt;
* Semantic Media Wiki: &lt;br /&gt;
https://ceur-ws.bitplan.com/index.php/Main_Page&lt;br /&gt;
== Possible Queries ==&lt;br /&gt;
* https://cr.bitplan.com/index.php/List_of_Queries&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-27&amp;diff=1349</id>
		<title>Workdocumentation 2023-03-27</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-27&amp;diff=1349"/>
		<updated>2023-03-27T07:29:46Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Individual problems */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
* CEUR-SPT&lt;br /&gt;
* Projectmeeting preperation&lt;br /&gt;
&lt;br /&gt;
= CEUR-SPT =&lt;br /&gt;
= Projectmeeting preparation =&lt;br /&gt;
== Feedback on UI / HTML ==&lt;br /&gt;
* [https://ceur-ws.org/ Original CEUR-WS static page]&lt;br /&gt;
* [http://cvb.bitplan.com Volume Browser]]&lt;br /&gt;
* [http://ceurspt.wikidata.dbis.rwth-aachen.de/Vol-3362.html CEUR-SPT ]&lt;br /&gt;
* [http://ceur-ws.bitplan.com Semantic Mediawiki]]&lt;br /&gt;
=== Ideas ===&lt;br /&gt;
* Help Button&lt;br /&gt;
* Link to Open Source Projects / Issues&lt;br /&gt;
* Link to Ticket System&lt;br /&gt;
&lt;br /&gt;
== Feedback on Structure / Ontology /Link ML ==&lt;br /&gt;
* [https://docs.google.com/spreadsheets/d/1rDyzmiphqrnwugid9-y4B6nRUUBjxusHXvmxzkEeGKg/edit#gid=0 Progress]&lt;br /&gt;
* Series handling e.g. [https://ceur-ws.org/iaoa.html iaoa]&lt;br /&gt;
&lt;br /&gt;
== Feedback on Software ==&lt;br /&gt;
* Tickets - Severity / Milestones&lt;br /&gt;
* Documentation&lt;br /&gt;
* Code Review&lt;br /&gt;
* Tests / CI&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Fixing errors in the original HTML files ==&lt;br /&gt;
&lt;br /&gt;
=== Individual problems ===&lt;br /&gt;
* [https://ceur-ws.org/Vol-2836/ Vol-2836: Proceedings of the Conference on Digital Curation Technologies (Qurator 2021) ]&lt;br /&gt;
** publication date: 2019-03-24&lt;br /&gt;
** submitted: 2021-01-29&lt;br /&gt;
&lt;br /&gt;
=== Systematic problems ===&lt;br /&gt;
* newlines in acronyms ..&lt;br /&gt;
* failed parsing for papers&lt;br /&gt;
* duplicate paper ids&lt;br /&gt;
* paper-1 pointing to complete proceedings&lt;br /&gt;
* trailing blanks in url https://stackoverflow.com/questions/75816859/why-is-a-valid-uri-not-valid-in-linkml/75819565#75819565&lt;br /&gt;
* ftp link instead of http link&lt;br /&gt;
* bibo / dcterms annotations&lt;br /&gt;
* logos&lt;br /&gt;
* emphasized text&lt;br /&gt;
* other HTML pecularities / exotic cases&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-27&amp;diff=1345</id>
		<title>Workdocumentation 2023-03-27</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-27&amp;diff=1345"/>
		<updated>2023-03-27T07:15:43Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Individual problems */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
* CEUR-SPT&lt;br /&gt;
* Projectmeeting preperation&lt;br /&gt;
&lt;br /&gt;
= CEUR-SPT =&lt;br /&gt;
= Projectmeeting preparation =&lt;br /&gt;
== Feedback on UI / HTML ==&lt;br /&gt;
* Volume Browser&lt;br /&gt;
* CEUR-SPT&lt;br /&gt;
* Semantic Mediawiki&lt;br /&gt;
== Feedback on Structure / Ontology /Link ML ==&lt;br /&gt;
* [https://docs.google.com/spreadsheets/d/1rDyzmiphqrnwugid9-y4B6nRUUBjxusHXvmxzkEeGKg/edit#gid=0 Progress]&lt;br /&gt;
&lt;br /&gt;
== Feedback on Software ==&lt;br /&gt;
* Tickets - Severity / Milestones&lt;br /&gt;
* Documentation&lt;br /&gt;
* Code Review&lt;br /&gt;
* Tests / CI&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Fixing errors in the original HTML files ==&lt;br /&gt;
&lt;br /&gt;
=== Individual problems ===&lt;br /&gt;
* [https://ceur-ws.org/Vol-2836/| Vol-2836: Proceedings of the Conference on Digital Curation Technologies (Qurator 2021) ]&lt;br /&gt;
** publication date: 2019-03-24&lt;br /&gt;
** submitted: 2021-01-29&lt;br /&gt;
&lt;br /&gt;
=== Systematic problems ===&lt;br /&gt;
* newlines in acronyms ..&lt;br /&gt;
* failed parsing for papers&lt;br /&gt;
* duplicate paper ids&lt;br /&gt;
* paper-1 pointing to complete proceedings&lt;br /&gt;
* trailing blanks in url https://stackoverflow.com/questions/75816859/why-is-a-valid-uri-not-valid-in-linkml/75819565#75819565&lt;br /&gt;
* ftp link instead of http link&lt;br /&gt;
* bibo / dcterms annotations&lt;br /&gt;
* logos&lt;br /&gt;
* emphasized text&lt;br /&gt;
* other HTML pecularities / exotic cases&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1230</id>
		<title>Editor Extraction and Reconciliation</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1230"/>
		<updated>2023-03-09T09:47:53Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Wikidata Reconciliation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Editor Extraction =&lt;br /&gt;
&lt;br /&gt;
* covered volumes 1-3354&lt;br /&gt;
** optimized for volumes 600+&lt;br /&gt;
* 11764 Editor records&lt;br /&gt;
* for 228 volumes no editors could be extracted&lt;br /&gt;
[[File:volume_editor_distribution.png|400px]]&lt;br /&gt;
&lt;br /&gt;
= Reconciliation =&lt;br /&gt;
== dblp reconciliation ==&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?vol_number &lt;br /&gt;
   (GROUP_CONCAT(DISTINCT ?name; separator=&amp;quot;|&amp;quot;) as ?names) &lt;br /&gt;
   (GROUP_CONCAT(DISTINCT ?dblp_id; separator=&amp;quot;|&amp;quot;) as ?concat_dblp_id)&lt;br /&gt;
WHERE {&lt;br /&gt;
  ?volume dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeries &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeriesVolume ?vol_number;&lt;br /&gt;
    dblp:hasSignature ?editors.&lt;br /&gt;
    ?editors dblp:signatureDblpName ?name ;&lt;br /&gt;
        dblp:signatureCreator ?dblp_id ;&lt;br /&gt;
        dblp:signatureOrdinal ?editor_ordinal ;&lt;br /&gt;
        dblp:signaturePublication ?dblp_publication_id ;&lt;br /&gt;
        a dblp:EditorSignature.&lt;br /&gt;
}&lt;br /&gt;
GROUP BY  ?vol_number &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp with identifiers ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX datacite: &amp;lt;http://purl.org/spar/datacite/&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
PREFIX litre: &amp;lt;http://purl.org/spar/literal/&amp;gt;&lt;br /&gt;
SELECT DISTINCT &lt;br /&gt;
	(group_concat(DISTINCT ?nameVar;separator='|') as ?name) &lt;br /&gt;
	(group_concat(DISTINCT ?homepageVar;separator='|') as ?homepage)&lt;br /&gt;
	(group_concat(DISTINCT ?affiliationVar;separator='|') as ?affiliation)&lt;br /&gt;
	(group_concat(DISTINCT ?dblpVar;separator='|') as ?dblp)&lt;br /&gt;
	(group_concat(DISTINCT ?wikidataVar;separator='|') as ?wikidata)&lt;br /&gt;
	(group_concat(DISTINCT ?orcidVar;separator='|') as ?orcid)&lt;br /&gt;
	(group_concat(DISTINCT ?googleScholarVar;separator='|') as ?googleScholar)&lt;br /&gt;
	(group_concat(DISTINCT ?acmVar;separator='|') as ?acm)&lt;br /&gt;
	(group_concat(DISTINCT ?twitterVar;separator='|') as ?twitter)&lt;br /&gt;
	(group_concat(DISTINCT ?githubVar;separator='|') as ?github)&lt;br /&gt;
	(group_concat(DISTINCT ?viafVar;separator='|') as ?viaf)&lt;br /&gt;
	(group_concat(DISTINCT ?scigraphVar;separator='|') as ?scigraph)&lt;br /&gt;
	(group_concat(DISTINCT ?zbmathVar;separator='|') as ?zbmath)&lt;br /&gt;
	(group_concat(DISTINCT ?researchGateVar;separator='|') as ?researchGate)&lt;br /&gt;
	(group_concat(DISTINCT ?mathGenealogyVar;separator='|') as ?mathGenealogy)&lt;br /&gt;
	(group_concat(DISTINCT ?locVar;separator='|') as ?loc)&lt;br /&gt;
	(group_concat(DISTINCT ?linkedinVar;separator='|') as ?linkedin)&lt;br /&gt;
	(group_concat(DISTINCT ?lattesVar;separator='|') as ?lattes)&lt;br /&gt;
	(group_concat(DISTINCT ?isniVar;separator='|') as ?isni)&lt;br /&gt;
	(group_concat(DISTINCT ?ieeeVar;separator='|') as ?ieee)&lt;br /&gt;
	(group_concat(DISTINCT ?geprisVar;separator='|') as ?gepris)&lt;br /&gt;
	(group_concat(DISTINCT ?gndVar;separator='|') as ?gnd)&lt;br /&gt;
WHERE{&lt;br /&gt;
	?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
		dblp:publishedInSeriesVolume ?volume;&lt;br /&gt;
		dblp:editedBy ?editor.&lt;br /&gt;
	?editor dblp:primaryCreatorName ?nameVar.&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryHomepage ?homepageVar.}&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryAffiliation ?affiliationVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?dblp_blank.&lt;br /&gt;
		?dblp_blank datacite:usesIdentifierScheme datacite:dblp;&lt;br /&gt;
		litre:hasLiteralValue ?dblpVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?wikidata_blank.&lt;br /&gt;
		?wikidata_blank datacite:usesIdentifierScheme datacite:wikidata;&lt;br /&gt;
		litre:hasLiteralValue ?wikidataVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?orcid_blank.&lt;br /&gt;
		?orcid_blank datacite:usesIdentifierScheme datacite:orcid;&lt;br /&gt;
		litre:hasLiteralValue ?orcidVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?googleScholar_blank.&lt;br /&gt;
		?googleScholar_blank datacite:usesIdentifierScheme datacite:google-scholar;&lt;br /&gt;
		litre:hasLiteralValue ?googleScholarVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?acm_blank.&lt;br /&gt;
		?acm_blank datacite:usesIdentifierScheme datacite:acm;&lt;br /&gt;
		litre:hasLiteralValue ?acmVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?twitter_blank.&lt;br /&gt;
		?twitter_blank datacite:usesIdentifierScheme datacite:twitter;&lt;br /&gt;
		litre:hasLiteralValue ?twitterVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?github_blank.&lt;br /&gt;
		?github_blank datacite:usesIdentifierScheme datacite:github;&lt;br /&gt;
		litre:hasLiteralValue ?githubVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?viaf_blank.&lt;br /&gt;
		?viaf_blank datacite:usesIdentifierScheme datacite:viaf;&lt;br /&gt;
		litre:hasLiteralValue ?viafVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?scigraph_blank.&lt;br /&gt;
		?scigraph_blank datacite:usesIdentifierScheme datacite:scigraph;&lt;br /&gt;
		litre:hasLiteralValue ?scigraphVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?zbmath_blank.&lt;br /&gt;
		?zbmath_blank datacite:usesIdentifierScheme datacite:zbmath;&lt;br /&gt;
		litre:hasLiteralValue ?zbmathVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?researchGate_blank.&lt;br /&gt;
		?researchGate_blank datacite:usesIdentifierScheme datacite:research-gate;&lt;br /&gt;
		litre:hasLiteralValue ?researchGateVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?mathGenealogy_blank.&lt;br /&gt;
		?mathGenealogy_blank datacite:usesIdentifierScheme datacite:math-genealogy;&lt;br /&gt;
		litre:hasLiteralValue ?mathGenealogyVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?loc_blank.&lt;br /&gt;
		?loc_blank datacite:usesIdentifierScheme datacite:loc;&lt;br /&gt;
		litre:hasLiteralValue ?locVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?linkedin_blank.&lt;br /&gt;
		?linkedin_blank datacite:usesIdentifierScheme datacite:linkedin;&lt;br /&gt;
		litre:hasLiteralValue ?linkedinVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?lattes_blank.&lt;br /&gt;
		?lattes_blank datacite:usesIdentifierScheme datacite:lattes;&lt;br /&gt;
		litre:hasLiteralValue ?lattesVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?isni_blank.&lt;br /&gt;
		?isni_blank datacite:usesIdentifierScheme datacite:isni;&lt;br /&gt;
		litre:hasLiteralValue ?isniVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?ieee_blank.&lt;br /&gt;
		?ieee_blank datacite:usesIdentifierScheme datacite:ieee;&lt;br /&gt;
		litre:hasLiteralValue ?ieeeVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gepris_blank.&lt;br /&gt;
		?gepris_blank datacite:usesIdentifierScheme datacite:gepris;&lt;br /&gt;
		litre:hasLiteralValue ?geprisVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gnd_blank.&lt;br /&gt;
		?gnd_blank datacite:usesIdentifierScheme datacite:gnd;&lt;br /&gt;
		litre:hasLiteralValue ?gndVar.}&lt;br /&gt;
}&lt;br /&gt;
GROUP BY ?editor&lt;br /&gt;
                &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
=== Comparing Extracted and dblp Editors ===&lt;br /&gt;
* editor by volume comparison&lt;br /&gt;
** 2233 volume the extracted editors match the dblp editors&lt;br /&gt;
** 807 volumes are missing in dblp (editors extracted)&lt;br /&gt;
** 27 volumes more editors were extracted than in dblp&lt;br /&gt;
** 387 volumes dblp has more editors than we could extract&lt;br /&gt;
* 9321 out of 11764 editor records can be reconciled&lt;br /&gt;
** 79.23%&lt;br /&gt;
&lt;br /&gt;
== Wikidata Reconciliation ==&lt;br /&gt;
Using the ids queried from dblp to find the corresponding wikidata entry.&lt;br /&gt;
&lt;br /&gt;
Current strategy:&lt;br /&gt;
;Input: List of different identifiers that are known about a editor&lt;br /&gt;
;Output: SPARQL query&lt;br /&gt;
&lt;br /&gt;
Example:&lt;br /&gt;
*Input:&lt;br /&gt;
** homepage: http://www.stefandecker.org&lt;br /&gt;
** gnd id: 173443443&lt;br /&gt;
** dblp author id: d/StefanDecker&lt;br /&gt;
* Output:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX wdt: &amp;lt;http://www.wikidata.org/prop/direct/&amp;gt;&lt;br /&gt;
PREFIX wikibase: &amp;lt;http://wikiba.se/ontology#&amp;gt;&lt;br /&gt;
PREFIX rdfs: &amp;lt;http://www.w3.org/2000/01/rdf-schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?person ?personLabel&lt;br /&gt;
WHERE&lt;br /&gt;
{&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P856 &amp;lt;http://www.stefandecker.org&amp;gt;.} }&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P227 &amp;quot;173443443&amp;quot;.} } # gnd&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P2456 &amp;quot;d/StefanDecker&amp;quot;.} } # dblp&lt;br /&gt;
  ?person rdfs:label ?personLabel. FILTER(lang(?personLabel)=&amp;quot;en&amp;quot;)&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Depending on the available identifiers the query is adjusted accordingly by adding the corresponding OPTIONAL clauses.&lt;br /&gt;
&lt;br /&gt;
Running these queries for all 4942 editors known by dblp we get:&lt;br /&gt;
* 1467 editors were found in wikidata&lt;br /&gt;
* 62 editor records in dblp have a conflict with wikidata&lt;br /&gt;
* 3413 dblp editor records were not found in wikidata&lt;br /&gt;
&lt;br /&gt;
The figure below shows the distribution of the available identifiers depending of the three categories identified, conflict, unkown&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[File:editors_wikidata_reconciliation.png|800px]]&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=File:Editors_wikidata_reconciliation.png&amp;diff=1229</id>
		<title>File:Editors wikidata reconciliation.png</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=File:Editors_wikidata_reconciliation.png&amp;diff=1229"/>
		<updated>2023-03-09T09:45:18Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: Tim Holzheim uploaded a new version of File:Editors wikidata reconciliation.png&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;File uploaded with MsUpload&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1228</id>
		<title>Editor Extraction and Reconciliation</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1228"/>
		<updated>2023-03-09T09:44:30Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Wikidata Reconciliation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Editor Extraction =&lt;br /&gt;
&lt;br /&gt;
* covered volumes 1-3354&lt;br /&gt;
** optimized for volumes 600+&lt;br /&gt;
* 11764 Editor records&lt;br /&gt;
* for 228 volumes no editors could be extracted&lt;br /&gt;
[[File:volume_editor_distribution.png|400px]]&lt;br /&gt;
&lt;br /&gt;
= Reconciliation =&lt;br /&gt;
== dblp reconciliation ==&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?vol_number &lt;br /&gt;
   (GROUP_CONCAT(DISTINCT ?name; separator=&amp;quot;|&amp;quot;) as ?names) &lt;br /&gt;
   (GROUP_CONCAT(DISTINCT ?dblp_id; separator=&amp;quot;|&amp;quot;) as ?concat_dblp_id)&lt;br /&gt;
WHERE {&lt;br /&gt;
  ?volume dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeries &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeriesVolume ?vol_number;&lt;br /&gt;
    dblp:hasSignature ?editors.&lt;br /&gt;
    ?editors dblp:signatureDblpName ?name ;&lt;br /&gt;
        dblp:signatureCreator ?dblp_id ;&lt;br /&gt;
        dblp:signatureOrdinal ?editor_ordinal ;&lt;br /&gt;
        dblp:signaturePublication ?dblp_publication_id ;&lt;br /&gt;
        a dblp:EditorSignature.&lt;br /&gt;
}&lt;br /&gt;
GROUP BY  ?vol_number &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp with identifiers ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX datacite: &amp;lt;http://purl.org/spar/datacite/&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
PREFIX litre: &amp;lt;http://purl.org/spar/literal/&amp;gt;&lt;br /&gt;
SELECT DISTINCT &lt;br /&gt;
	(group_concat(DISTINCT ?nameVar;separator='|') as ?name) &lt;br /&gt;
	(group_concat(DISTINCT ?homepageVar;separator='|') as ?homepage)&lt;br /&gt;
	(group_concat(DISTINCT ?affiliationVar;separator='|') as ?affiliation)&lt;br /&gt;
	(group_concat(DISTINCT ?dblpVar;separator='|') as ?dblp)&lt;br /&gt;
	(group_concat(DISTINCT ?wikidataVar;separator='|') as ?wikidata)&lt;br /&gt;
	(group_concat(DISTINCT ?orcidVar;separator='|') as ?orcid)&lt;br /&gt;
	(group_concat(DISTINCT ?googleScholarVar;separator='|') as ?googleScholar)&lt;br /&gt;
	(group_concat(DISTINCT ?acmVar;separator='|') as ?acm)&lt;br /&gt;
	(group_concat(DISTINCT ?twitterVar;separator='|') as ?twitter)&lt;br /&gt;
	(group_concat(DISTINCT ?githubVar;separator='|') as ?github)&lt;br /&gt;
	(group_concat(DISTINCT ?viafVar;separator='|') as ?viaf)&lt;br /&gt;
	(group_concat(DISTINCT ?scigraphVar;separator='|') as ?scigraph)&lt;br /&gt;
	(group_concat(DISTINCT ?zbmathVar;separator='|') as ?zbmath)&lt;br /&gt;
	(group_concat(DISTINCT ?researchGateVar;separator='|') as ?researchGate)&lt;br /&gt;
	(group_concat(DISTINCT ?mathGenealogyVar;separator='|') as ?mathGenealogy)&lt;br /&gt;
	(group_concat(DISTINCT ?locVar;separator='|') as ?loc)&lt;br /&gt;
	(group_concat(DISTINCT ?linkedinVar;separator='|') as ?linkedin)&lt;br /&gt;
	(group_concat(DISTINCT ?lattesVar;separator='|') as ?lattes)&lt;br /&gt;
	(group_concat(DISTINCT ?isniVar;separator='|') as ?isni)&lt;br /&gt;
	(group_concat(DISTINCT ?ieeeVar;separator='|') as ?ieee)&lt;br /&gt;
	(group_concat(DISTINCT ?geprisVar;separator='|') as ?gepris)&lt;br /&gt;
	(group_concat(DISTINCT ?gndVar;separator='|') as ?gnd)&lt;br /&gt;
WHERE{&lt;br /&gt;
	?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
		dblp:publishedInSeriesVolume ?volume;&lt;br /&gt;
		dblp:editedBy ?editor.&lt;br /&gt;
	?editor dblp:primaryCreatorName ?nameVar.&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryHomepage ?homepageVar.}&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryAffiliation ?affiliationVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?dblp_blank.&lt;br /&gt;
		?dblp_blank datacite:usesIdentifierScheme datacite:dblp;&lt;br /&gt;
		litre:hasLiteralValue ?dblpVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?wikidata_blank.&lt;br /&gt;
		?wikidata_blank datacite:usesIdentifierScheme datacite:wikidata;&lt;br /&gt;
		litre:hasLiteralValue ?wikidataVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?orcid_blank.&lt;br /&gt;
		?orcid_blank datacite:usesIdentifierScheme datacite:orcid;&lt;br /&gt;
		litre:hasLiteralValue ?orcidVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?googleScholar_blank.&lt;br /&gt;
		?googleScholar_blank datacite:usesIdentifierScheme datacite:google-scholar;&lt;br /&gt;
		litre:hasLiteralValue ?googleScholarVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?acm_blank.&lt;br /&gt;
		?acm_blank datacite:usesIdentifierScheme datacite:acm;&lt;br /&gt;
		litre:hasLiteralValue ?acmVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?twitter_blank.&lt;br /&gt;
		?twitter_blank datacite:usesIdentifierScheme datacite:twitter;&lt;br /&gt;
		litre:hasLiteralValue ?twitterVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?github_blank.&lt;br /&gt;
		?github_blank datacite:usesIdentifierScheme datacite:github;&lt;br /&gt;
		litre:hasLiteralValue ?githubVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?viaf_blank.&lt;br /&gt;
		?viaf_blank datacite:usesIdentifierScheme datacite:viaf;&lt;br /&gt;
		litre:hasLiteralValue ?viafVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?scigraph_blank.&lt;br /&gt;
		?scigraph_blank datacite:usesIdentifierScheme datacite:scigraph;&lt;br /&gt;
		litre:hasLiteralValue ?scigraphVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?zbmath_blank.&lt;br /&gt;
		?zbmath_blank datacite:usesIdentifierScheme datacite:zbmath;&lt;br /&gt;
		litre:hasLiteralValue ?zbmathVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?researchGate_blank.&lt;br /&gt;
		?researchGate_blank datacite:usesIdentifierScheme datacite:research-gate;&lt;br /&gt;
		litre:hasLiteralValue ?researchGateVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?mathGenealogy_blank.&lt;br /&gt;
		?mathGenealogy_blank datacite:usesIdentifierScheme datacite:math-genealogy;&lt;br /&gt;
		litre:hasLiteralValue ?mathGenealogyVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?loc_blank.&lt;br /&gt;
		?loc_blank datacite:usesIdentifierScheme datacite:loc;&lt;br /&gt;
		litre:hasLiteralValue ?locVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?linkedin_blank.&lt;br /&gt;
		?linkedin_blank datacite:usesIdentifierScheme datacite:linkedin;&lt;br /&gt;
		litre:hasLiteralValue ?linkedinVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?lattes_blank.&lt;br /&gt;
		?lattes_blank datacite:usesIdentifierScheme datacite:lattes;&lt;br /&gt;
		litre:hasLiteralValue ?lattesVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?isni_blank.&lt;br /&gt;
		?isni_blank datacite:usesIdentifierScheme datacite:isni;&lt;br /&gt;
		litre:hasLiteralValue ?isniVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?ieee_blank.&lt;br /&gt;
		?ieee_blank datacite:usesIdentifierScheme datacite:ieee;&lt;br /&gt;
		litre:hasLiteralValue ?ieeeVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gepris_blank.&lt;br /&gt;
		?gepris_blank datacite:usesIdentifierScheme datacite:gepris;&lt;br /&gt;
		litre:hasLiteralValue ?geprisVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gnd_blank.&lt;br /&gt;
		?gnd_blank datacite:usesIdentifierScheme datacite:gnd;&lt;br /&gt;
		litre:hasLiteralValue ?gndVar.}&lt;br /&gt;
}&lt;br /&gt;
GROUP BY ?editor&lt;br /&gt;
                &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
=== Comparing Extracted and dblp Editors ===&lt;br /&gt;
* editor by volume comparison&lt;br /&gt;
** 2233 volume the extracted editors match the dblp editors&lt;br /&gt;
** 807 volumes are missing in dblp (editors extracted)&lt;br /&gt;
** 27 volumes more editors were extracted than in dblp&lt;br /&gt;
** 387 volumes dblp has more editors than we could extract&lt;br /&gt;
* 9321 out of 11764 editor records can be reconciled&lt;br /&gt;
** 79.23%&lt;br /&gt;
&lt;br /&gt;
== Wikidata Reconciliation ==&lt;br /&gt;
Using the ids queried from dblp to find the corresponding wikidata entry.&lt;br /&gt;
&lt;br /&gt;
Current strategy:&lt;br /&gt;
;Input: List of different identifiers that are known about a editor&lt;br /&gt;
;Output: SPARQL query&lt;br /&gt;
&lt;br /&gt;
Example:&lt;br /&gt;
*Input:&lt;br /&gt;
** homepage: http://www.stefandecker.org&lt;br /&gt;
** gnd id: 173443443&lt;br /&gt;
** dblp author id: d/StefanDecker&lt;br /&gt;
* Output:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX wdt: &amp;lt;http://www.wikidata.org/prop/direct/&amp;gt;&lt;br /&gt;
PREFIX wikibase: &amp;lt;http://wikiba.se/ontology#&amp;gt;&lt;br /&gt;
PREFIX rdfs: &amp;lt;http://www.w3.org/2000/01/rdf-schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?person ?personLabel&lt;br /&gt;
WHERE&lt;br /&gt;
{&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P856 &amp;lt;http://www.stefandecker.org&amp;gt;.} }&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P227 &amp;quot;173443443&amp;quot;.} } # gnd&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P2456 &amp;quot;d/StefanDecker&amp;quot;.} } # dblp&lt;br /&gt;
  ?person rdfs:label ?personLabel. FILTER(lang(?personLabel)=&amp;quot;en&amp;quot;)&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Running these queries for all 4942 editors known by dblp we get:&lt;br /&gt;
* 1467 editors were found in wikidata&lt;br /&gt;
* 62 editor records in dblp have a conflict with wikidata&lt;br /&gt;
* 3413 dblp editor records were not found in wikidata&lt;br /&gt;
&lt;br /&gt;
The figure below shows the distribution of the available identifiers depending of the three categories identified, conflict, unkown&lt;br /&gt;
[[File:editors_wikidata_reconciliation.png|800px]]&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=File:Editors_wikidata_reconciliation.png&amp;diff=1227</id>
		<title>File:Editors wikidata reconciliation.png</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=File:Editors_wikidata_reconciliation.png&amp;diff=1227"/>
		<updated>2023-03-09T09:33:39Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: File uploaded with MsUpload&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;File uploaded with MsUpload&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-09&amp;diff=1226</id>
		<title>Workdocumentation 2023-03-09</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-09&amp;diff=1226"/>
		<updated>2023-03-09T09:17:35Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Paper Template  Example */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Add properties to CeurwsSchema ==&lt;br /&gt;
Add properties url,urn,k10plus,dblp to CeurwsSchema&lt;br /&gt;
* https://ceur-ws.bitplan.com/index.php?title=CeurwsSchema&amp;amp;type=revision&amp;amp;diff=1126&amp;amp;oldid=968&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
py-yprinciple-gen % ypgen --wikiId ceur-ws --context CeurwsSchema --topics Volume --genViaMwApi --noDry&lt;br /&gt;
&lt;br /&gt;
generating Category for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1133&amp;amp;oldid=931(498)&lt;br /&gt;
generating Concept for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1134&amp;amp;oldid=932(223)&lt;br /&gt;
generating Form for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1135&amp;amp;oldid=933(487)&lt;br /&gt;
generating Help for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1136&amp;amp;oldid=934(223)&lt;br /&gt;
generating List of for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=List of Volumes&amp;amp;type=revision&amp;amp;diff=1137&amp;amp;oldid=1008(350)&lt;br /&gt;
generating Template for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1138&amp;amp;oldid=936(1582)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume number&amp;amp;type=revision&amp;amp;diff=924&amp;amp;oldid=924(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume acronym&amp;amp;type=revision&amp;amp;diff=938&amp;amp;oldid=938(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume wikidataid&amp;amp;type=revision&amp;amp;diff=939&amp;amp;oldid=939(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume title&amp;amp;type=revision&amp;amp;diff=940&amp;amp;oldid=940(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume description&amp;amp;type=revision&amp;amp;diff=937&amp;amp;oldid=937(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume url&amp;amp;type=revision&amp;amp;diff=1139&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume date&amp;amp;type=revision&amp;amp;diff=1140&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume dblp&amp;amp;type=revision&amp;amp;diff=1141&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume k10plus&amp;amp;type=revision&amp;amp;diff=1142&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume urn&amp;amp;type=revision&amp;amp;diff=1143&amp;amp;oldid=0(0)&lt;br /&gt;
generating Python for Volume via Mediawiki Api...&lt;br /&gt;
diff: (1764)&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== Sync new fields with wikidata ==&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
 smwsync -u -t ceur-ws --context CeurwsSchema --topic Volume -p url urn k10plus dblp  --progress&lt;br /&gt;
updating cache for CeurwsSchema:Volume from wiki ceur-ws ...&lt;br /&gt;
stored 8 Volume items to /Users/wf/.smwsync/ceur-ws/CeurwsSchema/Volume.json&lt;br /&gt;
8 Volume items to sync ...&lt;br /&gt;
Vol-3346→dlbp: 100%|████████████████████████████| 32/32 [00:40&amp;lt;00:00,  1.26s/it]&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
 smwsync -u -t ceur-ws --context CeurwsSchema --topic Volume -pkv  Q113542713 -p desc title&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Paper Template  Example=&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; style=&amp;quot;&amp;quot;&lt;br /&gt;
! colspan=&amp;quot;2&amp;quot;| Paper&lt;br /&gt;
|-&lt;br /&gt;
![[Title::@@@]]&lt;br /&gt;
| {{#show: {{PAGENAME}}| ?title}}&lt;br /&gt;
|-&lt;br /&gt;
![[Author::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?author}}&lt;br /&gt;
|-&lt;br /&gt;
![[DOI::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?doi}}&lt;br /&gt;
|-&lt;br /&gt;
![[Publication date::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?publication_date}}&lt;br /&gt;
|-&lt;br /&gt;
![[Wikidata id::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?wikidata_id}}&lt;br /&gt;
|-&lt;br /&gt;
![[Event::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?event}}&lt;br /&gt;
|} &lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
* https://www.semantic-mediawiki.org/wiki/Help:Property_links&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-09&amp;diff=1223</id>
		<title>Workdocumentation 2023-03-09</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2023-03-09&amp;diff=1223"/>
		<updated>2023-03-09T09:15:58Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Add properties to CeurwsSchema ==&lt;br /&gt;
Add properties url,urn,k10plus,dblp to CeurwsSchema&lt;br /&gt;
* https://ceur-ws.bitplan.com/index.php?title=CeurwsSchema&amp;amp;type=revision&amp;amp;diff=1126&amp;amp;oldid=968&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
py-yprinciple-gen % ypgen --wikiId ceur-ws --context CeurwsSchema --topics Volume --genViaMwApi --noDry&lt;br /&gt;
&lt;br /&gt;
generating Category for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1133&amp;amp;oldid=931(498)&lt;br /&gt;
generating Concept for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1134&amp;amp;oldid=932(223)&lt;br /&gt;
generating Form for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1135&amp;amp;oldid=933(487)&lt;br /&gt;
generating Help for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1136&amp;amp;oldid=934(223)&lt;br /&gt;
generating List of for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=List of Volumes&amp;amp;type=revision&amp;amp;diff=1137&amp;amp;oldid=1008(350)&lt;br /&gt;
generating Template for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume&amp;amp;type=revision&amp;amp;diff=1138&amp;amp;oldid=936(1582)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume number&amp;amp;type=revision&amp;amp;diff=924&amp;amp;oldid=924(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume acronym&amp;amp;type=revision&amp;amp;diff=938&amp;amp;oldid=938(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume wikidataid&amp;amp;type=revision&amp;amp;diff=939&amp;amp;oldid=939(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume title&amp;amp;type=revision&amp;amp;diff=940&amp;amp;oldid=940(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume description&amp;amp;type=revision&amp;amp;diff=937&amp;amp;oldid=937(80)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume url&amp;amp;type=revision&amp;amp;diff=1139&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume date&amp;amp;type=revision&amp;amp;diff=1140&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume dblp&amp;amp;type=revision&amp;amp;diff=1141&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume k10plus&amp;amp;type=revision&amp;amp;diff=1142&amp;amp;oldid=0(0)&lt;br /&gt;
generating Property for Volume via Mediawiki Api...&lt;br /&gt;
diff: https://ceur-ws.bitplan.com/index.php?title=Volume urn&amp;amp;type=revision&amp;amp;diff=1143&amp;amp;oldid=0(0)&lt;br /&gt;
generating Python for Volume via Mediawiki Api...&lt;br /&gt;
diff: (1764)&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== Sync new fields with wikidata ==&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
 smwsync -u -t ceur-ws --context CeurwsSchema --topic Volume -p url urn k10plus dblp  --progress&lt;br /&gt;
updating cache for CeurwsSchema:Volume from wiki ceur-ws ...&lt;br /&gt;
stored 8 Volume items to /Users/wf/.smwsync/ceur-ws/CeurwsSchema/Volume.json&lt;br /&gt;
8 Volume items to sync ...&lt;br /&gt;
Vol-3346→dlbp: 100%|████████████████████████████| 32/32 [00:40&amp;lt;00:00,  1.26s/it]&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
 smwsync -u -t ceur-ws --context CeurwsSchema --topic Volume -pkv  Q113542713 -p desc title&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Paper Template  Example=&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{|class=&amp;quot;wikitable&amp;quot; style=&amp;quot;&amp;quot;&lt;br /&gt;
! colspan=&amp;quot;2&amp;quot;| Paper&lt;br /&gt;
|-&lt;br /&gt;
![[Title::@@@]]&lt;br /&gt;
| {{#show: {{PAGENAME}}| ?title}}&lt;br /&gt;
|-&lt;br /&gt;
![[Author::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?author}}&lt;br /&gt;
|-&lt;br /&gt;
![[DOI::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?doi}}&lt;br /&gt;
|-&lt;br /&gt;
![[Publication date::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?publication_date}}&lt;br /&gt;
|-&lt;br /&gt;
![[Wikidata id::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?wikidata_id}}&lt;br /&gt;
|-&lt;br /&gt;
![[Event::@@@]]&lt;br /&gt;
|{{#show: {{PAGENAME}}| ?event}}&lt;br /&gt;
|} &lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1211</id>
		<title>Editor Extraction and Reconciliation</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1211"/>
		<updated>2023-03-09T09:00:42Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Volume Editors of CEUR-WS in dblp */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Editor Extraction =&lt;br /&gt;
&lt;br /&gt;
* covered volumes 1-3354&lt;br /&gt;
** optimized for volumes 600+&lt;br /&gt;
* 11764 Editor records&lt;br /&gt;
* for 228 volumes no editors could be extracted&lt;br /&gt;
[[File:volume_editor_distribution.png|400px]]&lt;br /&gt;
&lt;br /&gt;
= Reconciliation =&lt;br /&gt;
== dblp reconciliation ==&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?vol_number &lt;br /&gt;
   (GROUP_CONCAT(DISTINCT ?name; separator=&amp;quot;|&amp;quot;) as ?names) &lt;br /&gt;
   (GROUP_CONCAT(DISTINCT ?dblp_id; separator=&amp;quot;|&amp;quot;) as ?concat_dblp_id)&lt;br /&gt;
WHERE {&lt;br /&gt;
  ?volume dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeries &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeriesVolume ?vol_number;&lt;br /&gt;
    dblp:hasSignature ?editors.&lt;br /&gt;
    ?editors dblp:signatureDblpName ?name ;&lt;br /&gt;
        dblp:signatureCreator ?dblp_id ;&lt;br /&gt;
        dblp:signatureOrdinal ?editor_ordinal ;&lt;br /&gt;
        dblp:signaturePublication ?dblp_publication_id ;&lt;br /&gt;
        a dblp:EditorSignature.&lt;br /&gt;
}&lt;br /&gt;
GROUP BY  ?vol_number &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp with identifiers ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX datacite: &amp;lt;http://purl.org/spar/datacite/&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
PREFIX litre: &amp;lt;http://purl.org/spar/literal/&amp;gt;&lt;br /&gt;
SELECT DISTINCT &lt;br /&gt;
	(group_concat(DISTINCT ?nameVar;separator='|') as ?name) &lt;br /&gt;
	(group_concat(DISTINCT ?homepageVar;separator='|') as ?homepage)&lt;br /&gt;
	(group_concat(DISTINCT ?affiliationVar;separator='|') as ?affiliation)&lt;br /&gt;
	(group_concat(DISTINCT ?dblpVar;separator='|') as ?dblp)&lt;br /&gt;
	(group_concat(DISTINCT ?wikidataVar;separator='|') as ?wikidata)&lt;br /&gt;
	(group_concat(DISTINCT ?orcidVar;separator='|') as ?orcid)&lt;br /&gt;
	(group_concat(DISTINCT ?googleScholarVar;separator='|') as ?googleScholar)&lt;br /&gt;
	(group_concat(DISTINCT ?acmVar;separator='|') as ?acm)&lt;br /&gt;
	(group_concat(DISTINCT ?twitterVar;separator='|') as ?twitter)&lt;br /&gt;
	(group_concat(DISTINCT ?githubVar;separator='|') as ?github)&lt;br /&gt;
	(group_concat(DISTINCT ?viafVar;separator='|') as ?viaf)&lt;br /&gt;
	(group_concat(DISTINCT ?scigraphVar;separator='|') as ?scigraph)&lt;br /&gt;
	(group_concat(DISTINCT ?zbmathVar;separator='|') as ?zbmath)&lt;br /&gt;
	(group_concat(DISTINCT ?researchGateVar;separator='|') as ?researchGate)&lt;br /&gt;
	(group_concat(DISTINCT ?mathGenealogyVar;separator='|') as ?mathGenealogy)&lt;br /&gt;
	(group_concat(DISTINCT ?locVar;separator='|') as ?loc)&lt;br /&gt;
	(group_concat(DISTINCT ?linkedinVar;separator='|') as ?linkedin)&lt;br /&gt;
	(group_concat(DISTINCT ?lattesVar;separator='|') as ?lattes)&lt;br /&gt;
	(group_concat(DISTINCT ?isniVar;separator='|') as ?isni)&lt;br /&gt;
	(group_concat(DISTINCT ?ieeeVar;separator='|') as ?ieee)&lt;br /&gt;
	(group_concat(DISTINCT ?geprisVar;separator='|') as ?gepris)&lt;br /&gt;
	(group_concat(DISTINCT ?gndVar;separator='|') as ?gnd)&lt;br /&gt;
WHERE{&lt;br /&gt;
	?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
		dblp:publishedInSeriesVolume ?volume;&lt;br /&gt;
		dblp:editedBy ?editor.&lt;br /&gt;
	?editor dblp:primaryCreatorName ?nameVar.&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryHomepage ?homepageVar.}&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryAffiliation ?affiliationVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?dblp_blank.&lt;br /&gt;
		?dblp_blank datacite:usesIdentifierScheme datacite:dblp;&lt;br /&gt;
		litre:hasLiteralValue ?dblpVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?wikidata_blank.&lt;br /&gt;
		?wikidata_blank datacite:usesIdentifierScheme datacite:wikidata;&lt;br /&gt;
		litre:hasLiteralValue ?wikidataVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?orcid_blank.&lt;br /&gt;
		?orcid_blank datacite:usesIdentifierScheme datacite:orcid;&lt;br /&gt;
		litre:hasLiteralValue ?orcidVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?googleScholar_blank.&lt;br /&gt;
		?googleScholar_blank datacite:usesIdentifierScheme datacite:google-scholar;&lt;br /&gt;
		litre:hasLiteralValue ?googleScholarVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?acm_blank.&lt;br /&gt;
		?acm_blank datacite:usesIdentifierScheme datacite:acm;&lt;br /&gt;
		litre:hasLiteralValue ?acmVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?twitter_blank.&lt;br /&gt;
		?twitter_blank datacite:usesIdentifierScheme datacite:twitter;&lt;br /&gt;
		litre:hasLiteralValue ?twitterVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?github_blank.&lt;br /&gt;
		?github_blank datacite:usesIdentifierScheme datacite:github;&lt;br /&gt;
		litre:hasLiteralValue ?githubVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?viaf_blank.&lt;br /&gt;
		?viaf_blank datacite:usesIdentifierScheme datacite:viaf;&lt;br /&gt;
		litre:hasLiteralValue ?viafVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?scigraph_blank.&lt;br /&gt;
		?scigraph_blank datacite:usesIdentifierScheme datacite:scigraph;&lt;br /&gt;
		litre:hasLiteralValue ?scigraphVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?zbmath_blank.&lt;br /&gt;
		?zbmath_blank datacite:usesIdentifierScheme datacite:zbmath;&lt;br /&gt;
		litre:hasLiteralValue ?zbmathVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?researchGate_blank.&lt;br /&gt;
		?researchGate_blank datacite:usesIdentifierScheme datacite:research-gate;&lt;br /&gt;
		litre:hasLiteralValue ?researchGateVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?mathGenealogy_blank.&lt;br /&gt;
		?mathGenealogy_blank datacite:usesIdentifierScheme datacite:math-genealogy;&lt;br /&gt;
		litre:hasLiteralValue ?mathGenealogyVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?loc_blank.&lt;br /&gt;
		?loc_blank datacite:usesIdentifierScheme datacite:loc;&lt;br /&gt;
		litre:hasLiteralValue ?locVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?linkedin_blank.&lt;br /&gt;
		?linkedin_blank datacite:usesIdentifierScheme datacite:linkedin;&lt;br /&gt;
		litre:hasLiteralValue ?linkedinVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?lattes_blank.&lt;br /&gt;
		?lattes_blank datacite:usesIdentifierScheme datacite:lattes;&lt;br /&gt;
		litre:hasLiteralValue ?lattesVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?isni_blank.&lt;br /&gt;
		?isni_blank datacite:usesIdentifierScheme datacite:isni;&lt;br /&gt;
		litre:hasLiteralValue ?isniVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?ieee_blank.&lt;br /&gt;
		?ieee_blank datacite:usesIdentifierScheme datacite:ieee;&lt;br /&gt;
		litre:hasLiteralValue ?ieeeVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gepris_blank.&lt;br /&gt;
		?gepris_blank datacite:usesIdentifierScheme datacite:gepris;&lt;br /&gt;
		litre:hasLiteralValue ?geprisVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gnd_blank.&lt;br /&gt;
		?gnd_blank datacite:usesIdentifierScheme datacite:gnd;&lt;br /&gt;
		litre:hasLiteralValue ?gndVar.}&lt;br /&gt;
}&lt;br /&gt;
GROUP BY ?editor&lt;br /&gt;
                &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
=== Comparing Extracted and dblp Editors ===&lt;br /&gt;
* editor by volume comparison&lt;br /&gt;
** 2233 volume the extracted editors match the dblp editors&lt;br /&gt;
** 807 volumes are missing in dblp (editors extracted)&lt;br /&gt;
** 27 volumes more editors were extracted than in dblp&lt;br /&gt;
** 387 volumes dblp has more editors than we could extract&lt;br /&gt;
* 9321 out of 11764 editor records can be reconciled&lt;br /&gt;
** 79.23%&lt;br /&gt;
&lt;br /&gt;
== Wikidata Reconciliation ==&lt;br /&gt;
Using the ids queried from dblp to find the corresponding wikidata entry.&lt;br /&gt;
&lt;br /&gt;
Current strategy:&lt;br /&gt;
;Input: List of different identifiers that are known about a editor&lt;br /&gt;
;Output: SPARQL query&lt;br /&gt;
&lt;br /&gt;
Example:&lt;br /&gt;
*Input:&lt;br /&gt;
** homepage: http://www.stefandecker.org&lt;br /&gt;
** gnd id: 173443443&lt;br /&gt;
** dblp author id: d/StefanDecker&lt;br /&gt;
* Output:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX wdt: &amp;lt;http://www.wikidata.org/prop/direct/&amp;gt;&lt;br /&gt;
PREFIX wikibase: &amp;lt;http://wikiba.se/ontology#&amp;gt;&lt;br /&gt;
PREFIX rdfs: &amp;lt;http://www.w3.org/2000/01/rdf-schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?person ?personLabel&lt;br /&gt;
WHERE&lt;br /&gt;
{&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P856 &amp;lt;http://www.stefandecker.org&amp;gt;.} }&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P227 &amp;quot;173443443&amp;quot;.} } # gnd&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P2456 &amp;quot;d/StefanDecker&amp;quot;.} } # dblp&lt;br /&gt;
  ?person rdfs:label ?personLabel. FILTER(lang(?personLabel)=&amp;quot;en&amp;quot;)&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Running these queries for all 4942 editors known by dblp we get:&lt;br /&gt;
* Queries currently running...[[User:Tim Holzheim|Tim Holzheim]] ([[User talk:Tim Holzheim|talk]]) 09:49, 9 March 2023 (CET)&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1210</id>
		<title>Editor Extraction and Reconciliation</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1210"/>
		<updated>2023-03-09T08:57:35Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Comparing Extracted and dblp Editors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Editor Extraction =&lt;br /&gt;
&lt;br /&gt;
* covered volumes 1-3354&lt;br /&gt;
** optimized for volumes 600+&lt;br /&gt;
* 11764 Editor records&lt;br /&gt;
* for 228 volumes no editors could be extracted&lt;br /&gt;
[[File:volume_editor_distribution.png|400px]]&lt;br /&gt;
&lt;br /&gt;
= Reconciliation =&lt;br /&gt;
== dblp reconciliation ==&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?vol_number (GROUP_CONCAT(?name; separator=&amp;quot;|&amp;quot;) as ?names) (GROUP_CONCAT(?dblp_id; separator=&amp;quot;|&amp;quot;) as ?concat_dblp_id)&lt;br /&gt;
WHERE {&lt;br /&gt;
  ?volume dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeries &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeriesVolume ?vol_number;&lt;br /&gt;
    dblp:hasSignature ?editors.&lt;br /&gt;
    ?editors dblp:signatureDblpName ?name ;&lt;br /&gt;
        dblp:signatureCreator ?dblp_id ;&lt;br /&gt;
        dblp:signatureOrdinal ?editor_ordinal ;&lt;br /&gt;
        dblp:signaturePublication ?dblp_publication_id ;&lt;br /&gt;
        a dblp:EditorSignature.&lt;br /&gt;
}&lt;br /&gt;
GROUP BY  ?vol_number &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp with identifiers ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX datacite: &amp;lt;http://purl.org/spar/datacite/&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
PREFIX litre: &amp;lt;http://purl.org/spar/literal/&amp;gt;&lt;br /&gt;
SELECT DISTINCT &lt;br /&gt;
	(group_concat(DISTINCT ?nameVar;separator='|') as ?name) &lt;br /&gt;
	(group_concat(DISTINCT ?homepageVar;separator='|') as ?homepage)&lt;br /&gt;
	(group_concat(DISTINCT ?affiliationVar;separator='|') as ?affiliation)&lt;br /&gt;
	(group_concat(DISTINCT ?dblpVar;separator='|') as ?dblp)&lt;br /&gt;
	(group_concat(DISTINCT ?wikidataVar;separator='|') as ?wikidata)&lt;br /&gt;
	(group_concat(DISTINCT ?orcidVar;separator='|') as ?orcid)&lt;br /&gt;
	(group_concat(DISTINCT ?googleScholarVar;separator='|') as ?googleScholar)&lt;br /&gt;
	(group_concat(DISTINCT ?acmVar;separator='|') as ?acm)&lt;br /&gt;
	(group_concat(DISTINCT ?twitterVar;separator='|') as ?twitter)&lt;br /&gt;
	(group_concat(DISTINCT ?githubVar;separator='|') as ?github)&lt;br /&gt;
	(group_concat(DISTINCT ?viafVar;separator='|') as ?viaf)&lt;br /&gt;
	(group_concat(DISTINCT ?scigraphVar;separator='|') as ?scigraph)&lt;br /&gt;
	(group_concat(DISTINCT ?zbmathVar;separator='|') as ?zbmath)&lt;br /&gt;
	(group_concat(DISTINCT ?researchGateVar;separator='|') as ?researchGate)&lt;br /&gt;
	(group_concat(DISTINCT ?mathGenealogyVar;separator='|') as ?mathGenealogy)&lt;br /&gt;
	(group_concat(DISTINCT ?locVar;separator='|') as ?loc)&lt;br /&gt;
	(group_concat(DISTINCT ?linkedinVar;separator='|') as ?linkedin)&lt;br /&gt;
	(group_concat(DISTINCT ?lattesVar;separator='|') as ?lattes)&lt;br /&gt;
	(group_concat(DISTINCT ?isniVar;separator='|') as ?isni)&lt;br /&gt;
	(group_concat(DISTINCT ?ieeeVar;separator='|') as ?ieee)&lt;br /&gt;
	(group_concat(DISTINCT ?geprisVar;separator='|') as ?gepris)&lt;br /&gt;
	(group_concat(DISTINCT ?gndVar;separator='|') as ?gnd)&lt;br /&gt;
WHERE{&lt;br /&gt;
	?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
		dblp:publishedInSeriesVolume ?volume;&lt;br /&gt;
		dblp:editedBy ?editor.&lt;br /&gt;
	?editor dblp:primaryCreatorName ?nameVar.&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryHomepage ?homepageVar.}&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryAffiliation ?affiliationVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?dblp_blank.&lt;br /&gt;
		?dblp_blank datacite:usesIdentifierScheme datacite:dblp;&lt;br /&gt;
		litre:hasLiteralValue ?dblpVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?wikidata_blank.&lt;br /&gt;
		?wikidata_blank datacite:usesIdentifierScheme datacite:wikidata;&lt;br /&gt;
		litre:hasLiteralValue ?wikidataVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?orcid_blank.&lt;br /&gt;
		?orcid_blank datacite:usesIdentifierScheme datacite:orcid;&lt;br /&gt;
		litre:hasLiteralValue ?orcidVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?googleScholar_blank.&lt;br /&gt;
		?googleScholar_blank datacite:usesIdentifierScheme datacite:google-scholar;&lt;br /&gt;
		litre:hasLiteralValue ?googleScholarVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?acm_blank.&lt;br /&gt;
		?acm_blank datacite:usesIdentifierScheme datacite:acm;&lt;br /&gt;
		litre:hasLiteralValue ?acmVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?twitter_blank.&lt;br /&gt;
		?twitter_blank datacite:usesIdentifierScheme datacite:twitter;&lt;br /&gt;
		litre:hasLiteralValue ?twitterVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?github_blank.&lt;br /&gt;
		?github_blank datacite:usesIdentifierScheme datacite:github;&lt;br /&gt;
		litre:hasLiteralValue ?githubVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?viaf_blank.&lt;br /&gt;
		?viaf_blank datacite:usesIdentifierScheme datacite:viaf;&lt;br /&gt;
		litre:hasLiteralValue ?viafVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?scigraph_blank.&lt;br /&gt;
		?scigraph_blank datacite:usesIdentifierScheme datacite:scigraph;&lt;br /&gt;
		litre:hasLiteralValue ?scigraphVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?zbmath_blank.&lt;br /&gt;
		?zbmath_blank datacite:usesIdentifierScheme datacite:zbmath;&lt;br /&gt;
		litre:hasLiteralValue ?zbmathVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?researchGate_blank.&lt;br /&gt;
		?researchGate_blank datacite:usesIdentifierScheme datacite:research-gate;&lt;br /&gt;
		litre:hasLiteralValue ?researchGateVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?mathGenealogy_blank.&lt;br /&gt;
		?mathGenealogy_blank datacite:usesIdentifierScheme datacite:math-genealogy;&lt;br /&gt;
		litre:hasLiteralValue ?mathGenealogyVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?loc_blank.&lt;br /&gt;
		?loc_blank datacite:usesIdentifierScheme datacite:loc;&lt;br /&gt;
		litre:hasLiteralValue ?locVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?linkedin_blank.&lt;br /&gt;
		?linkedin_blank datacite:usesIdentifierScheme datacite:linkedin;&lt;br /&gt;
		litre:hasLiteralValue ?linkedinVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?lattes_blank.&lt;br /&gt;
		?lattes_blank datacite:usesIdentifierScheme datacite:lattes;&lt;br /&gt;
		litre:hasLiteralValue ?lattesVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?isni_blank.&lt;br /&gt;
		?isni_blank datacite:usesIdentifierScheme datacite:isni;&lt;br /&gt;
		litre:hasLiteralValue ?isniVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?ieee_blank.&lt;br /&gt;
		?ieee_blank datacite:usesIdentifierScheme datacite:ieee;&lt;br /&gt;
		litre:hasLiteralValue ?ieeeVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gepris_blank.&lt;br /&gt;
		?gepris_blank datacite:usesIdentifierScheme datacite:gepris;&lt;br /&gt;
		litre:hasLiteralValue ?geprisVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gnd_blank.&lt;br /&gt;
		?gnd_blank datacite:usesIdentifierScheme datacite:gnd;&lt;br /&gt;
		litre:hasLiteralValue ?gndVar.}&lt;br /&gt;
}&lt;br /&gt;
GROUP BY ?editor&lt;br /&gt;
                &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
=== Comparing Extracted and dblp Editors ===&lt;br /&gt;
* editor by volume comparison&lt;br /&gt;
** 2233 volume the extracted editors match the dblp editors&lt;br /&gt;
** 807 volumes are missing in dblp (editors extracted)&lt;br /&gt;
** 27 volumes more editors were extracted than in dblp&lt;br /&gt;
** 387 volumes dblp has more editors than we could extract&lt;br /&gt;
* 9321 out of 11764 editor records can be reconciled&lt;br /&gt;
** 79.23%&lt;br /&gt;
&lt;br /&gt;
== Wikidata Reconciliation ==&lt;br /&gt;
Using the ids queried from dblp to find the corresponding wikidata entry.&lt;br /&gt;
&lt;br /&gt;
Current strategy:&lt;br /&gt;
;Input: List of different identifiers that are known about a editor&lt;br /&gt;
;Output: SPARQL query&lt;br /&gt;
&lt;br /&gt;
Example:&lt;br /&gt;
*Input:&lt;br /&gt;
** homepage: http://www.stefandecker.org&lt;br /&gt;
** gnd id: 173443443&lt;br /&gt;
** dblp author id: d/StefanDecker&lt;br /&gt;
* Output:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX wdt: &amp;lt;http://www.wikidata.org/prop/direct/&amp;gt;&lt;br /&gt;
PREFIX wikibase: &amp;lt;http://wikiba.se/ontology#&amp;gt;&lt;br /&gt;
PREFIX rdfs: &amp;lt;http://www.w3.org/2000/01/rdf-schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?person ?personLabel&lt;br /&gt;
WHERE&lt;br /&gt;
{&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P856 &amp;lt;http://www.stefandecker.org&amp;gt;.} }&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P227 &amp;quot;173443443&amp;quot;.} } # gnd&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P2456 &amp;quot;d/StefanDecker&amp;quot;.} } # dblp&lt;br /&gt;
  ?person rdfs:label ?personLabel. FILTER(lang(?personLabel)=&amp;quot;en&amp;quot;)&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Running these queries for all 4942 editors known by dblp we get:&lt;br /&gt;
* Queries currently running...[[User:Tim Holzheim|Tim Holzheim]] ([[User talk:Tim Holzheim|talk]]) 09:49, 9 March 2023 (CET)&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1200</id>
		<title>Editor Extraction and Reconciliation</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Editor_Extraction_and_Reconciliation&amp;diff=1200"/>
		<updated>2023-03-09T08:49:22Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: Created page with &amp;quot;= Editor Extraction =  * covered volumes 1-3354 ** optimized for volumes 600+ * 11764 Editor records * for 228 volumes no editors could be extracted File:volume_editor_distr...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Editor Extraction =&lt;br /&gt;
&lt;br /&gt;
* covered volumes 1-3354&lt;br /&gt;
** optimized for volumes 600+&lt;br /&gt;
* 11764 Editor records&lt;br /&gt;
* for 228 volumes no editors could be extracted&lt;br /&gt;
[[File:volume_editor_distribution.png|400px]]&lt;br /&gt;
&lt;br /&gt;
= Reconciliation =&lt;br /&gt;
== dblp reconciliation ==&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?vol_number (GROUP_CONCAT(?name; separator=&amp;quot;|&amp;quot;) as ?names) (GROUP_CONCAT(?dblp_id; separator=&amp;quot;|&amp;quot;) as ?concat_dblp_id)&lt;br /&gt;
WHERE {&lt;br /&gt;
  ?volume dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeries &amp;quot;CEUR Workshop Proceedings&amp;quot; ;&lt;br /&gt;
    dblp:publishedInSeriesVolume ?vol_number;&lt;br /&gt;
    dblp:hasSignature ?editors.&lt;br /&gt;
    ?editors dblp:signatureDblpName ?name ;&lt;br /&gt;
        dblp:signatureCreator ?dblp_id ;&lt;br /&gt;
        dblp:signatureOrdinal ?editor_ordinal ;&lt;br /&gt;
        dblp:signaturePublication ?dblp_publication_id ;&lt;br /&gt;
        a dblp:EditorSignature.&lt;br /&gt;
}&lt;br /&gt;
GROUP BY  ?vol_number &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Volume Editors of CEUR-WS in dblp with identifiers ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX datacite: &amp;lt;http://purl.org/spar/datacite/&amp;gt;&lt;br /&gt;
PREFIX dblp: &amp;lt;https://dblp.org/rdf/schema#&amp;gt;&lt;br /&gt;
PREFIX litre: &amp;lt;http://purl.org/spar/literal/&amp;gt;&lt;br /&gt;
SELECT DISTINCT &lt;br /&gt;
	(group_concat(DISTINCT ?nameVar;separator='|') as ?name) &lt;br /&gt;
	(group_concat(DISTINCT ?homepageVar;separator='|') as ?homepage)&lt;br /&gt;
	(group_concat(DISTINCT ?affiliationVar;separator='|') as ?affiliation)&lt;br /&gt;
	(group_concat(DISTINCT ?dblpVar;separator='|') as ?dblp)&lt;br /&gt;
	(group_concat(DISTINCT ?wikidataVar;separator='|') as ?wikidata)&lt;br /&gt;
	(group_concat(DISTINCT ?orcidVar;separator='|') as ?orcid)&lt;br /&gt;
	(group_concat(DISTINCT ?googleScholarVar;separator='|') as ?googleScholar)&lt;br /&gt;
	(group_concat(DISTINCT ?acmVar;separator='|') as ?acm)&lt;br /&gt;
	(group_concat(DISTINCT ?twitterVar;separator='|') as ?twitter)&lt;br /&gt;
	(group_concat(DISTINCT ?githubVar;separator='|') as ?github)&lt;br /&gt;
	(group_concat(DISTINCT ?viafVar;separator='|') as ?viaf)&lt;br /&gt;
	(group_concat(DISTINCT ?scigraphVar;separator='|') as ?scigraph)&lt;br /&gt;
	(group_concat(DISTINCT ?zbmathVar;separator='|') as ?zbmath)&lt;br /&gt;
	(group_concat(DISTINCT ?researchGateVar;separator='|') as ?researchGate)&lt;br /&gt;
	(group_concat(DISTINCT ?mathGenealogyVar;separator='|') as ?mathGenealogy)&lt;br /&gt;
	(group_concat(DISTINCT ?locVar;separator='|') as ?loc)&lt;br /&gt;
	(group_concat(DISTINCT ?linkedinVar;separator='|') as ?linkedin)&lt;br /&gt;
	(group_concat(DISTINCT ?lattesVar;separator='|') as ?lattes)&lt;br /&gt;
	(group_concat(DISTINCT ?isniVar;separator='|') as ?isni)&lt;br /&gt;
	(group_concat(DISTINCT ?ieeeVar;separator='|') as ?ieee)&lt;br /&gt;
	(group_concat(DISTINCT ?geprisVar;separator='|') as ?gepris)&lt;br /&gt;
	(group_concat(DISTINCT ?gndVar;separator='|') as ?gnd)&lt;br /&gt;
WHERE{&lt;br /&gt;
	?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
		dblp:publishedInSeriesVolume ?volume;&lt;br /&gt;
		dblp:editedBy ?editor.&lt;br /&gt;
	?editor dblp:primaryCreatorName ?nameVar.&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryHomepage ?homepageVar.}&lt;br /&gt;
	OPTIONAL{?editor dblp:primaryAffiliation ?affiliationVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?dblp_blank.&lt;br /&gt;
		?dblp_blank datacite:usesIdentifierScheme datacite:dblp;&lt;br /&gt;
		litre:hasLiteralValue ?dblpVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?wikidata_blank.&lt;br /&gt;
		?wikidata_blank datacite:usesIdentifierScheme datacite:wikidata;&lt;br /&gt;
		litre:hasLiteralValue ?wikidataVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?orcid_blank.&lt;br /&gt;
		?orcid_blank datacite:usesIdentifierScheme datacite:orcid;&lt;br /&gt;
		litre:hasLiteralValue ?orcidVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?googleScholar_blank.&lt;br /&gt;
		?googleScholar_blank datacite:usesIdentifierScheme datacite:google-scholar;&lt;br /&gt;
		litre:hasLiteralValue ?googleScholarVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?acm_blank.&lt;br /&gt;
		?acm_blank datacite:usesIdentifierScheme datacite:acm;&lt;br /&gt;
		litre:hasLiteralValue ?acmVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?twitter_blank.&lt;br /&gt;
		?twitter_blank datacite:usesIdentifierScheme datacite:twitter;&lt;br /&gt;
		litre:hasLiteralValue ?twitterVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?github_blank.&lt;br /&gt;
		?github_blank datacite:usesIdentifierScheme datacite:github;&lt;br /&gt;
		litre:hasLiteralValue ?githubVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?viaf_blank.&lt;br /&gt;
		?viaf_blank datacite:usesIdentifierScheme datacite:viaf;&lt;br /&gt;
		litre:hasLiteralValue ?viafVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?scigraph_blank.&lt;br /&gt;
		?scigraph_blank datacite:usesIdentifierScheme datacite:scigraph;&lt;br /&gt;
		litre:hasLiteralValue ?scigraphVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?zbmath_blank.&lt;br /&gt;
		?zbmath_blank datacite:usesIdentifierScheme datacite:zbmath;&lt;br /&gt;
		litre:hasLiteralValue ?zbmathVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?researchGate_blank.&lt;br /&gt;
		?researchGate_blank datacite:usesIdentifierScheme datacite:research-gate;&lt;br /&gt;
		litre:hasLiteralValue ?researchGateVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?mathGenealogy_blank.&lt;br /&gt;
		?mathGenealogy_blank datacite:usesIdentifierScheme datacite:math-genealogy;&lt;br /&gt;
		litre:hasLiteralValue ?mathGenealogyVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?loc_blank.&lt;br /&gt;
		?loc_blank datacite:usesIdentifierScheme datacite:loc;&lt;br /&gt;
		litre:hasLiteralValue ?locVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?linkedin_blank.&lt;br /&gt;
		?linkedin_blank datacite:usesIdentifierScheme datacite:linkedin;&lt;br /&gt;
		litre:hasLiteralValue ?linkedinVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?lattes_blank.&lt;br /&gt;
		?lattes_blank datacite:usesIdentifierScheme datacite:lattes;&lt;br /&gt;
		litre:hasLiteralValue ?lattesVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?isni_blank.&lt;br /&gt;
		?isni_blank datacite:usesIdentifierScheme datacite:isni;&lt;br /&gt;
		litre:hasLiteralValue ?isniVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?ieee_blank.&lt;br /&gt;
		?ieee_blank datacite:usesIdentifierScheme datacite:ieee;&lt;br /&gt;
		litre:hasLiteralValue ?ieeeVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gepris_blank.&lt;br /&gt;
		?gepris_blank datacite:usesIdentifierScheme datacite:gepris;&lt;br /&gt;
		litre:hasLiteralValue ?geprisVar.}&lt;br /&gt;
	OPTIONAL{&lt;br /&gt;
		?editor datacite:hasIdentifier ?gnd_blank.&lt;br /&gt;
		?gnd_blank datacite:usesIdentifierScheme datacite:gnd;&lt;br /&gt;
		litre:hasLiteralValue ?gndVar.}&lt;br /&gt;
}&lt;br /&gt;
GROUP BY ?editor&lt;br /&gt;
                &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
=== Comparing Extracted and dblp Editors ===&lt;br /&gt;
* editor by volume comparison&lt;br /&gt;
** 2233 volume the extracted editors match the dblp editors&lt;br /&gt;
** 807 volumes are missing in dblp (editors extracted)&lt;br /&gt;
** 27 volumes more editors were extracted than in dblp&lt;br /&gt;
** 387 volumes dblp has more editors than we could extract&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Wikidata Reconciliation ==&lt;br /&gt;
Using the ids queried from dblp to find the corresponding wikidata entry.&lt;br /&gt;
&lt;br /&gt;
Current strategy:&lt;br /&gt;
;Input: List of different identifiers that are known about a editor&lt;br /&gt;
;Output: SPARQL query&lt;br /&gt;
&lt;br /&gt;
Example:&lt;br /&gt;
*Input:&lt;br /&gt;
** homepage: http://www.stefandecker.org&lt;br /&gt;
** gnd id: 173443443&lt;br /&gt;
** dblp author id: d/StefanDecker&lt;br /&gt;
* Output:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
PREFIX wdt: &amp;lt;http://www.wikidata.org/prop/direct/&amp;gt;&lt;br /&gt;
PREFIX wikibase: &amp;lt;http://wikiba.se/ontology#&amp;gt;&lt;br /&gt;
PREFIX rdfs: &amp;lt;http://www.w3.org/2000/01/rdf-schema#&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?person ?personLabel&lt;br /&gt;
WHERE&lt;br /&gt;
{&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P856 &amp;lt;http://www.stefandecker.org&amp;gt;.} }&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P227 &amp;quot;173443443&amp;quot;.} } # gnd&lt;br /&gt;
  UNION&lt;br /&gt;
  {OPTIONAL{ ?person wdt:P2456 &amp;quot;d/StefanDecker&amp;quot;.} } # dblp&lt;br /&gt;
  ?person rdfs:label ?personLabel. FILTER(lang(?personLabel)=&amp;quot;en&amp;quot;)&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Running these queries for all 4942 editors known by dblp we get:&lt;br /&gt;
* Queries currently running...[[User:Tim Holzheim|Tim Holzheim]] ([[User talk:Tim Holzheim|talk]]) 09:49, 9 March 2023 (CET)&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=File:Volume_editor_distribution.png&amp;diff=1184</id>
		<title>File:Volume editor distribution.png</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=File:Volume_editor_distribution.png&amp;diff=1184"/>
		<updated>2023-03-09T08:16:24Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: File uploaded with MsUpload&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;File uploaded with MsUpload&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Wikidata_Synchronization_for_the_CEUR-WS_publishing_platform_Use-Case_of&amp;diff=786</id>
		<title>Wikidata Synchronization for the CEUR-WS publishing platform Use-Case of</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Wikidata_Synchronization_for_the_CEUR-WS_publishing_platform_Use-Case_of&amp;diff=786"/>
		<updated>2022-10-17T12:24:49Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: Created page with &amp;quot;CEUR Workshop Proceedings is a free open-access publication service for academic workshops.  Data about the proceedings, events, papers, editors and authors are stored in a se...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;CEUR Workshop Proceedings is a free open-access publication service for academic workshops. &lt;br /&gt;
Data about the proceedings, events, papers, editors and authors are stored in a semi-structured html format.&lt;br /&gt;
In a effort to semantify the data and move it into a semantic mediawiki the goal was also to integrate the data into wikidata.&lt;br /&gt;
Since wikidata is one of the largest knowledge graphs with a large portion covering scholarly articles, adding the ceur-ws data to it increases accessibility and the cross referencing of the contained enties.&lt;br /&gt;
For the historic records the data needs to be extracted to create the wiki pages and wikidata items.&lt;br /&gt;
Additionally for new proceedings and for the curration of existing once data from the smw wiki needs to be published to wikidata.&lt;br /&gt;
The CEUR-WS browser in our pyCEURmake project aims to introduce such a bidirectional api for edits on wikidata and a smw wiki.&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-09-06&amp;diff=758</id>
		<title>Workdocumentation 2022-09-06</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-09-06&amp;diff=758"/>
		<updated>2022-09-06T07:40:19Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-29|next=Workdocumentation 2022-09-07|category=Workdocumentation}}&lt;br /&gt;
== Definition of Done ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== ToDo ===&lt;br /&gt;
* Extraction of Location&lt;br /&gt;
* Extraction of start time &amp;amp; end time&lt;br /&gt;
* Extraction of Papers&lt;br /&gt;
** Name of the paper&lt;br /&gt;
** authors (with dblp id)&lt;br /&gt;
* Extraction of Editors&lt;br /&gt;
** name of the editors&lt;br /&gt;
** affiliation of the editors&lt;br /&gt;
&lt;br /&gt;
* connecting the events to its series&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-29&amp;diff=757</id>
		<title>Workdocumentation 2022-08-29</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-29&amp;diff=757"/>
		<updated>2022-09-06T07:39:32Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-23|next=Workdocumentation 2022-09-06|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Wolfgang&lt;br /&gt;
* Beyza &lt;br /&gt;
= CEUR-WS Wikidata Sync =&lt;br /&gt;
&lt;br /&gt;
CEUR-WS Platform siehe http://ceur-ws.org/&lt;br /&gt;
&lt;br /&gt;
Etwa 3200 Proceedings Volumes&lt;br /&gt;
&lt;br /&gt;
Synchronisation mit Wikidata, k10plus und dblp&lt;br /&gt;
&lt;br /&gt;
Dokumentiert in https://ceur-ws.bitplan.com/index.php/Workdocumentation_2022-08-12 und  folgende - siehe auch RQ Wiki&lt;br /&gt;
&lt;br /&gt;
Nutzung dblp SPARQL Abfrage via https://qlever.cs.uni-freiburg.de/dblp bzw. lokale Endpoints&lt;br /&gt;
&lt;br /&gt;
CEUR-WS Browser Frontend: http://ceur-ws-browser.bitplan.com/&lt;br /&gt;
&lt;br /&gt;
Beispiel: http://ceur-ws-browser.bitplan.com/volume/3111&lt;br /&gt;
&lt;br /&gt;
Bewusst noch keine Detaildaten wie Ort/Datum - Link-First Ansatz ...&lt;br /&gt;
&lt;br /&gt;
Beispiel für komplette Serie: https://scholia.toolforge.org/event-series/Q56846035 - erzeugt von Lars Willighagen mit https://citation.js.org/&lt;br /&gt;
&lt;br /&gt;
Ziel: Paper/Author Links see also https://github.com/zotero/translators/issues/2015&lt;br /&gt;
&lt;br /&gt;
Wie soll die ConfIDent Anbindung erfolgen?&lt;br /&gt;
&lt;br /&gt;
= Justpy =&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
scripts/findexamples&lt;br /&gt;
wf@fix justpy % scripts/findexamples | grep plot_test&lt;br /&gt;
jp.justpy(plot_test1)&lt;br /&gt;
jp.justpy(plot_test2)&lt;br /&gt;
jp.justpy(plot_test3)&lt;br /&gt;
jp.justpy(plot_test4)&lt;br /&gt;
jp.justpy(plot_test5)&lt;br /&gt;
jp.justpy(plot_test6)&lt;br /&gt;
jp.justpy(plot_test7)&lt;br /&gt;
jp.justpy(plot_test8)&lt;br /&gt;
jp.justpy(plot_test9)&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
* https://github.com/justpy-org/justpy/issues/468&lt;br /&gt;
* https://github.com/justpy-org/justpy/issues/464&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-09-06&amp;diff=756</id>
		<title>Workdocumentation 2022-09-06</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-09-06&amp;diff=756"/>
		<updated>2022-09-06T07:39:30Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: Created page with &amp;quot;{{PageSequence|prev=Workdocumentation 2022-08-29|next=Workdocumentation 2022-09-07|category=Workdocumentation}}&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-29|next=Workdocumentation 2022-09-07|category=Workdocumentation}}&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=704</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=704"/>
		<updated>2022-08-17T11:09:15Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Further Queries */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
wget https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
--2022-08-16 15:36:37--  https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 1065586620 (1016M) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.ttl.gz’&lt;br /&gt;
dblp.ttl.gz             47%[===========&amp;gt;              ] 483.91M  43.1MB/s    eta 16s&lt;br /&gt;
dblp.ttl.gz            100%[=========================&amp;gt;]   1016M  32.7MB/s    in 29s     &lt;br /&gt;
2022-08-16 15:37:06 (35.3 MB/s) - ‘dblp.ttl.gz’ saved [1065586620/1065586620]&lt;br /&gt;
gunzip dblp.ttl.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
==== Build ====&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
date;sudo docker build --file Dockerfiles/Dockerfile.Ubuntu20.04 -t qlever .;date&lt;br /&gt;
Tue Aug 16 15:13:02 CEST 2022&lt;br /&gt;
Sending build context to Docker daemon    453MB&lt;br /&gt;
Step 1/43 : FROM ubuntu:20.04 as base&lt;br /&gt;
20.04: Pulling from library/ubuntu&lt;br /&gt;
3b65ec22a9e9: Pull complete &lt;br /&gt;
&lt;br /&gt;
Removing intermediate container 1ccc2a50364e&lt;br /&gt;
 ---&amp;gt; d0018440a4cd&lt;br /&gt;
Successfully built d0018440a4cd&lt;br /&gt;
Successfully tagged qlever:latest&lt;br /&gt;
Tue Aug 16 15:25:28 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== qlever control ===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
git clone https://github.com/ad-freiburg/qlever-controlCloning into 'qlever-control'...&lt;br /&gt;
remote: Enumerating objects: 368, done.&lt;br /&gt;
remote: Counting objects: 100% (208/208), done.&lt;br /&gt;
remote: Compressing objects: 100% (135/135), done.&lt;br /&gt;
remote: Total 368 (delta 75), reused 183 (delta 72), pack-reused 160&lt;br /&gt;
Receiving objects: 100% (368/368), 117.76 KiB | 7.36 MiB/s, done.&lt;br /&gt;
Resolving deltas: 100% (130/130), done.&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp stardog ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
docker pull stardog/stardog:latest&lt;br /&gt;
latest: Pulling from stardog/stardog&lt;br /&gt;
2d473b07cdd5: Pull complete &lt;br /&gt;
b0eac9aee9aa: Pull complete &lt;br /&gt;
8d5b89da19bc: Pull complete &lt;br /&gt;
91c2bc930138: Pull complete &lt;br /&gt;
265d7b96dd8f: Pull complete &lt;br /&gt;
Digest: sha256:7fc70e1bd3d17bdb1440f0cd810294b5318f1c53935425bb51526da4a949afc0&lt;br /&gt;
Status: Downloaded newer image for stardog/stardog:latest&lt;br /&gt;
docker.io/stardog/stardog:latest&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= DBLP versus CEUR-WS Queries = &lt;br /&gt;
== All Volumes known to dblp ==&lt;br /&gt;
* expected 70% of 3185 volumes found 75%&lt;br /&gt;
* actual 75%&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?proceeding) as ?count) (MIN(xsd:integer(?volNumber)) as ?min)  (MAX(xsd:integer(?volNumber)) as ?max) &lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                dblp:publishedInSeriesVolume ?volNumber .&lt;br /&gt;
    }&lt;br /&gt;
LIMIT 5000&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count || min || max&lt;br /&gt;
|-&lt;br /&gt;
| 2375 || 1 || 3157&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All papers ==&lt;br /&gt;
* expected 70% of ~50000 papers&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?paper) as ?count)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count&lt;br /&gt;
|-&lt;br /&gt;
| 44275&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All authors and editors ==&lt;br /&gt;
* authors: papers expected: &amp;lt;1:1 and &amp;gt;1:3 found 1.6 distinct authors in relation to distinct papers&lt;br /&gt;
* editors: volumes 3:1  found 4625 editors for 2377 volumes&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(DISTINCT ?author) as ?numberOfAuthors) &lt;br /&gt;
       (COUNT(DISTINCT ?paper) as ?numberOfPapers) &lt;br /&gt;
       (COUNT(DISTINCT ?editor) as ?numberOfEditors)&lt;br /&gt;
       (COUNT(DISTINCT ?proceeding) as ?numberOfVolumes)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    OPTIONAL{?proceeding dblp:editedBy ?editor}&lt;br /&gt;
    OPTIONAL{&lt;br /&gt;
        ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
        OPTIONAL{?paper dblp:authoredBy ?author}&lt;br /&gt;
    }&lt;br /&gt;
    &lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| numberOfAuthors || numberOfPapers || numberOfEditors || numberOfVolumes&lt;br /&gt;
|-&lt;br /&gt;
| 69846 || 44275 || 4625 || 2377&lt;br /&gt;
|}&lt;br /&gt;
Note: There are proceedings of ceurws without an volumeId&lt;br /&gt;
&lt;br /&gt;
Namely:&lt;br /&gt;
* https://dblp.org/rec/conf/www/2017ldow&lt;br /&gt;
* https://dblp.org/rec/conf/semweb/2017hybridsemstats&lt;br /&gt;
&lt;br /&gt;
== Cross-check against wikidata ==&lt;br /&gt;
* volumes in dblp and wikidata&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?proceeding ?wdProceedings ?urn&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                datacite:hasIdentifier [&lt;br /&gt;
                    datacite:usesIdentifierScheme datacite:urn ;&lt;br /&gt;
                    litre:hasLiteralValue ?urn ;&lt;br /&gt;
                    a datacite:ResourceIdentifier&lt;br /&gt;
                ] .&lt;br /&gt;
    service &amp;lt;https://query.wikidata.org/sparql&amp;gt; {&lt;br /&gt;
        ?wdProceedings wdt:P179 wd:Q27230297;&lt;br /&gt;
                       wdt:P4109 ?urn&lt;br /&gt;
    }&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* volumes in wikidata missing in dblp&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?wdProceedings ?urn&lt;br /&gt;
WHERE { &lt;br /&gt;
    service &amp;lt;https://query.wikidata.org/sparql&amp;gt; {&lt;br /&gt;
        ?wdProceedings wdt:P179 wd:Q27230297;&lt;br /&gt;
                       wdt:P4109 ?urn&lt;br /&gt;
    }&lt;br /&gt;
    MINUS{&lt;br /&gt;
        ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                datacite:hasIdentifier [&lt;br /&gt;
                    datacite:usesIdentifierScheme datacite:urn ;&lt;br /&gt;
                    litre:hasLiteralValue ?urn ;&lt;br /&gt;
                    a datacite:ResourceIdentifier&lt;br /&gt;
                ] .&lt;br /&gt;
    }&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
* volumes in dblp missing in wikidata&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT DISTINCT ?proceedings ?urn&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                datacite:hasIdentifier [&lt;br /&gt;
                    datacite:usesIdentifierScheme datacite:urn ;&lt;br /&gt;
                    litre:hasLiteralValue ?urn ;&lt;br /&gt;
                    a datacite:ResourceIdentifier&lt;br /&gt;
                ] .&lt;br /&gt;
&lt;br /&gt;
    MINUS{&lt;br /&gt;
        service &amp;lt;https://query.wikidata.org/sparql&amp;gt; {&lt;br /&gt;
        ?wdProceedings wdt:P179 wd:Q27230297;&lt;br /&gt;
                       wdt:P4109 ?urn.&lt;br /&gt;
        }&lt;br /&gt;
    }&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== Further Queries ==&lt;br /&gt;
* Cross-check against wikidata&lt;br /&gt;
** 53000 dblp authors out of 2.3m&lt;br /&gt;
** all editors MUST be in dblp acording to the rules of ceur-ws&lt;br /&gt;
** authors can be in dblp&lt;br /&gt;
* Disambiguation problem only on ceur-ws side&lt;br /&gt;
** idea: calculate distance of potential candidate authors to authors in the same volume&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=703</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=703"/>
		<updated>2022-08-17T10:31:54Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* All authors and editors = */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
wget https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
--2022-08-16 15:36:37--  https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 1065586620 (1016M) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.ttl.gz’&lt;br /&gt;
dblp.ttl.gz             47%[===========&amp;gt;              ] 483.91M  43.1MB/s    eta 16s&lt;br /&gt;
dblp.ttl.gz            100%[=========================&amp;gt;]   1016M  32.7MB/s    in 29s     &lt;br /&gt;
2022-08-16 15:37:06 (35.3 MB/s) - ‘dblp.ttl.gz’ saved [1065586620/1065586620]&lt;br /&gt;
gunzip dblp.ttl.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
==== Build ====&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
date;sudo docker build --file Dockerfiles/Dockerfile.Ubuntu20.04 -t qlever .;date&lt;br /&gt;
Tue Aug 16 15:13:02 CEST 2022&lt;br /&gt;
Sending build context to Docker daemon    453MB&lt;br /&gt;
Step 1/43 : FROM ubuntu:20.04 as base&lt;br /&gt;
20.04: Pulling from library/ubuntu&lt;br /&gt;
3b65ec22a9e9: Pull complete &lt;br /&gt;
&lt;br /&gt;
Removing intermediate container 1ccc2a50364e&lt;br /&gt;
 ---&amp;gt; d0018440a4cd&lt;br /&gt;
Successfully built d0018440a4cd&lt;br /&gt;
Successfully tagged qlever:latest&lt;br /&gt;
Tue Aug 16 15:25:28 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== qlever control ===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
git clone https://github.com/ad-freiburg/qlever-controlCloning into 'qlever-control'...&lt;br /&gt;
remote: Enumerating objects: 368, done.&lt;br /&gt;
remote: Counting objects: 100% (208/208), done.&lt;br /&gt;
remote: Compressing objects: 100% (135/135), done.&lt;br /&gt;
remote: Total 368 (delta 75), reused 183 (delta 72), pack-reused 160&lt;br /&gt;
Receiving objects: 100% (368/368), 117.76 KiB | 7.36 MiB/s, done.&lt;br /&gt;
Resolving deltas: 100% (130/130), done.&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp stardog ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
docker pull stardog/stardog:latest&lt;br /&gt;
latest: Pulling from stardog/stardog&lt;br /&gt;
2d473b07cdd5: Pull complete &lt;br /&gt;
b0eac9aee9aa: Pull complete &lt;br /&gt;
8d5b89da19bc: Pull complete &lt;br /&gt;
91c2bc930138: Pull complete &lt;br /&gt;
265d7b96dd8f: Pull complete &lt;br /&gt;
Digest: sha256:7fc70e1bd3d17bdb1440f0cd810294b5318f1c53935425bb51526da4a949afc0&lt;br /&gt;
Status: Downloaded newer image for stardog/stardog:latest&lt;br /&gt;
docker.io/stardog/stardog:latest&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= DBLP versus CEUR-WS Queries = &lt;br /&gt;
== All Volumes known to dblp ==&lt;br /&gt;
* expected 70% of 3185 volumes found 75%&lt;br /&gt;
* actual 75%&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?proceeding) as ?count) (MIN(xsd:integer(?volNumber)) as ?min)  (MAX(xsd:integer(?volNumber)) as ?max) &lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                dblp:publishedInSeriesVolume ?volNumber .&lt;br /&gt;
    }&lt;br /&gt;
LIMIT 5000&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count || min || max&lt;br /&gt;
|-&lt;br /&gt;
| 2375 || 1 || 3157&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All papers ==&lt;br /&gt;
* expected 70% of ~50000 papers&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?paper) as ?count)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count&lt;br /&gt;
|-&lt;br /&gt;
| 44275&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All authors and editors ==&lt;br /&gt;
* authors: papers expected: &amp;lt;1:1 and &amp;gt;1:3 found 1.6 distinct authors in relation to distinct papers&lt;br /&gt;
* editors: volumes 3:1  found 4625 editors for 2377 volumes&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(DISTINCT ?author) as ?numberOfAuthors) &lt;br /&gt;
       (COUNT(DISTINCT ?paper) as ?numberOfPapers) &lt;br /&gt;
       (COUNT(DISTINCT ?editor) as ?numberOfEditors)&lt;br /&gt;
       (COUNT(DISTINCT ?proceeding) as ?numberOfVolumes)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    OPTIONAL{?proceeding dblp:editedBy ?editor}&lt;br /&gt;
    OPTIONAL{&lt;br /&gt;
        ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
        OPTIONAL{?paper dblp:authoredBy ?author}&lt;br /&gt;
    }&lt;br /&gt;
    &lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| numberOfAuthors || numberOfPapers || numberOfEditors || numberOfVolumes&lt;br /&gt;
|-&lt;br /&gt;
| 69846 || 44275 || 4625 || 2377&lt;br /&gt;
|}&lt;br /&gt;
Note: There are proceedings of ceurws without an volumeId&lt;br /&gt;
&lt;br /&gt;
Namely:&lt;br /&gt;
* https://dblp.org/rec/conf/www/2017ldow&lt;br /&gt;
* https://dblp.org/rec/conf/semweb/2017hybridsemstats&lt;br /&gt;
&lt;br /&gt;
== Further Queries ==&lt;br /&gt;
* Cross-check against wikidata&lt;br /&gt;
** 53000 dblp authors out of 2.3m&lt;br /&gt;
** all editors MUST be in dblp acording to the rules of ceur-ws&lt;br /&gt;
** authors can be in dblp&lt;br /&gt;
* Disambiguation problem only on ceur-ws side&lt;br /&gt;
** idea: calculate distance of potential candidate authors to authors in the same volume&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=702</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=702"/>
		<updated>2022-08-17T10:28:59Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* DBLP versus CEUR-WS Queries */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
wget https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
--2022-08-16 15:36:37--  https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 1065586620 (1016M) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.ttl.gz’&lt;br /&gt;
dblp.ttl.gz             47%[===========&amp;gt;              ] 483.91M  43.1MB/s    eta 16s&lt;br /&gt;
dblp.ttl.gz            100%[=========================&amp;gt;]   1016M  32.7MB/s    in 29s     &lt;br /&gt;
2022-08-16 15:37:06 (35.3 MB/s) - ‘dblp.ttl.gz’ saved [1065586620/1065586620]&lt;br /&gt;
gunzip dblp.ttl.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
==== Build ====&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
date;sudo docker build --file Dockerfiles/Dockerfile.Ubuntu20.04 -t qlever .;date&lt;br /&gt;
Tue Aug 16 15:13:02 CEST 2022&lt;br /&gt;
Sending build context to Docker daemon    453MB&lt;br /&gt;
Step 1/43 : FROM ubuntu:20.04 as base&lt;br /&gt;
20.04: Pulling from library/ubuntu&lt;br /&gt;
3b65ec22a9e9: Pull complete &lt;br /&gt;
&lt;br /&gt;
Removing intermediate container 1ccc2a50364e&lt;br /&gt;
 ---&amp;gt; d0018440a4cd&lt;br /&gt;
Successfully built d0018440a4cd&lt;br /&gt;
Successfully tagged qlever:latest&lt;br /&gt;
Tue Aug 16 15:25:28 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== qlever control ===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
git clone https://github.com/ad-freiburg/qlever-controlCloning into 'qlever-control'...&lt;br /&gt;
remote: Enumerating objects: 368, done.&lt;br /&gt;
remote: Counting objects: 100% (208/208), done.&lt;br /&gt;
remote: Compressing objects: 100% (135/135), done.&lt;br /&gt;
remote: Total 368 (delta 75), reused 183 (delta 72), pack-reused 160&lt;br /&gt;
Receiving objects: 100% (368/368), 117.76 KiB | 7.36 MiB/s, done.&lt;br /&gt;
Resolving deltas: 100% (130/130), done.&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp stardog ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
docker pull stardog/stardog:latest&lt;br /&gt;
latest: Pulling from stardog/stardog&lt;br /&gt;
2d473b07cdd5: Pull complete &lt;br /&gt;
b0eac9aee9aa: Pull complete &lt;br /&gt;
8d5b89da19bc: Pull complete &lt;br /&gt;
91c2bc930138: Pull complete &lt;br /&gt;
265d7b96dd8f: Pull complete &lt;br /&gt;
Digest: sha256:7fc70e1bd3d17bdb1440f0cd810294b5318f1c53935425bb51526da4a949afc0&lt;br /&gt;
Status: Downloaded newer image for stardog/stardog:latest&lt;br /&gt;
docker.io/stardog/stardog:latest&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= DBLP versus CEUR-WS Queries = &lt;br /&gt;
== All Volumes known to dblp ==&lt;br /&gt;
* expected 70% of 3185 volumes found 75%&lt;br /&gt;
* actual 75%&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?proceeding) as ?count) (MIN(xsd:integer(?volNumber)) as ?min)  (MAX(xsd:integer(?volNumber)) as ?max) &lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                dblp:publishedInSeriesVolume ?volNumber .&lt;br /&gt;
    }&lt;br /&gt;
LIMIT 5000&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count || min || max&lt;br /&gt;
|-&lt;br /&gt;
| 2375 || 1 || 3157&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All papers ==&lt;br /&gt;
* expected 70% of ~50000 papers&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?paper) as ?count)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count&lt;br /&gt;
|-&lt;br /&gt;
| 44275&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All authors and editors ===&lt;br /&gt;
* authors: papers expected: &amp;lt;1:1 and &amp;gt;1:3 found 1.6 distinct authors in relation to distinct papers&lt;br /&gt;
* editors: volumes 3:1  found 4625 editors for 2377 volumes&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(DISTINCT ?author) as ?numberOfAuthors) &lt;br /&gt;
       (COUNT(DISTINCT ?paper) as ?numberOfPapers) &lt;br /&gt;
       (COUNT(DISTINCT ?editor) as ?numberOfEditors)&lt;br /&gt;
       (COUNT(DISTINCT ?proceeding) as ?numberOfVolumes)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    OPTIONAL{?proceeding dblp:editedBy ?editor}&lt;br /&gt;
    OPTIONAL{&lt;br /&gt;
        ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
        OPTIONAL{?paper dblp:authoredBy ?author}&lt;br /&gt;
    }&lt;br /&gt;
    &lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| numberOfAuthors || numberOfPapers || numberOfEditors || numberOfVolumes&lt;br /&gt;
|-&lt;br /&gt;
| 69846 || 44275 || 4625 || 2377&lt;br /&gt;
|}&lt;br /&gt;
Note: There are proceedings of ceurws without an volumeId&lt;br /&gt;
Namely:&lt;br /&gt;
* https://dblp.org/rec/conf/www/2017ldow&lt;br /&gt;
* https://dblp.org/rec/conf/semweb/2017hybridsemstats&lt;br /&gt;
&lt;br /&gt;
== Further Queries ==&lt;br /&gt;
* Cross-check against wikidata&lt;br /&gt;
** 53000 dblp authors out of 2.3m&lt;br /&gt;
** all editors MUST be in dblp acording to the rules of ceur-ws&lt;br /&gt;
** authors can be in dblp&lt;br /&gt;
* Disambiguation problem only on ceur-ws side&lt;br /&gt;
** idea: calculate distance of potential candidate authors to authors in the same volume&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=701</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=701"/>
		<updated>2022-08-17T10:26:35Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* All authors and editors = */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
wget https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
--2022-08-16 15:36:37--  https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 1065586620 (1016M) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.ttl.gz’&lt;br /&gt;
dblp.ttl.gz             47%[===========&amp;gt;              ] 483.91M  43.1MB/s    eta 16s&lt;br /&gt;
dblp.ttl.gz            100%[=========================&amp;gt;]   1016M  32.7MB/s    in 29s     &lt;br /&gt;
2022-08-16 15:37:06 (35.3 MB/s) - ‘dblp.ttl.gz’ saved [1065586620/1065586620]&lt;br /&gt;
gunzip dblp.ttl.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
==== Build ====&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
date;sudo docker build --file Dockerfiles/Dockerfile.Ubuntu20.04 -t qlever .;date&lt;br /&gt;
Tue Aug 16 15:13:02 CEST 2022&lt;br /&gt;
Sending build context to Docker daemon    453MB&lt;br /&gt;
Step 1/43 : FROM ubuntu:20.04 as base&lt;br /&gt;
20.04: Pulling from library/ubuntu&lt;br /&gt;
3b65ec22a9e9: Pull complete &lt;br /&gt;
&lt;br /&gt;
Removing intermediate container 1ccc2a50364e&lt;br /&gt;
 ---&amp;gt; d0018440a4cd&lt;br /&gt;
Successfully built d0018440a4cd&lt;br /&gt;
Successfully tagged qlever:latest&lt;br /&gt;
Tue Aug 16 15:25:28 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== qlever control ===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
git clone https://github.com/ad-freiburg/qlever-controlCloning into 'qlever-control'...&lt;br /&gt;
remote: Enumerating objects: 368, done.&lt;br /&gt;
remote: Counting objects: 100% (208/208), done.&lt;br /&gt;
remote: Compressing objects: 100% (135/135), done.&lt;br /&gt;
remote: Total 368 (delta 75), reused 183 (delta 72), pack-reused 160&lt;br /&gt;
Receiving objects: 100% (368/368), 117.76 KiB | 7.36 MiB/s, done.&lt;br /&gt;
Resolving deltas: 100% (130/130), done.&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp stardog ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
docker pull stardog/stardog:latest&lt;br /&gt;
latest: Pulling from stardog/stardog&lt;br /&gt;
2d473b07cdd5: Pull complete &lt;br /&gt;
b0eac9aee9aa: Pull complete &lt;br /&gt;
8d5b89da19bc: Pull complete &lt;br /&gt;
91c2bc930138: Pull complete &lt;br /&gt;
265d7b96dd8f: Pull complete &lt;br /&gt;
Digest: sha256:7fc70e1bd3d17bdb1440f0cd810294b5318f1c53935425bb51526da4a949afc0&lt;br /&gt;
Status: Downloaded newer image for stardog/stardog:latest&lt;br /&gt;
docker.io/stardog/stardog:latest&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= DBLP versus CEUR-WS Queries = &lt;br /&gt;
== All Volumes known to dblp ==&lt;br /&gt;
** expected 70% of 3185 volumes found 75%&lt;br /&gt;
** actual 75%&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?proceeding) as ?count) (MIN(xsd:integer(?volNumber)) as ?min)  (MAX(xsd:integer(?volNumber)) as ?max) &lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                dblp:publishedInSeriesVolume ?volNumber .&lt;br /&gt;
    }&lt;br /&gt;
LIMIT 5000&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count || min || max&lt;br /&gt;
|-&lt;br /&gt;
| 2375 || 1 || 3157&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All papers ==&lt;br /&gt;
** expected 70% of ~50000 papers&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?paper) as ?count)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count&lt;br /&gt;
|-&lt;br /&gt;
| 44275&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== All authors and editors ===&lt;br /&gt;
** authors: papers expected: &amp;lt;1:1 and &amp;gt;1:3 found 1.6 distinct authors in relation to distinct papers&lt;br /&gt;
** editors: volumes 3:1  found 4625 editors for 2377 volumes&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(DISTINCT ?author) as ?numberOfAuthors) &lt;br /&gt;
       (COUNT(DISTINCT ?paper) as ?numberOfPapers) &lt;br /&gt;
       (COUNT(DISTINCT ?editor) as ?numberOfEditors)&lt;br /&gt;
       (COUNT(DISTINCT ?proceeding) as ?numberOfVolumes)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    OPTIONAL{?proceeding dblp:editedBy ?editor}&lt;br /&gt;
    OPTIONAL{&lt;br /&gt;
        ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
        OPTIONAL{?paper dblp:authoredBy ?author}&lt;br /&gt;
    }&lt;br /&gt;
    &lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| numberOfAuthors || numberOfPapers || numberOfEditors || numberOfVolumes&lt;br /&gt;
|-&lt;br /&gt;
| 69846 || 44275 || 4625 || 2377&lt;br /&gt;
|}&lt;br /&gt;
Note: There are proceedings of ceurws without an volumeId&lt;br /&gt;
* Cross-check against wikidata&lt;br /&gt;
** 53000 dblp authors out of 2.3m&lt;br /&gt;
** all editors MUST be in dblp acording to the rules of ceur-ws&lt;br /&gt;
** authors can be in dblp&lt;br /&gt;
* Disambiguation problem only on ceur-ws side&lt;br /&gt;
** idea: calculate distance of potential candidate authors to authors in the same volume&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=697</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=697"/>
		<updated>2022-08-17T10:18:56Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* DBLP Queries */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
wget https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
--2022-08-16 15:36:37--  https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 1065586620 (1016M) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.ttl.gz’&lt;br /&gt;
dblp.ttl.gz             47%[===========&amp;gt;              ] 483.91M  43.1MB/s    eta 16s&lt;br /&gt;
dblp.ttl.gz            100%[=========================&amp;gt;]   1016M  32.7MB/s    in 29s     &lt;br /&gt;
2022-08-16 15:37:06 (35.3 MB/s) - ‘dblp.ttl.gz’ saved [1065586620/1065586620]&lt;br /&gt;
gunzip dblp.ttl.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
==== Build ====&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
date;sudo docker build --file Dockerfiles/Dockerfile.Ubuntu20.04 -t qlever .;date&lt;br /&gt;
Tue Aug 16 15:13:02 CEST 2022&lt;br /&gt;
Sending build context to Docker daemon    453MB&lt;br /&gt;
Step 1/43 : FROM ubuntu:20.04 as base&lt;br /&gt;
20.04: Pulling from library/ubuntu&lt;br /&gt;
3b65ec22a9e9: Pull complete &lt;br /&gt;
&lt;br /&gt;
Removing intermediate container 1ccc2a50364e&lt;br /&gt;
 ---&amp;gt; d0018440a4cd&lt;br /&gt;
Successfully built d0018440a4cd&lt;br /&gt;
Successfully tagged qlever:latest&lt;br /&gt;
Tue Aug 16 15:25:28 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== qlever control ===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
git clone https://github.com/ad-freiburg/qlever-controlCloning into 'qlever-control'...&lt;br /&gt;
remote: Enumerating objects: 368, done.&lt;br /&gt;
remote: Counting objects: 100% (208/208), done.&lt;br /&gt;
remote: Compressing objects: 100% (135/135), done.&lt;br /&gt;
remote: Total 368 (delta 75), reused 183 (delta 72), pack-reused 160&lt;br /&gt;
Receiving objects: 100% (368/368), 117.76 KiB | 7.36 MiB/s, done.&lt;br /&gt;
Resolving deltas: 100% (130/130), done.&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp stardog ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
docker pull stardog/stardog:latest&lt;br /&gt;
latest: Pulling from stardog/stardog&lt;br /&gt;
2d473b07cdd5: Pull complete &lt;br /&gt;
b0eac9aee9aa: Pull complete &lt;br /&gt;
8d5b89da19bc: Pull complete &lt;br /&gt;
91c2bc930138: Pull complete &lt;br /&gt;
265d7b96dd8f: Pull complete &lt;br /&gt;
Digest: sha256:7fc70e1bd3d17bdb1440f0cd810294b5318f1c53935425bb51526da4a949afc0&lt;br /&gt;
Status: Downloaded newer image for stardog/stardog:latest&lt;br /&gt;
docker.io/stardog/stardog:latest&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= DBLP Queries = &lt;br /&gt;
* All volumes known to dblp (from ceur-ws)&lt;br /&gt;
** expected 70% of 3185 volumes&lt;br /&gt;
** actual 75%&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?proceeding) as ?count) (MIN(xsd:integer(?volNumber)) as ?min)  (MAX(xsd:integer(?volNumber)) as ?max) &lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                dblp:publishedInSeriesVolume ?volNumber .&lt;br /&gt;
    }&lt;br /&gt;
LIMIT 5000&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count || min || max&lt;br /&gt;
|-&lt;br /&gt;
| 2375 || 1 || 3157&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* All papers known to dblp (from ceur-ws)&lt;br /&gt;
** expected 70% of ~50000 papers&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?paper) as ?count)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count&lt;br /&gt;
|-&lt;br /&gt;
| 44275&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* All authors and editors&lt;br /&gt;
** authors: papers &amp;lt;1:1 and &amp;gt;1:3&lt;br /&gt;
** editors: volumes 3:1&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(DISTINCT ?author) as ?numberOfAuthors) &lt;br /&gt;
       (COUNT(DISTINCT ?paper) as ?numberOfPapers) &lt;br /&gt;
       (COUNT(DISTINCT ?editor) as ?numberOfEditors)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    OPTIONAL{?proceeding dblp:editedBy ?editor}&lt;br /&gt;
    OPTIONAL{&lt;br /&gt;
        ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
        OPTIONAL{?paper dblp:authoredBy ?author}&lt;br /&gt;
    }&lt;br /&gt;
    &lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| numberOfAuthors || numberOfPapers || numberOfEditors&lt;br /&gt;
|-&lt;br /&gt;
| 69846 || 44275 || 4625&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* Cross-check against wikidata&lt;br /&gt;
** 53000 dblp authors out of 2.3m&lt;br /&gt;
** all editors MUST be in dblp acording to the rules of ceur-ws&lt;br /&gt;
** authors can be in dblp&lt;br /&gt;
* Disambiguation problem only on ceur-ws side&lt;br /&gt;
** idea: calculate distance of potential candidate authors to authors in the same volume&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=696</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=696"/>
		<updated>2022-08-17T10:16:14Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* DBLP Queries */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
wget https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
--2022-08-16 15:36:37--  https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 1065586620 (1016M) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.ttl.gz’&lt;br /&gt;
dblp.ttl.gz             47%[===========&amp;gt;              ] 483.91M  43.1MB/s    eta 16s&lt;br /&gt;
dblp.ttl.gz            100%[=========================&amp;gt;]   1016M  32.7MB/s    in 29s     &lt;br /&gt;
2022-08-16 15:37:06 (35.3 MB/s) - ‘dblp.ttl.gz’ saved [1065586620/1065586620]&lt;br /&gt;
gunzip dblp.ttl.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
==== Build ====&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
date;sudo docker build --file Dockerfiles/Dockerfile.Ubuntu20.04 -t qlever .;date&lt;br /&gt;
Tue Aug 16 15:13:02 CEST 2022&lt;br /&gt;
Sending build context to Docker daemon    453MB&lt;br /&gt;
Step 1/43 : FROM ubuntu:20.04 as base&lt;br /&gt;
20.04: Pulling from library/ubuntu&lt;br /&gt;
3b65ec22a9e9: Pull complete &lt;br /&gt;
&lt;br /&gt;
Removing intermediate container 1ccc2a50364e&lt;br /&gt;
 ---&amp;gt; d0018440a4cd&lt;br /&gt;
Successfully built d0018440a4cd&lt;br /&gt;
Successfully tagged qlever:latest&lt;br /&gt;
Tue Aug 16 15:25:28 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== qlever control ===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
git clone https://github.com/ad-freiburg/qlever-controlCloning into 'qlever-control'...&lt;br /&gt;
remote: Enumerating objects: 368, done.&lt;br /&gt;
remote: Counting objects: 100% (208/208), done.&lt;br /&gt;
remote: Compressing objects: 100% (135/135), done.&lt;br /&gt;
remote: Total 368 (delta 75), reused 183 (delta 72), pack-reused 160&lt;br /&gt;
Receiving objects: 100% (368/368), 117.76 KiB | 7.36 MiB/s, done.&lt;br /&gt;
Resolving deltas: 100% (130/130), done.&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp stardog ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
docker pull stardog/stardog:latest&lt;br /&gt;
latest: Pulling from stardog/stardog&lt;br /&gt;
2d473b07cdd5: Pull complete &lt;br /&gt;
b0eac9aee9aa: Pull complete &lt;br /&gt;
8d5b89da19bc: Pull complete &lt;br /&gt;
91c2bc930138: Pull complete &lt;br /&gt;
265d7b96dd8f: Pull complete &lt;br /&gt;
Digest: sha256:7fc70e1bd3d17bdb1440f0cd810294b5318f1c53935425bb51526da4a949afc0&lt;br /&gt;
Status: Downloaded newer image for stardog/stardog:latest&lt;br /&gt;
docker.io/stardog/stardog:latest&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= DBLP Queries = &lt;br /&gt;
* All volumes known to dblp (from ceur-ws)&lt;br /&gt;
** expected 70% of 3185 volumes&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?proceeding) as ?count) (MIN(xsd:integer(?volNumber)) as ?min)  (MAX(xsd:integer(?volNumber)) as ?max) &lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;;&lt;br /&gt;
                dblp:publishedInSeriesVolume ?volNumber .&lt;br /&gt;
    }&lt;br /&gt;
LIMIT 5000&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count || min || max&lt;br /&gt;
|-&lt;br /&gt;
| 2375 || 1 || 3157&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* All papers known to dblp (from ceur-ws)&lt;br /&gt;
** expected 70% of ~50000 papers&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(?paper) as ?count)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| count&lt;br /&gt;
|-&lt;br /&gt;
| 44275&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* All authors and editors&lt;br /&gt;
** authors: papers &amp;lt;1:1 and &amp;gt;1:3&lt;br /&gt;
** editors: volumes 3:1&lt;br /&gt;
&amp;lt;source lang=&amp;quot;sparql&amp;quot;&amp;gt;&lt;br /&gt;
SELECT (COUNT(DISTINCT ?author) as ?numberOfAuthors) &lt;br /&gt;
       (COUNT(DISTINCT ?paper) as ?numberOfPapers) &lt;br /&gt;
       (COUNT(DISTINCT ?editor) as ?numberOfEditors)&lt;br /&gt;
WHERE { &lt;br /&gt;
    ?proceeding dblp:publishedIn &amp;quot;CEUR Workshop Proceedings&amp;quot;.&lt;br /&gt;
    OPTIONAL{?proceeding dblp:editedBy ?editor}&lt;br /&gt;
    OPTIONAL{&lt;br /&gt;
        ?paper dblp:publishedAsPartOf ?proceeding.&lt;br /&gt;
        OPTIONAL{?paper dblp:authoredBy ?author}&lt;br /&gt;
    }&lt;br /&gt;
    &lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
| numberOfAuthors || numberOfPapers || numberOfEditors&lt;br /&gt;
|-&lt;br /&gt;
| 69846 || 44275 || 4625&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* Cross-check against wikidata&lt;br /&gt;
** 53000 dblp authors out of 2.3m&lt;br /&gt;
** all editors MUST be in dblp acording to the rules of ceur-ws&lt;br /&gt;
** authors can be in dblp&lt;br /&gt;
* Disambiguation problem only on ceur-ws side&lt;br /&gt;
** idea: calculate distance of potential candidate authors to authors in the same volume&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=692</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=692"/>
		<updated>2022-08-16T14:40:10Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
wget https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
--2022-08-16 15:36:37--  https://dblp.org/rdf/dblp.ttl.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 1065586620 (1016M) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.ttl.gz’&lt;br /&gt;
dblp.ttl.gz             47%[===========&amp;gt;              ] 483.91M  43.1MB/s    eta 16s&lt;br /&gt;
dblp.ttl.gz            100%[=========================&amp;gt;]   1016M  32.7MB/s    in 29s     &lt;br /&gt;
2022-08-16 15:37:06 (35.3 MB/s) - ‘dblp.ttl.gz’ saved [1065586620/1065586620]&lt;br /&gt;
gunzip dblp.ttl.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
==== Build ====&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
date;sudo docker build --file Dockerfiles/Dockerfile.Ubuntu20.04 -t qlever .;date&lt;br /&gt;
Tue Aug 16 15:13:02 CEST 2022&lt;br /&gt;
Sending build context to Docker daemon    453MB&lt;br /&gt;
Step 1/43 : FROM ubuntu:20.04 as base&lt;br /&gt;
20.04: Pulling from library/ubuntu&lt;br /&gt;
3b65ec22a9e9: Pull complete &lt;br /&gt;
&lt;br /&gt;
Removing intermediate container 1ccc2a50364e&lt;br /&gt;
 ---&amp;gt; d0018440a4cd&lt;br /&gt;
Successfully built d0018440a4cd&lt;br /&gt;
Successfully tagged qlever:latest&lt;br /&gt;
Tue Aug 16 15:25:28 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== qlever control ===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
git clone https://github.com/ad-freiburg/qlever-controlCloning into 'qlever-control'...&lt;br /&gt;
remote: Enumerating objects: 368, done.&lt;br /&gt;
remote: Counting objects: 100% (208/208), done.&lt;br /&gt;
remote: Compressing objects: 100% (135/135), done.&lt;br /&gt;
remote: Total 368 (delta 75), reused 183 (delta 72), pack-reused 160&lt;br /&gt;
Receiving objects: 100% (368/368), 117.76 KiB | 7.36 MiB/s, done.&lt;br /&gt;
Resolving deltas: 100% (130/130), done.&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp stardog ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1'&amp;gt;&lt;br /&gt;
docker pull stardog/stardog:latest&lt;br /&gt;
latest: Pulling from stardog/stardog&lt;br /&gt;
2d473b07cdd5: Pull complete &lt;br /&gt;
b0eac9aee9aa: Pull complete &lt;br /&gt;
8d5b89da19bc: Pull complete &lt;br /&gt;
91c2bc930138: Pull complete &lt;br /&gt;
265d7b96dd8f: Pull complete &lt;br /&gt;
Digest: sha256:7fc70e1bd3d17bdb1440f0cd810294b5318f1c53935425bb51526da4a949afc0&lt;br /&gt;
Status: Downloaded newer image for stardog/stardog:latest&lt;br /&gt;
docker.io/stardog/stardog:latest&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= DBLP Queries = &lt;br /&gt;
* All volumes known to dblp (from ceur-ws)&lt;br /&gt;
** expected 70% of 3185 volumes&lt;br /&gt;
* All papers known to dblp (from ceur-ws)&lt;br /&gt;
** expected 70% of ~50000 papers&lt;br /&gt;
* All authors and editors&lt;br /&gt;
** authors: papers &amp;lt;1:1 and &amp;gt;1:3&lt;br /&gt;
** editors: volumes 3:1&lt;br /&gt;
* Cross-check against wikidata&lt;br /&gt;
** 53000 dblp authors out of 2.3m&lt;br /&gt;
** all editors MUST be in dblp acording to the rules of ceur-ws&lt;br /&gt;
** authors can be in dblp&lt;br /&gt;
* Disambiguation problem only on ceur-ws side&lt;br /&gt;
** idea: calculate distance of potential candidate authors to authors in the same volume&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=685</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=685"/>
		<updated>2022-08-16T13:22:50Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===docker build (xx mins)===&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
=== dblp dump download ===&lt;br /&gt;
see https://dblp.org/rdf/&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
$ wget -P /hd/torterra/dblp2022 https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
$ cd /hd/torterra/dblp2022&lt;br /&gt;
$ gzip -d dblp.nt.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=684</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=684"/>
		<updated>2022-08-16T13:22:23Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==docker build (xx mins)==&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp dump download ==&lt;br /&gt;
see https://dblp.org/rdf/&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
$ wget -P /hd/torterra/dblp2022 https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
$ cd /hd/torterra/dblp2022&lt;br /&gt;
$ gzip -d dblp.nt.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=683</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=683"/>
		<updated>2022-08-16T13:22:00Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* = QLever code clone */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==docker build (xx mins)==&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp dump download ==&lt;br /&gt;
see https://dblp.org/rdf/&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
$ wget -P /hd/torterra/dblp2022 https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
$ cd /hd/torterra/dblp2022&lt;br /&gt;
$ gzip -d dblp.nt.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
	<entry>
		<id>http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=682</id>
		<title>Workdocumentation 2022-08-16</title>
		<link rel="alternate" type="text/html" href="http://ceur-ws.bitplan.com/index.php?title=Workdocumentation_2022-08-16&amp;diff=682"/>
		<updated>2022-08-16T13:21:29Z</updated>

		<summary type="html">&lt;p&gt;Tim Holzheim: /* Environment */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{PageSequence|prev=Workdocumentation 2022-08-15|next=Workdocumentation 2022-08-17|category=Workdocumentation}}&lt;br /&gt;
= Participants =&lt;br /&gt;
* Tim&lt;br /&gt;
* Wolfgang&lt;br /&gt;
= Agenda =&lt;br /&gt;
dblp Qlever&lt;br /&gt;
&lt;br /&gt;
= dblp Qlever =&lt;br /&gt;
* https://github.com/WolfgangFahl/pyCEURmake/issues/18&lt;br /&gt;
&lt;br /&gt;
== on RWTH Aachen DBIS i5 server ==&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,12-13'&amp;gt;&lt;br /&gt;
wf@confident:/hd/torterra/dblp2022$ wget https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
--2022-08-16 12:00:18--  https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
Resolving dblp.org (dblp.org)... 192.76.146.204&lt;br /&gt;
Connecting to dblp.org (dblp.org)|192.76.146.204|:443... connected.&lt;br /&gt;
HTTP request sent, awaiting response... 200 OK&lt;br /&gt;
Length: 2789108619 (2.6G) [application/x-gzip]&lt;br /&gt;
Saving to: ‘dblp.nt.gz’&lt;br /&gt;
&lt;br /&gt;
dblp.nt.gz           36%[======&amp;gt;             ] 980.73M  38.9MB/s    eta 45s &lt;br /&gt;
dblp.nt.gz          100%[===================&amp;gt;]   2.60G  34.1MB/s    in 76s     &lt;br /&gt;
2022-08-16 12:01:34 (35.1 MB/s) - ‘dblp.nt.gz’ saved [2789108619/2789108619]&lt;br /&gt;
gunzip dblp.nt.gz&lt;br /&gt;
ls -l&lt;br /&gt;
total 34405612&lt;br /&gt;
-rw-rw-r-- 1 wf wf 35231339037 Aug 16 00:16 dblp.nt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== QLever installation ==&lt;br /&gt;
https://wiki.bitplan.com/index.php/WikiData_Import_2022-01-29&lt;br /&gt;
=== Environment ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1,7,9'&amp;gt;&lt;br /&gt;
lsb_release -a&lt;br /&gt;
No LSB modules are available.&lt;br /&gt;
Distributor ID:	Ubuntu&lt;br /&gt;
Description:	Ubuntu 20.04.4 LTS&lt;br /&gt;
Release:	20.04&lt;br /&gt;
Codename:	focal&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ docker --version&lt;br /&gt;
Docker version 20.10.12, build 20.10.12-0ubuntu2~20.04.1&lt;br /&gt;
wf@confident:/hd/torterra/qlever$ free -h&lt;br /&gt;
              total        used        free      shared  buff/cache   available&lt;br /&gt;
Mem:           15Gi       1.2Gi        12Gi        45Mi       1.6Gi        13Gi&lt;br /&gt;
Swap:          11Gi          0B        11Gi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
===== Disk space =====&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
df | grep -v loop | grep -v tmp | grep -v udev&lt;br /&gt;
Filesystem      1K-blocks       Used Available Use% Mounted on&lt;br /&gt;
/dev/sda3       114226348   55228964  53148860  51% /&lt;br /&gt;
/dev/sdb1      3844590624 3266511864 382711544  90% /hd/torterra&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==docker build (xx mins)==&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
== dblp dump download ==&lt;br /&gt;
see https://dblp.org/rdf/&lt;br /&gt;
&amp;lt;source lang='bash'&amp;gt;&lt;br /&gt;
$ wget -P /hd/torterra/dblp2022 https://dblp.org/rdf/dblp.nt.gz&lt;br /&gt;
$ cd /hd/torterra/dblp2022&lt;br /&gt;
$ gzip -d dblp.nt.gz&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== QLever code clone ===&lt;br /&gt;
&amp;lt;source lang='bash' highlight='1-2'&amp;gt;&lt;br /&gt;
export QLEVER_HOME=$(pwd)&lt;br /&gt;
date;git clone --recursive https://github.com/ad-freiburg/qlever qlever-code;date&lt;br /&gt;
Tue Aug 16 15:10:09 CEST 2022&lt;br /&gt;
Cloning into 'qlever-code'...&lt;br /&gt;
remote: Enumerating objects: 14625, done.&lt;br /&gt;
remote: Counting objects: 100% (279/279), done.&lt;br /&gt;
remote: Compressing objects: 100% (227/227), done.&lt;br /&gt;
remote: Total 14625 (delta 133), reused 131 (delta 52), pack-reused 14346&lt;br /&gt;
Receiving objects: 100% (14625/14625), 190.60 MiB | 6.48 MiB/s, done.&lt;br /&gt;
...&lt;br /&gt;
Submodule path 'third_party/stxxl/extlib/foxxll/extlib/tlx': checked out 'ef81a598d9880cc7d242afc47de7328634f07f1d'&lt;br /&gt;
Tue Aug 16 15:10:56 CEST 2022&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;/div&gt;</summary>
		<author><name>Tim Holzheim</name></author>
	</entry>
</feed>