
- Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia’s Verifiability (2019).
- Asynchronous Training of Word Embeddings for Large Text Corpora in WSDM ’19 (2019). 168–176.
- Neural Based Statement Classification for Biased Language (2019).
- EventKG - the Hub of Event Knowledge on the Web - and Biographical Timeline Generation (2019).
- TableNet: A Knowledge Graph of Interlinked Wikipedia Tables (2019).
- DistrustRank: Spotting False News Domains in WebSci’18 (2018).
- Building and Querying Semantic Layers for Web Archives (Extended Version) (2018).
- RDF Dataset Profiling - a Survey of Features, Methods, Vocabularies and Applications (2018). 9(5) 677–705.
- A Trio Neural Model for Dynamic Entity Relatedness Ranking (2018).
- Tracking the History and Evolution of Entities: Entity-centric Temporal Analysis of Large Social Media Archives (2018).
- Posthoc Interpretability of Learning to Rank Models using Secondary Training Data (2018).
- Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD (2018).
- EventKG: A Multilingual Event-Centric Temporal Knowledge Graph in Lecture Notes in Computer Science (2018). 272–287.
- Detecting Biased Statements in Wikipedia. P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 1779–1786.
- TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (2018).
- Towards Better Understanding Researcher Strategies in Cross-Lingual Event Analytics. in Lecture Notes in Computer Science, E. Méndez, F. Crestani, C. Ribeiro, G. David, J. C. Lopes (eds.) (2018). (Vol. 11057) 139–151.
- EventKG+TL: Creating Cross-Lingual Timelines from an Event-Centric Knowledge Graph (2018). 164–169.
- Learning under Feature Drifts in Textual Streams (2018).
- Fine Grained Citation Span for References in Wikipedia (2017).
- ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections. How would you like to work with your collections? (N. Adam; R. Furuta; E. Neuhold, eds.) (2017).
- Multi-aspect Entity-Centric Analysis of Big Social Media Archives. in Lecture Notes in Computer Science, J. Kamps, G. Tsakonas, Y. Manolopoulos, L. Iliadis, I. Karydis (eds.) (2017). 261–273.
- Software citation, landing pages, and the swMATH service (2017, October).
- Universal Distant Reading through Metadata Proxies with ArchiveSpark (2017).
- Time-Aware Entity Linking (2017).
- Fine Grained Citation Span for References in Wikipedia. (2017). abs/1707.07278
- Accessing web archives from different perspectives with potential synergies (2017).
- Multi-aspect Entity-centric Analysis of Big Social Media Archives (2017). 261–273.
- Towards a Ranking Model for Semantic Layers over Digital Archives (2017). 336–337.
- Ongoing Events in Wikipedia: A Cross-lingual Case Study (2017). 387–388.
- Software as a first-class citizen in web archives (2017, May).
- Search As Research Practices on the Web: The SaR-Web Platform for Cross-language Engine Results Analysis in WebSci ’16 (2016). 367–369.
- SaR-Web - {A} Tool to Support Search as Learning Processes in {CEUR} Workshop Proceedings, J. Gwizdka, P. Hansen, C. Hauff, J. He, N. Kando (eds.) (2016). (Vol. 1647)
- How to Search the Internet Archive Without Indexing It in Lecture Notes in Computer Science, N. Fuhr, L. Kov{{á}}cs, T. Risse, W. Nejdl (eds.) (2016). (Vol. 9819) 147–160.
- Cobwebs from the Past and Present: Extracting Large Social Networks Using Internet Archive Data in SIGIR ’16 (2016). 1093–1096.
- Finding News Citations for Wikipedia in CIKM ’16 (2016). 337–346.
- History by Diversity: Helping Historians Search News Archives in CHIIR ’16 (2016). 183–192.
- Who Likes Me More?: Analysing Entity-centric Language-specific Bias in Multilingual Wikipedia in SAC ’16 (2016). 750–757.
- Who likes me more? Analysing entity-centric language-specific bias in multilingual Wikipedia (2016).
- On the Applicability of Delicious for Temporal Search on Web Archives in SIGIR ’16 (2016). 929–932.
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs. R. Perego, F. Sebastiani, J. A. Aslam, I. Ruthven, J. Zobel (eds.) (2016). 1089–1092.
- ArchiveSpark: Efficient Web Archive Access, Extraction and Derivation in JCDL ’16 (2016). 83–92.
- Temporal Information Retrieval in SIGIR ’16 (2016). 1235–1238.
- Semantic Annotation for Microblog Topics Using Wikipedia Temporal Information (2015).
- Improving Entity Retrieval on Structured Data in Lecture Notes in Computer Science (2015). (Vol. 9366) 474–491.
- Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives (2015). 153–166.
- Groupsourcing: Team Competition Designs for Crowdsourcing in WWW ’15 (2015). 906–915.
- Learning to Detect Event-Related Queries for Web Search in WWW ’15 Companion (2015). 1339–1344.
- Named Entity Evolution Recognition on the Blogosphere (2015). 15(2-4) 209–235.
- Balancing Novelty and Salience: Adaptive Learning to Rank Entities for Timeline Summarization of High-impact Events in CIKM ’15 (2015). 1201–1210.
- Herausforderungen für die nationale, regionale und thematische Webarchivierung und deren Nutzung (2015). 62(3-4) 160–171.
- Who With Whom And How?: Extracting Large Social Networks Using Search Engines in CIKM ’15 (2015). 1491–1500.
- Time-travel Translator: Automatically Contextualizing News Articles in WWW’2015 (2015).
- Named entity evolution recognition on the Blogosphere (2015). 15(2-4) 209–235.
- Learning to Detect Event-Related Queries for Web Search in TempWeb’2015 (2015).
- Mining Relevant Time for Query Subtopics in Web Archives in TempWeb’2015 (2015).
- The iCrawl Wizard – Supporting Interactive Focused Crawl Specification (2015).
- Named Entity Evolution Analysis on Wikipedia in WebSci ’14 (2014). 241–242.
- Extraction of Evolution Descriptions from the Web in JCDL ’14 (2014). 413–414.
- Proceedings of the 1st International Workshop on Dataset PROFIling & fEderated Search for Linked Data (PROFILES 2014), co-located with the 11th Extended Semantic Web Conference (ESWC 2014), Anissaras, Crete, Greece, 26 May 2014. (2014). (Vol. 1151) CEUR Workshop Proceedings.
- Hedera: Scalable Indexing and Exploring Entities in Wikipedia Revision History (2014).
- A Burstiness-aware Approach for Document Dating (2014).
- iCrawl: An integrated focused crawling toolbox for Web Science N. Brügger (ed.) (2014).
- Competitive Game Designs for Improving the Cost Effectiveness of Crowdsourcing in CIKM ’14 (2014). 1469–1478.
- What Triggers Human Remembering of Events? A Large-Scale Analysis of Catalysts for Collective Memory in Wikipedia (2014).
- Bridging Temporal Context Gaps Using Time-aware Re-contextualization in SIGIR ’14 (2014). 1127–1130.
- Analyzing Relative Incompleteness of Movie Descriptions in the Web of Data: A Case Study (2014). (Vol. Vol-1272) 197–200.
- Named Entity Evolution Analysis on Wikipedia in WebSci ’14 (2014). 241–242.
- What Do You Want to Collect from the Web? (2014).
- On the Value of Temporal Anchor Texts in Wikipedia (2014).
- Analysing the Duration of Trending Topics in Twitter using Wikipedia (2014).
- Analysing and Enriching Focused Semantic Web Archives for Parliament Applications (2014). 6(3) 433.
- Insights into Entity Name Evolution on Wikipedia B. Benatallah, A. Bestavros, Y. Manolopoulos, A. Vakali, Y. Zhang (eds.) (2014). (Vol. 8787) 47–61.
- Leveraging Dynamic Query Subtopics for Time-Aware Search Result Diversification in Lecture Notes in Computer Science, M. de Rijke, T. Kenter, A. P. de Vries, C. Zhai, F. de Jong, K. Radinsky, K. Hofmann (eds.) (2014). (Vol. 8416) 222–234.
- Bridging Temporal Context Gaps using Time-Aware Re-Contextualization (2014).
- ASTERIX: an open source system for "Big Data" management and analysis (demo) (2012). 5(12) 1898–1901.
- The History of Web Archiving (2012). 100(Special Centennial Issue) 144–1443.
- Information integration over time in unreliable and uncertain environments (2012). 789–798.
- Creating a searchable web archive (2012).
- Discovering URLs through user feedback (2011). 77–86.
- Collaborative search in electronic health records (2011). 18(3) 282–291.
- User browsing behavior-driven web crawling (2011). 87–92.
- Dremel: interactive analysis of web-scale datasets (2010). 3(1-2) 330–339.
- Annotating named entities in Twitter data with crowdsourcing (2010). 80–88.
- Do you want to take notes?: identifying research missions in Yahoo! search pad (2010). 321–330.
- A Taxonomy of Collaboration in Online Information Seeking (2009). abs/0908.0704
- Webarchivierung und Web Archive Mining: Notwendigkeit, Probleme und Lösungsansätze (M. Knoll; A. Meier, eds.) (2009). 268
- A study of link farm distribution and evolution using a time series of web snapshots (2009). 9–16.
- Finding high-quality content in social media (2008). 183–194.
- Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive Y. Zhang, G. Yu, E. Bertino, G. Xu (eds.) (2008). (Vol. 4976) 1–8.
- RankMass crawler: a crawler with high personalized pagerank coverage guarantee (2007). 375–386.
- A user reputation model for a user-interactive question answering system (2007). 19(15) 2091–2103.
- User-centric Web crawling (2005). 401–411.
- Distributed Indexing of Large-Scale Web Collections (2005). 3(1) 2–8.
- Archiving the World Wide Web (2002). 38–51.
