
- Asynchronous Training of Word Embeddings for Large Text Corpora in WSDM ’19 (2019). 168–176.
- Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia’s Verifiability (2019).
- TableNet: A Knowledge Graph of Interlinked Wikipedia Tables (2019).
- Neural Based Statement Classification for Biased Language (2019).
- EventKG - the Hub of Event Knowledge on the Web - and Biographical Timeline Generation (2019).
- TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (2018).
- EventKG: A Multilingual Event-Centric Temporal Knowledge Graph in Lecture Notes in Computer Science (2018). 272–287.
- User Fairness in Recommender Systems in WWW ’18, P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 101–102.
- EventKG+TL: Creating Cross-Lingual Timelines from an Event-Centric Knowledge Graph (2018). 164–169.
- Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD (2018).
- Tracking the History and Evolution of Entities: Entity-centric Temporal Analysis of Large Social Media Archives (2018).
- A Trio Neural Model for Dynamic Entity Relatedness Ranking (2018).
- Posthoc Interpretability of Learning to Rank Models using Secondary Training Data (2018).
- Building and Querying Semantic Layers for Web Archives (Extended Version) (2018).
- Detecting Biased Statements in Wikipedia. P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 1779–1786.
- DistrustRank: Spotting False News Domains in WebSci’18 (2018).
- Learning under Feature Drifts in Textual Streams (2018).
- RDF Dataset Profiling - a Survey of Features, Methods, Vocabularies and Applications (2018). 9(5) 677–705.
- Ongoing Events in Wikipedia: A Cross-lingual Case Study (2017). 387–388.
- ArchiveWeb: collaboratively extending and exploring web archive collections - How would you like to work with your collections? (2017).
- Fine Grained Citation Span for References in Wikipedia. (2017). abs/1707.07278
- Software as a first-class citizen in web archives (2017, May).
- Multi-aspect Entity-Centric Analysis of Big Social Media Archives. in Lecture Notes in Computer Science, J. Kamps, G. Tsakonas, Y. Manolopoulos, L. Iliadis, I. Karydis (eds.) (2017). 261–273.
- Towards a Ranking Model for Semantic Layers over Digital Archives (2017). 336–337.
- Fine Grained Citation Span for References in Wikipedia (2017).
- Time-Aware Entity Linking (2017).
- ArchiveWeb: collaboratively extending and exploring web archive collections - How would you like to work with your collections? (2017). 1–17.
- Universal Distant Reading through Metadata Proxies with ArchiveSpark (2017).
- Multi-aspect Entity-centric Analysis of Big Social Media Archives (2017). 261–273.
- Accessing web archives from different perspectives with potential synergies (2017).
- What’s new? Analysing language-specific Wikipedia entity contexts to support entity-centric news retrieval. N. Nguyen, R. Kowalczyk, A. Pinto, J. Cardoso (eds.) (2017). (Vol. 10190) 210–231.
- Tempas: Temporal Archive Search Based on Tags. (2017). abs/1702.01076
- Designing Search Tasks for Archive Search in CHIIR ’17 (2017). 361–364.
- Modeling Event Importance for Ranking Daily News Events in WSDM ’17 (2017). 231–240.
- ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections. How would you like to work with your collections? N. Adam, R. Furuta, E. Neuhold (eds.) (2017).
- Software citation, landing pages, and the swMATH service (2017, October).
- Semi-supervised Identification of Rarely Appearing Persons in Video by Correcting Weak Labels in ICMR ’16 (2016). 381–384.
- Analyzing Web Archives Through Topic and Event Focused Sub-collections in WebSci ’16 (2016). 291–295.
- Archiving Software Surrogates on the Web for Future Reference. N. Fuhr, L. Kovács, T. Risse, W. Nejdl (eds.) (2016). 215–226.
- Linking Mathematical Software in Web Archives. G.-M. Greuel, T. Koch, P. Paule, A. Sommese (eds.) (2016). 419–422.
- Search As Research Practices on the Web: The SaR-Web Platform for Cross-language Engine Results Analysis in WebSci ’16 (2016). 367–369.
- Cobwebs from the Past and Present: Extracting Large Social Networks Using Internet Archive Data in SIGIR ’16 (2016). 1093–1096.
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs. R. Perego, F. Sebastiani, J. A. Aslam, I. Ruthven, J. Zobel (eds.) (2016). 1089–1092.
- SaR-Web - A Tool to Support Search as Learning Processes in CEUR Workshop Proceedings, J. Gwizdka, P. Hansen, C. Hauff, J. He, N. Kando (eds.) (2016). (Vol. 1647)
- Who Likes Me More?: Analysing Entity-centric Language-specific Bias in Multilingual Wikipedia in SAC ’16 (2016). 750–757.
- Exploring the past of the web: alexandria & archive-it hackathon. W. Nejdl, W. Hall, P. Parigi, S. Staab (eds.) (2016). 14.
- History by Diversity: Helping Historians Search News Archives in CHIIR ’16 (2016). 183–192.
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs (2016).
- Finding News Citations for Wikipedia in CIKM ’16 (2016). 337–346.
- On the Applicability of Delicious for Temporal Search on Web Archives in SIGIR ’16 (2016). 929–932.
- ArchiveSpark: Efficient Web Archive Access, Extraction and Derivation in JCDL ’16 (2016). 83–92.
- How to Search the Internet Archive Without Indexing It in Lecture Notes in Computer Science, N. Fuhr, L. Kovács, T. Risse, W. Nejdl (eds.) (2016). (Vol. 9819) 147–160.
- Temporal Information Retrieval in SIGIR ’16 (2016). 1235–1238.
- Named Entity Evolution Recognition on the Blogosphere (2015). 15(2-4) 209–235.
- Herausforderungen für die nationale, regionale und thematische Webarchivierung und deren Nutzung (2015). 62(3-4) 160–171.
- iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling in JCDL ’15 (2015). 75–84.
- Learning to Detect Event-Related Queries for Web Search in WWW ’15 Companion (2015). 1339–1344.
- Groupsourcing: Team Competition Designs for Crowdsourcing in WWW ’15 (2015). 906–915.
- Semantic Annotation for Microblog Topics Using Wikipedia Temporal Information (2015).
- Improving Entity Retrieval on Structured Data in Lecture Notes in Computer Science (2015). (Vol. 9366) 474–491.
- Balancing Novelty and Salience: Adaptive Learning to Rank Entities for Timeline Summarization of High-impact Events in CIKM ’15 (2015). 1201–1210.
- Who With Whom And How?: Extracting Large Social Networks Using Search Engines in CIKM ’15 (2015). 1491–1500.
- Time-travel Translator: Automatically Contextualizing News Articles in WWW’2015 (2015).
- Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives (2015). 153–166.
- The iCrawl Wizard – Supporting Interactive Focused Crawl Specification (2015).
- Extraction of evolution descriptions from the web (2014).
- A Burstiness-aware Approach for Document Dating (2014).
- iCrawl: An integrated focused crawling toolbox for Web Science. N. Brügger (ed.) (2014).
- Hedera: Scalable Indexing and Exploring Entities in Wikipedia Revision History (2014).
- Insights into Entity Name Evolution on Wikipedia. B. Benatallah, A. Bestavros, Y. Manolopoulos, A. Vakali, Y. Zhang (eds.) (2014). (Vol. 8787) 47–61.
- What Do You Want to Collect from the Web? (2014).
- Analysing and Enriching Focused Semantic Web Archives for Parliament Applications (2014). 6(3) 433.
- Analysing the Duration of Trending Topics in Twitter using Wikipedia (2014).
- Bridging Temporal Context Gaps Using Time-aware Re-contextualization in SIGIR ’14 (2014). 1127–1130.
- Bridging Temporal Context Gaps using Time-Aware Re-Contextualization (2014).
- On the Value of Temporal Anchor Texts in Wikipedia (2014).
- Named Entity Evolution Analysis on Wikipedia in WebSci ’14 (2014). 241–242.
- Leveraging Dynamic Query Subtopics for Time-Aware Search Result Diversification in Lecture Notes in Computer Science, M. de Rijke, T. Kenter, A. P. de Vries, C. Zhai, F. de Jong, K. Radinsky, K. Hofmann (eds.) (2014). (Vol. 8416) 222–234.
- Extraction of Evolution Descriptions from the Web in JCDL ’14 (2014). 413–414.
- What Triggers Human Remembering of Events? A Large-Scale Analysis of Catalysts for Collective Memory in Wikipedia (2014).
- Competitive Game Designs for Improving the Cost Effectiveness of Crowdsourcing in CIKM ’14 (2014). 1469–1478.
- The History of Web Archiving (2012). 100(Special Centennial Issue) 1441–1443.
- Discovering URLs through user feedback (2011). 77–86.
- Dremel: interactive analysis of web-scale datasets (2010). 3(1-2) 330–339.
- Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive. Y. Zhang, G. Yu, E. Bertino, G. Xu (eds.) (2008). (Vol. 4976) 1–8.
- A user reputation model for a user-interactive question answering system (2007). 19(15) 2091–2103.
- User-centric Web crawling (2005). 401–411.