
- Asynchronous Training of Word Embeddings for Large Text Corpora in WSDM ’19 (2019). 168–176.
- TableNet: A Knowledge Graph of Interlinked Wikipedia Tables (2019).
- Neural Based Statement Classification for Biased Language (2019).
- EventKG - the Hub of Event Knowledge on the Web - and Biographical Timeline Generation (2019).
- Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia’s Verifiability (2019).
- DistrustRank: Spotting False News Domains in WebSci’18 (2018).
- Posthoc Interpretability of Learning to Rank Models using Secondary Training Data (2018).
- Towards Better Understanding Researcher Strategies in Cross-Lingual Event Analytics. in Lecture Notes in Computer Science, E. Méndez, F. Crestani, C. Ribeiro, G. David, J. C. Lopes (eds.) (2018). (Vol. 11057) 139–151.
- Detecting Biased Statements in Wikipedia. P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 1779–1786.
- EventKG: A Multilingual Event-Centric Temporal Knowledge Graph in Lecture Notes in Computer Science (2018). 272–287.
- A Trio Neural Model for Dynamic Entity Relatedness Ranking (2018).
- Building and Querying Semantic Layers for Web Archives (Extended Version) (2018).
- Learning under Feature Drifts in Textual Streams (2018).
- RDF Dataset Profiling - a Survey of Features, Methods, Vocabularies and Applications (2018). 9(5) 677–705.
- TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (2018).
- Tracking the History and Evolution of Entities: Entity-centric Temporal Analysis of Large Social Media Archives (2018).
- Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD (2018).
- User Fairness in Recommender Systems in WWW ’18, P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 101–102.
- EventKG+TL: Creating Cross-Lingual Timelines from an Event-Centric Knowledge Graph (2018). 164–169.
- Towards a Ranking Model for Semantic Layers over Digital Archives (2017). 336–337.
- Designing Search Tasks for Archive Search in CHIIR ’17 (2017). 361–364.
- ArchiveWeb: collaboratively extending and exploring web archive collections - How would you like to work with your collections? (2017).
- Fine Grained Citation Span for References in Wikipedia (2017).
- What’s new? Analysing language-specific Wikipedia entity contexts to support entity-centric news retrieval. N. Nguyen, R. Kowalczyk, A. Pinto, J. Cardoso (eds.) (2017). (Vol. 10190) 210–231.
- Software as a first-class citizen in web archives (2017, May).
- ArchiveWeb: collaboratively extending and exploring web archive collections - How would you like to work with your collections? (2017). 1–17.
- Multi-aspect Entity-Centric Analysis of Big Social Media Archives. in Lecture Notes in Computer Science, J. Kamps, G. Tsakonas, Y. Manolopoulos, L. Iliadis, I. Karydis (eds.) (2017). 261–273.
- Building and Querying Semantic Layers for Web Archives (2017). 11–20.
- Software citation, landing pages, and the swMATH service (2017, October).
- Universal Distant Reading through Metadata Proxies with ArchiveSpark (2017).
- Ongoing Events in Wikipedia: A Cross-lingual Case Study (2017). 387–388.
- Modeling Event Importance for Ranking Daily News Events in WSDM ’17 (2017). 231–240.
- Tempas: Temporal Archive Search Based on Tags (2017). arXiv:1702.01076
- Multi-aspect Entity-centric Analysis of Big Social Media Archives (2017). 261–273.
- Time-Aware Entity Linking (2017).
- ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections. How would you like to work with your collections? N. Adam, R. Furuta, E. Neuhold (eds.) (2017).
- On Analyzing User Topic-Specific Platform Preferences Across Multiple Social Media Sites in WWW ’17 (2017). 1351–1359.
- Fine Grained Citation Span for References in Wikipedia (2017). arXiv:1707.07278
- Accessing web archives from different perspectives with potential synergies (2017).
- SaR-Web - A Tool to Support Search as Learning Processes in CEUR Workshop Proceedings, J. Gwizdka, P. Hansen, C. Hauff, J. He, N. Kando (eds.) (2016). (Vol. 1647)
- On the Applicability of Delicious for Temporal Search on Web Archives in SIGIR ’16 (2016). 929–932.
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs. R. Perego, F. Sebastiani, J. A. Aslam, I. Ruthven, J. Zobel (eds.) (2016). 1089–1092.
- Search As Research Practices on the Web: The SaR-Web Platform for Cross-language Engine Results Analysis in WebSci ’16 (2016). 367–369.
- Exploring the past of the web: alexandria & archive-it hackathon. W. Nejdl, W. Hall, P. Parigi, S. Staab (eds.) (2016). 14.
- Who Likes Me More?: Analysing Entity-centric Language-specific Bias in Multilingual Wikipedia in SAC ’16 (2016). 750–757.
- Semi-supervised Identification of Rarely Appearing Persons in Video by Correcting Weak Labels in ICMR ’16 (2016). 381–384.
- Linking Mathematical Software in Web Archives. G.-M. Greuel, T. Koch, P. Paule, A. Sommese (eds.) (2016). 419–422.
- Archiving Software Surrogates on the Web for Future Reference. N. Fuhr, L. Kovács, T. Risse, W. Nejdl (eds.) (2016). 215–226.
- ArchiveSpark: Efficient Web Archive Access, Extraction and Derivation in JCDL ’16 (2016). 83–92.
- Finding News Citations for Wikipedia in CIKM ’16 (2016). 337–346.
- Analyzing Web Archives Through Topic and Event Focused Sub-collections in WebSci ’16 (2016). 291–295.
- Cobwebs from the Past and Present: Extracting Large Social Networks Using Internet Archive Data in SIGIR ’16 (2016). 1093–1096.
- How to Search the Internet Archive Without Indexing It in Lecture Notes in Computer Science, N. Fuhr, L. Kovács, T. Risse, W. Nejdl (eds.) (2016). (Vol. 9819) 147–160.
- Temporal Information Retrieval in SIGIR ’16 (2016). 1235–1238.
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs (2016).
- Who likes me more? Analysing entity-centric language-specific bias in multilingual Wikipedia (2016).
- iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling in JCDL ’15 (2015). 75–84.
- Herausforderungen für die nationale, regionale und thematische Webarchivierung und deren Nutzung (2015). 62(3-4) 160–171.
- Learning to Detect Event-Related Queries for Web Search in WWW ’15 Companion (2015). 1339–1344.
- The iCrawl Wizard – Supporting Interactive Focused Crawl Specification (2015).
- Who With Whom And How?: Extracting Large Social Networks Using Search Engines in CIKM ’15 (2015). 1491–1500.
- Improving Entity Retrieval on Structured Data in Lecture Notes in Computer Science (2015). (Vol. 9366) 474–491.
- Balancing Novelty and Salience: Adaptive Learning to Rank Entities for Timeline Summarization of High-impact Events in CIKM ’15 (2015). 1201–1210.
- Semantic Annotation for Microblog Topics Using Wikipedia Temporal Information (2015).
- Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives (2015). 153–166.
- Time-travel Translator: Automatically Contextualizing News Articles in WWW’2015 (2015).
- Extraction of Evolution Descriptions from the Web in JCDL ’14 (2014). 413–414.
- Hedera: Scalable Indexing and Exploring Entities in Wikipedia Revision History (2014).
- Competitive Game Designs for Improving the Cost Effectiveness of Crowdsourcing in CIKM ’14 (2014). 1469–1478.
- On the Value of Temporal Anchor Texts in Wikipedia (2014).
- iCrawl: An integrated focused crawling toolbox for Web Science. N. Brügger (ed.) (2014).
- Named Entity Evolution Analysis on Wikipedia in WebSci ’14 (2014). 241–242.
- Analyzing Relative Incompleteness of Movie Descriptions in the Web of Data: A Case Study (2014). (Vol. 1272) 197–200.
- Bridging Temporal Context Gaps Using Time-aware Re-contextualization in SIGIR ’14 (2014). 1127–1130.
- Extraction of evolution descriptions from the web (2014).
- Analysing the Duration of Trending Topics in Twitter using Wikipedia (2014).
- Insights into Entity Name Evolution on Wikipedia. B. Benatallah, A. Bestavros, Y. Manolopoulos, A. Vakali, Y. Zhang (eds.) (2014). (Vol. 8787) 47–61.
- What Do You Want to Collect from the Web? (2014).
- Leveraging Dynamic Query Subtopics for Time-Aware Search Result Diversification in Lecture Notes in Computer Science, M. de Rijke, T. Kenter, A. P. de Vries, C. Zhai, F. de Jong, K. Radinsky, K. Hofmann (eds.) (2014). (Vol. 8416) 222–234.
- Bridging Temporal Context Gaps using Time-Aware Re-Contextualization (2014).
- A Burstiness-aware Approach for Document Dating (2014).
- The History of Web Archiving (2012). 100(Special Centennial Issue) 1441–1443.
- User browsing behavior-driven web crawling (2011). 87–92.
- Discovering URLs through user feedback (2011). 77–86.
- Annotating named entities in Twitter data with crowdsourcing (2010). 80–88.
- Dremel: interactive analysis of web-scale datasets (2010). 3(1-2) 330–339.
- Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive. Y. Zhang, G. Yu, E. Bertino, G. Xu (eds.) (2008). (Vol. 4976) 1–8.
- RankMass crawler: a crawler with high personalized pagerank coverage guarantee (2007). 375–386.
- A user reputation model for a user-interactive question answering system (2007). 19(15) 2091–2103.
- User-centric Web crawling (2005). 401–411.
