
- TableNet: A Knowledge Graph of Interlinked Wikipedia Tables (2019).
- EventKG - the Hub of Event Knowledge on the Web - and Biographical Timeline Generation (2019).
- Neural Based Statement Classification for Biased Language (2019).
- RDF Dataset Profiling - a Survey of Features, Methods, Vocabularies and Applications (2018). 9(5) 677–705.
- Learning under Feature Drifts in Textual Streams (2018).
- Detecting Biased Statements in Wikipedia. P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 1779–1786.
- Posthoc Interpretability of Learning to Rank Models using Secondary Training Data (2018).
- Building and Querying Semantic Layers for Web Archives (Extended Version) (2018).
- Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD (2018).
- Tracking the History and Evolution of Entities: Entity-centric Temporal Analysis of Large Social Media Archives (2018).
- Towards Better Understanding Researcher Strategies in Cross-Lingual Event Analytics in Lecture Notes in Computer Science, E. Méndez, F. Crestani, C. Ribeiro, G. David, J. C. Lopes (eds.) (2018). (Vol. 11057) 139–151.
- A Trio Neural Model for Dynamic Entity Relatedness Ranking (2018).
- EventKG+TL: Creating Cross-Lingual Timelines from an Event-Centric Knowledge Graph (2018). 164–169.
- EventKG: A Multilingual Event-Centric Temporal Knowledge Graph in Lecture Notes in Computer Science (2018). 272–287.
- User Fairness in Recommender Systems in WWW ’18, P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 101–102.
- Building and Querying Semantic Layers for Web Archives (2017). 11–20.
- Software as a first-class citizen in web archives (2017, May).
- Software citation, landing pages, and the swMATH service (2017, October).
- Multi-aspect Entity-Centric Analysis of Big Social Media Archives in Lecture Notes in Computer Science, J. Kamps, G. Tsakonas, Y. Manolopoulos, L. Iliadis, I. Karydis (eds.) (2017). 261–273.
- Ongoing Events in Wikipedia: A Cross-lingual Case Study (2017). 387–388.
- ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections. How would you like to work with your collections? N. Adam, R. Furuta, E. Neuhold (eds.) (2017).
- Tempas: Temporal Archive Search Based on Tags (2017). abs/1702.01076
- Universal Distant Reading through Metadata Proxies with ArchiveSpark (2017).
- Towards a Ranking Model for Semantic Layers over Digital Archives (2017). 336–337.
- Fine Grained Citation Span for References in Wikipedia (2017).
- ArchiveWeb: collaboratively extending and exploring web archive collections. How would you like to work with your collections? (2017). 1–17.
- Modeling Event Importance for Ranking Daily News Events in WSDM ’17 (2017). 231–240.
- Search As Research Practices on the Web: The SaR-Web Platform for Cross-language Engine Results Analysis in WebSci ’16 (2016). 367–369.
- SaR-Web - A Tool to Support Search as Learning Processes in CEUR Workshop Proceedings, J. Gwizdka, P. Hansen, C. Hauff, J. He, N. Kando (eds.) (2016). (Vol. 1647)
- On the Applicability of Delicious for Temporal Search on Web Archives in SIGIR ’16 (2016). 929–932.
- How to Search the Internet Archive Without Indexing It in Lecture Notes in Computer Science, N. Fuhr, L. Kovács, T. Risse, W. Nejdl (eds.) (2016). (Vol. 9819) 147–160.
- Analyzing Web Archives Through Topic and Event Focused Sub-collections in WebSci ’16 (2016). 291–295.
- Archiving Software Surrogates on the Web for Future Reference. N. Fuhr, L. Kovács, T. Risse, W. Nejdl (eds.) (2016). 215–226.
- Who likes me more? Analysing entity-centric language-specific bias in multilingual Wikipedia (2016).
- Temporal Information Retrieval in SIGIR ’16 (2016). 1235–1238.
- History by Diversity: Helping Historians Search News Archives in CHIIR ’16 (2016). 183–192.
- ArchiveSpark: Efficient Web Archive Access, Extraction and Derivation in JCDL ’16 (2016). 83–92.
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs. R. Perego, F. Sebastiani, J. A. Aslam, I. Ruthven, J. Zobel (eds.) (2016). 1089–1092.
- Finding News Citations for Wikipedia in CIKM ’16 (2016). 337–346.
- Semi-supervised Identification of Rarely Appearing Persons in Video by Correcting Weak Labels in ICMR ’16 (2016). 381–384.
- Cobwebs from the Past and Present: Extracting Large Social Networks Using Internet Archive Data in SIGIR ’16 (2016). 1093–1096.
- iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling in JCDL ’15 (2015). 75–84.
- Improving Entity Retrieval on Structured Data (2015).
- Named Entity Evolution Recognition on the Blogosphere (2015). 15(2-4) 209–235.
- Herausforderungen für die nationale, regionale und thematische Webarchivierung und deren Nutzung (2015). 62(3-4) 160–171.
- Mining Relevant Time for Query Subtopics in Web Archives in TempWeb’2015 (2015).
- Learning to Detect Event-Related Queries for Web Search in TempWeb’2015 (2015).
- Semantic Annotation for Microblog Topics Using Wikipedia Temporal Information (2015).
- Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives (2015). 153–166.
- Improving Entity Retrieval on Structured Data in Lecture Notes in Computer Science (2015). (Vol. 9366) 474–491.
- Balancing Novelty and Salience: Adaptive Learning to Rank Entities for Timeline Summarization of High-impact Events in CIKM ’15 (2015). 1201–1210.
- The iCrawl Wizard – Supporting Interactive Focused Crawl Specification (2015).
- Time-travel Translator: Automatically Contextualizing News Articles in WWW’2015 (2015).
- Who With Whom And How?: Extracting Large Social Networks Using Search Engines in CIKM ’15 (2015). 1491–1500.
- Bridging Temporal Context Gaps using Time-Aware Re-Contextualization (2014).
- Insights into Entity Name Evolution on Wikipedia. B. Benatallah, A. Bestavros, Y. Manolopoulos, A. Vakali, Y. Zhang (eds.) (2014). (Vol. 8787) 47–61.
- Extraction of evolution descriptions from the web (2014).
- Hedera: Scalable Indexing and Exploring Entities in Wikipedia Revision History (2014).
- Analyzing Relative Incompleteness of Movie Descriptions in the Web of Data: A Case Study (2014). (Vol. 1272) 197–200.
- Named Entity Evolution Analysis on Wikipedia in WebSci ’14 (2014). 241–242.
- What Do You Want to Collect from the Web? (2014).
- A Burstiness-aware Approach for Document Dating (2014).
- iCrawl: An integrated focused crawling toolbox for Web Science. N. Brügger (ed.) (2014).
- Analysing the Duration of Trending Topics in Twitter using Wikipedia (2014).
- Leveraging Dynamic Query Subtopics for Time-Aware Search Result Diversification in Lecture Notes in Computer Science, M. de Rijke, T. Kenter, A. P. de Vries, C. Zhai, F. de Jong, K. Radinsky, K. Hofmann (eds.) (2014). (Vol. 8416) 222–234.
- Competitive Game Designs for Improving the Cost Effectiveness of Crowdsourcing in CIKM ’14 (2014). 1469–1478.
- Extraction of Evolution Descriptions from the Web in JCDL ’14 (2014). 413–414.
- Proceedings of the 1st International Workshop on Dataset PROFIling & fEderated Search for Linked Data (PROFILES 2014), co-located with the 11th Extended Semantic Web Conference (ESWC 2014), Anissaras, Crete, Greece, 26 May 2014. (2014). (Vol. 1151) CEUR Workshop Proceedings.
- What Triggers Human Remembering of Events? A Large-Scale Analysis of Catalysts for Collective Memory in Wikipedia (2014).
- ASTERIX: an open source system for "Big Data" management and analysis (demo) (2012). 5(12) 1898–1901.
- The History of Web Archiving (2012). 100(Special Centennial Issue) 1441–1443.
- Creating a searchable web archive (2012).
- Information integration over time in unreliable and uncertain environments (2012). 789–798.
- Discovering URLs through user feedback (2011). 77–86.
- User browsing behavior-driven web crawling (2011). 87–92.
- Collaborative search in electronic health records (2011). 18(3) 282–291.
- Dremel: interactive analysis of web-scale datasets (2010). 3(1-2) 330–339.
- Annotating named entities in Twitter data with crowdsourcing (2010). 80–88.
- Do you want to take notes?: identifying research missions in Yahoo! search pad (2010). 321–330.
- Webarchivierung und Web Archive Mining: Notwendigkeit, Probleme und Lösungsansätze. M. Knoll, A. Meier (eds.) (2009). 268
- A study of link farm distribution and evolution using a time series of web snapshots (2009). 9–16.
- A Taxonomy of Collaboration in Online Information Seeking (2009). abs/0908.0704
- Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive. Y. Zhang, G. Yu, E. Bertino, G. Xu (eds.) (2008). (Vol. 4976) 1–8.
- Finding high-quality content in social media (2008). 183–194.
- A user reputation model for a user-interactive question answering system (2007). 19(15) 2091–2103.
- RankMass crawler: a crawler with high personalized pagerank coverage guarantee (2007). 375–386.
- Distributed Indexing of Large-Scale Web Collections (2005). 3(1) 2–8.
- User-centric Web crawling (2005). 401–411.
- Archiving the World Wide Web (2002). 38–51.
