
- Neural Based Statement Classification for Biased Language (2019).
- Asynchronous Training of Word Embeddings for Large Text Corpora in WSDM ’19 (2019). 168–176.
- TableNet: A Knowledge Graph of Interlinked Wikipedia Tables (2019).
- EventKG - the Hub of Event Knowledge on the Web - and Biographical Timeline Generation (2019).
- Posthoc Interpretability of Learning to Rank Models using Secondary Training Data (2018).
- A Trio Neural Model for Dynamic Entity Relatedness Ranking (2018).
- Building and Querying Semantic Layers for Web Archives (Extended Version) (2018).
- DistrustRank: Spotting False News Domains in WebSci’18 (2018).
- Detecting Biased Statements in Wikipedia. P.-A. Champin, F. L. Gandon, M. Lalmas, P. G. Ipeirotis (eds.) (2018). 1779–1786.
- EventKG: A Multilingual Event-Centric Temporal Knowledge Graph in Lecture Notes in Computer Science (2018). 272–287.
- Towards Better Understanding Researcher Strategies in Cross-Lingual Event Analytics. in Lecture Notes in Computer Science, E. Méndez, F. Crestani, C. Ribeiro, G. David, J. C. Lopes (eds.) (2018). (Vol. 11057) 139–151.
- Tracking the History and Evolution of Entities: Entity-centric Temporal Analysis of Large Social Media Archives (2018).
- RDF Dataset Profiling - a Survey of Features, Methods, Vocabularies and Applications (2018). 9(5) 677–705.
- Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD (2018).
- TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (2018).
- Learning under Feature Drifts in Textual Streams (2018).
- Fine Grained Citation Span for References in Wikipedia (2017).
- Universal Distant Reading through Metadata Proxies with ArchiveSpark (2017).
- What’s new? Analysing language-specific Wikipedia entity contexts to support entity-centric news retrieval. (N. Nguyen; R. Kowalczyk; A. Pinto; J. Cardoso, eds.) (2017). 10190 210–231.
- Multi-aspect Entity-centric Analysis of Big Social Media Archives (2017). 261–273.
- Accessing web archives from different perspectives with potential synergies (2017).
- Software as a first-class citizen in web archives (2017, May).
- Software citation, landing pages, and the swMATH service (2017, October).
- Fine Grained Citation Span for References in Wikipedia. (2017). abs/1707.07278
- Time-Aware Entity Linking (2017).
- Designing Search Tasks for Archive Search in CHIIR ’17 (2017). 361–364.
- Tempas: Temporal Archive Search Based on Tags. (2017). abs/1702.01076
- Modeling Event Importance for Ranking Daily News Events in WSDM ’17 (2017). 231–240.
- Multi-aspect Entity-Centric Analysis of Big Social Media Archives. in Lecture Notes in Computer Science, J. Kamps, G. Tsakonas, Y. Manolopoulos, L. Iliadis, I. Karydis (eds.) (2017). 261–273.
- Ongoing Events in Wikipedia: A Cross-lingual Case Study (2017). 387–388.
- ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections. How would you like to work with your collections? (N. Adam; R. Furuta; E. Neuhold, eds.) (2017).
- Towards a Ranking Model for Semantic Layers over Digital Archives (2017). 336–337.
- ArchiveWeb: collaboratively extending and exploring web archive collections---How would you like to work with your collections? (2017). 1–17.
- Analyzing Web Archives Through Topic and Event Focused Sub-collections in WebSci ’16 (2016). 291–295.
- SaR-Web - {A} Tool to Support Search as Learning Processes in {CEUR} Workshop Proceedings, J. Gwizdka, P. Hansen, C. Hauff, J. He, N. Kando (eds.) (2016). (Vol. 1647)
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs (2016).
- Exploring the past of the web: alexandria & archive-it hackathon. W. Nejdl, W. Hall, P. Parigi, S. Staab (eds.) (2016). 14.
- Cobwebs from the Past and Present: Extracting Large Social Networks Using Internet Archive Data in SIGIR ’16 (2016). 1093–1096.
- Linking Mathematical Software in Web Archives G.-M. Greuel, T. Koch, P. Paule, A. Sommese (eds.) (2016). 419–422.
- ArchiveSpark: Efficient Web Archive Access, Extraction and Derivation in JCDL ’16 (2016). 83–92.
- Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs. R. Perego, F. Sebastiani, J. A. Aslam, I. Ruthven, J. Zobel (eds.) (2016). 1089–1092.
- Archiving Software Surrogates on the Web for Future Reference N. Fuhr, L. Kov{á}cs, T. Risse, W. Nejdl (eds.) (2016). 215–226.
- Semi-supervised Identification of Rarely Appearing Persons in Video by Correcting Weak Labels in ICMR ’16 (2016). 381–384.
- Finding News Citations for Wikipedia in CIKM ’16 (2016). 337–346.
- On the Applicability of Delicious for Temporal Search on Web Archives in SIGIR ’16 (2016). 929–932.
- Search As Research Practices on the Web: The SaR-Web Platform for Cross-language Engine Results Analysis in WebSci ’16 (2016). 367–369.
- Temporal Information Retrieval in SIGIR ’16 (2016). 1235–1238.
- How to Search the Internet Archive Without Indexing It in Lecture Notes in Computer Science, N. Fuhr, L. Kov{{á}}cs, T. Risse, W. Nejdl (eds.) (2016). (Vol. 9819) 147–160.
- Named Entity Evolution Recognition on the Blogosphere (2015). 15(2-4) 209–235.
- Mining Relevant Time for Query Subtopics in Web Archives in TempWeb’2015 (2015).
- Learning to Detect Event-Related Queries for Web Search in TempWeb’2015 (2015).
- Time-travel Translator: Automatically Contextualizing News Articles in WWW’2015 (2015).
- Named entity evolution recognition on the Blogosphere (2015). 15(2-4) 209–235.
- Semantic Annotation for Microblog Topics Using Wikipedia Temporal Information (2015).
- Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives (2015). 153–166.
- iCrawl: Improving the Freshness of Web Collections by Integrating Social Web and Focused Web Crawling in JCDL ’15 (2015). 75–84.
- Balancing Novelty and Salience: Adaptive Learning to Rank Entities for Timeline Summarization of High-impact Events in CIKM ’15 (2015). 1201–1210.
- The iCrawl Wizard – Supporting Interactive Focused Crawl Specification (2015).
- Improving Entity Retrieval on Structured Data in Lecture Notes in Computer Science (2015). (Vol. 9366) 474–491.
- Who With Whom And How?: Extracting Large Social Networks Using Search Engines in CIKM ’15 (2015). 1491–1500.
- Improving Entity Retrieval on Structured Data (2015).
- Extraction of evolution descriptions from the web (2014).
- Leveraging Dynamic Query Subtopics for Time-Aware Search Result Diversification in Lecture Notes in Computer Science, M. de Rijke, T. Kenter, A. P. de Vries, C. Zhai, F. de Jong, K. Radinsky, K. Hofmann (eds.) (2014). (Vol. 8416) 222–234.
- Named Entity Evolution Analysis on Wikipedia in WebSci ’14 (2014). 241–242.
- A Burstiness-aware Approach for Document Dating (2014).
- Named Entity Evolution Analysis on Wikipedia in WebSci ’14 (2014). 241–242.
- Insights into Entity Name Evolution on Wikipedia B. Benatallah, A. Bestavros, Y. Manolopoulos, A. Vakali, Y. Zhang (eds.) (2014). (Vol. 8787) 47–61.
- Competitive Game Designs for Improving the Cost Effectiveness of Crowdsourcing in CIKM ’14 (2014). 1469–1478.
- What Do You Want to Collect from the Web? (2014).
- Analysing the Duration of Trending Topics in Twitter using Wikipedia (2014).
- Hedera: Scalable Indexing and Exploring Entities in Wikipedia Revision History (2014).
- Bridging Temporal Context Gaps using Time-Aware Re-Contextualization (2014).
- Extraction of Evolution Descriptions from the Web in JCDL ’14 (2014). 413–414.
- iCrawl: An integrated focused crawling toolbox for Web Science N. Brügger (ed.) (2014).
- Proceedings of the 1st International Workshop on Dataset PROFIling & fEderated Search for Linked Data (PROFILES 2014), co-located with the 11th Extended Semantic Web Conference (ESWC 2014), Anissaras, Crete, Greece, 26 May 2014. (2014). (Vol. 1151) CEUR Workshop Proceedings.
- What Triggers Human Remembering of Events? A Large-Scale Analysis of Catalysts for Collective Memory in Wikipedia (2014).
- Information integration over time in unreliable and uncertain environments (2012). 789–798.
- The History of Web Archiving (2012). 100(Special Centennial Issue) 144–1443.
- Creating a searchable web archive (2012).
- User browsing behavior-driven web crawling (2011). 87–92.
- Discovering URLs through user feedback (2011). 77–86.
- Dremel: interactive analysis of web-scale datasets (2010). 3(1-2) 330–339.
- A study of link farm distribution and evolution using a time series of web snapshots (2009). 9–16.
- Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive Y. Zhang, G. Yu, E. Bertino, G. Xu (eds.) (2008). (Vol. 4976) 1–8.
- RankMass crawler: a crawler with high personalized pagerank coverage guarantee (2007). 375–386.
- A user reputation model for a user-interactive question answering system (2007). 19(15) 2091–2103.
- User-centric Web crawling (2005). 401–411.
- Distributed Indexing of Large-Scale Web Collections (2005). 3(1) 2–8.
- Archiving the World Wide Web (2002). 38–51.