April 30 - May 1, 2023 @Austin, TX, USA

ProvenanceWeek will take place on April 30-May 1. Following successful previous ProvenanceWeek events, this year’s instalment will again co-locate the IPAW and TaPP workshops in conjunction with WebConf 2023. IPAW and TaPP build on a successful history of provenance workshops that bring together researchers from a wide range of computer science fields including workflows, semantic web, databases, high performance computing, distributed systems, operating systems, programming languages, and software engineering, as well as researchers from other fields, such as biology and physics that have urgent provenance needs.

Provenance is increasingly important in data science, workflow systems, and many other areas, particularly to support transparency, accountability and explanations. By providing a record of the data creation process and of dependencies between data, provenance information is essential for tracing errors in transformed data back to erroneous inputs, access control, auditing, repeatability and reproducibility, evaluating data quality, and establishing ownership of data.

Happy Birthday W3C PROV! The W3C PROV standard is 10years old. As a part of ProvenanceWeek 2023, we will be celebrating the standards creation and impact. There will be cake.

The 2023 ACM Web Conference is an in-person conference with virtual components including live streaming of ceremonies and keynotes, access to pre-recorded videos of talks, and the Whova platform for interaction with all conference attendees.

All speakers and presenters participating in any way are expected to attend the conference in person. For exceptional reasons, if you are not able to attend in person to present, you may assign a proxy who must be in person.

Workshop Program

Sunday, April 30

8:45-9:00 am: Welcome and Introduction.

9:00-10:00 am: Keynote by Vanessa Braganholo

10:00-10:30 am Break

10:30 am-12:00 pm Session 1 (Chair: TBD)

  • 10:30-11:00 am: Chenjie Li, Boris Glavic, Sudeepa Roy and Zhengjie Miao, CaJaDE: Explaining Query Results by Augmenting Provenance with Context
  • 11:00-11:30 am: Thomas Delva, Anastasia Dimou, Maxime Jakubowski and Jan Van den Bussche, Data Provenance for SHACL
  • 11:30-11:45 am: Sara Moshtaghi and Seokki Lee, Efficiently Sampling Big Provenance
  • 11:45 am-12:00 pm: Kerstin Gierend, Judith A.H. Wodke, Sascha Genehr, Robert Gött, Ron Henkel, Frank Krüger, Markus Mandalka, Lea Michaelis, Alexander Scheuerlein, Max Schröder, Atinkut Zeleke and Dagmar Waltemath, Defining standard provenance information for clinical research data and workflows - Obstacles and opportunities

12:00-1:30 pm Lunch break

1:30-2:30 pm : Keynote by Boris Glavic

2:30-3:00 pm Town Hall

3:00-3:30 pm Break

3:30-4:30 pm Posters and Demo session

  • All workshop papers

Monday, May 1

9:00-10:00 am: Keynote by Sudeepa Roy

10:00-10:30 am Break

10:30 am-12:00 pm Session 2 (Chair: TBD)

  • 10:30-11:00 am: Adriane Chapman, Paolo Missier, Giulia Simonelli and Riccardo Torlone, Capturing and Querying Fine-grained Provenance of Preprocessing Pipelines in Data Science
  • 11:00-11:30 am: Stefan Grafberger, Paul Groth, Julia Stoyanovich and Sebastian Schelter, Data Distribution Debugging in Machine Learning Pipelines
  • 11:30-11:45 am: Seokki Lee, Boris Glavic, Adriane Chapman and Bertram Ludäscher, Hybrid Query and Instance Explanations and Repairs
  • 11:45 am-12:00 pm: Aniket Modi, Moaz Reyad, Tanu Malik and Ashish Gehani, Querying Container Provenance

12:00-1:30 pm Lunch break

1:30-3:00 pm Session 3 (Chair: TBD)

  • Luc Moreau, Nicola Hogan and Nick O'Donnell, Implementing an Environmental Management System Using Provenance-By-Design
  • Débora Pina, Adriane Chapman, Daniel de Oliveira and Marta Mattoso, Deep Learning Provenance Data Integration: a Practical Approach
  • Nikolaus Nova Parulian and Bertram Ludäscher, Trust the Process: Analyzing Prospective Provenance for Data Cleaning
  • Tanja Auge, Gunnar Bali, Meike Klettke, Bertram Ludäscher, Wolfgang Söldner, Simon Weishäupl and Tilo Wettig, Provenance for Lattice QCD workflows

3:00-3:30 pm Break

3:30-4:30 pm : W3C PROV 10 years celebration Panel

  • Moderator: Paul Groth
  • Panelists: Bryon Jacob (, Deborah McGuinness (Rensselaer Polytechnic Institute), Paolo Missier (Newcastle University), Luc Moreau (King's College London)



  • Yuval Moskovich


  • Daniel Deutch


  • Paul Groth

TaPP SC Chair

  • Adriane Chapman

Program Committee

  • Khalid Belhajjame, PSL, Université Paris-Dauphine, LAMSADE
  • Ghita Berrada, King's College London
  • Elisa Bertino, Purdue University
  • Pierre Bourhis, CNRS CRIStAL
  • Vanessa Braganholo, UFF
  • Vasa Curcin, King's College London
  • Daniel de Oliveira, Fluminense Federal University
  • David Eyers, University of Otago
  • Irini Fundulaki, ICS-FORTH
  • Amir Gilad, Duke University
  • Boris Glavic, Illinois Institute of Technology
  • Paul Groth, University of Amsterdam
  • Xueyuan Han, Wake Forest University
  • Melanie Herschel, University of Stuttgart
  • Matteo Interlandi, Microsoft
  • David Koop, Northern Illinois University
  • Seokki Lee, University of Cincinnati
  • Bertram Ludäscher, University of Illinois at Urbana-Champaign
  • Tanu Malik, DePaul University
  • Marta Mattoso, COPPE- Federal Univ. Rio de Janeiro
  • Paolo Missier, Newcastle University
  • Tope Omitola, University of Southampton
  • Liat Peterfreund, CNRS
  • Sudeepa Roy, Duke University
  • Pierre Senellart, DI, École normale supérieure, PSL Research University
  • Roee Shraga, Northeastern University
  • Gianmaria Silvello, University of Padua
  • Xiao Yu, Stellar Cyber

Call for Papers

We invite innovative and creative contributions, including papers outlining new formal approaches to provenance, innovative use of provenance, experience-based insights, and visionary ideas.

Topics of interest include, but are not limited to:

  • Provenance visualization, and human interaction with provenance
  • Provenance for big data and extreme computing
  • Provenance for attribution and trust
  • Provenance for transparency and accountability
  • Security and privacy implications of provenance
  • Provenance, social media, and the semantic web
  • Provenance analytics, discovery, and reasoning about provenance and its quality
  • Data sharing and data citation
  • Provenance of workflows and annotations
  • Standardization of provenance models, services, and representations
  • Provenance management system prototypes and commercial solutions
  • Applications of provenance in real-life settings
  • Theoretical foundations of provenance
  • Connections between provenance and established topics in other research fields (programming languages, security, software engineering, fairness, etc.)
  • Provenance-based audit and forensics
  • Design, performance and scalability of provenance systems

Location and Dates:

ProvenanceWeek will be held in conjunction with WebConf 2023 in Austin Texas on Sunday, April 30, and Monday, May 1, 2023.

Important Dates:

  • Papers Due: February 6, 2023
  • Notification of Acceptance: March 6, 2023
  • Camera-ready version due: March 15, 2023

All deadlines are end-of-day in the Anywhere on Earth (AoE) time zone.

Submission Site:

Submission is via easychair, using the following link:

Formatting the submissions:

Papers must be submitted in PDF format according to the ACM template published in the ACM guidelines, selecting the generic “sigconf” sample. The PDF files must have all non-standard fonts embedded. Workshop papers must be self-contained and in English. Papers should not exceed 12 pages in length (maximum 8 pages for the main paper content + maximum 2 pages for appendixes + maximum 2 pages for references).

Authors should indicate the track in the title (IPAW, TAPP, DEMO, POSTER or BEST)


Papers can be submitted to one of the following tracks:

  • IPAW:
    Authors are invited to submit original research work to the IPAW track. This track solicits full research papers that describe mature, high-quality research on the topics of interest of the ProvenanceWeek. The title of submissions to IPAW must begin with "IPAW:".

  • TaPP:
    We invite innovative and creative contributions, including papers outlining new challenges for provenance research, promising formal approaches to provenance, experiments, and visionary (and possibly risky) ideas. The title of submissions to TaPP must begin with "TAPP:".

    In addition to regular research papers, we also encourage submissions of thefollowing flavors:

    • Short papers (up to 4 pages, including references) that would typically include work in progress or visionary ideas that are not yet fully developed, but have the potential to lead to cutting-edge research.
    • Application papers (up to 6 pages, including references) focusing on innovative applications and uses of provenance.
  • “Best of the Rest”
    As ProvenanceWeek is meant to bring together the provenance community, which is now established and publishing in conferences and journals across computational domains, the Theory and Practice of Provenance (TaPP) is forming a “Best of the Rest” category. If you have had a provenance-paper accepted to a conference or journal in 2022 thru February 2023, we invite you to submit the abstract to TaPP. We will invite the “Best of the Rest” to present their work at TaPP to widen the conversation.

    Submit the abstract and original publication venue and date of the previously published work. The title of the abstract must begin with "BEST:".

We also encourage the presentation of ongoing work as posters or demonstrations. Proposals for posters or demonstrations should be formatted and submitted as described above, with the following additional restrictions:

  • Demonstrations:
    Using no more than 4 pages, describe the context and highlights of the proposed demonstration, including a brief description of the demonstration scenario. The title of the proposal must begin with "DEMO:".
  • Posters:
    Submit a 1-page abstract of the poster. The title of the abstract must begin with "POSTER:".