2023
Data of an Unusual Size: A practical guide to analysis and interactive visualization of massive datasets 1
A hands-on tutorial on the fundamentals of big data computation through a practical lens. It covers distributed computing with Dask and interactive visualization and dashboards with hvPlot and Panel, while working in the cloud with a real ~70 GB dataset.
This tutorial is co-authored by Dharhas Pothina, my colleague at Quansight. :)
- PyCon US · 19th April · Recording & Material
- SciPy US · 11th July · Material
- PyData NYC · 1st November · Recording & Material
- PyData Global · 7th December · Material
Community-first open source: An action plan! 1
A step-wise guide for creating community-driven projects, including repository management, contributor pathways, and governance principles, with real examples from our journey (at Quansight) transitioning a company-backed OSS project, Nebari, to be more community-oriented.
My colleague at Quansight, Tania Allard, helped prepare the talk and championed several of the initiatives it discusses.
- PyCon US Maintainer Summit · 21st April · Recording
- JupyterCon · 12th May · Recording
- SciPy US · 14th July · Recording
Collaboration Infrastructure In Data Science: Tools, Challenges, And Best Practices
PyLadiesCon 2023 · 2nd December
A talk sharing tools, principles, and best practices for collaboration while using PyData libraries, with a focus on infrastructure like Jupyter and conda tools, and a discussion about some collaboration-related gaps and potential solutions in our ecosystem.
PyLadiesCon is a fully online conference, and I presented this talk in the APAC track. :)
Ensuring runtime reproducibility in the Python ecosystem
PyData Global 2023 · 8th December
A talk about how to proactively think about reproducibility while working on Python projects. It discusses general best practices and dives into a tool, conda-store, built around ensuring reproducibility.
My colleague at Quansight, Jaime Rodríguez-Guerra, co-authored this talk.
PyData Global is a fully online conference, and I presented this talk in the General track.