Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions pydata-vermont-2024/category.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
{
"title": "PyData Vermont 2024"
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nBacktesting refers to the process of evaluating a time series forecasting algorithm on historical data by replicating the corresponding real-world scenario. Concurrently, parameters such as the model updating and retraining frequencies are also tuned based on the usecase and relevant computational constraints.\n\nIn this talk, we will review the backtesting of time series algorithms using sktime and skforecast. The two open-source machine learning libraries are popular options for developing and deploying forecasting models. Specifically, the following aspects will be covered.\n\n\u2022 Comparing and contrasting backtesting-related features of the two libraries\n\n\u2022 An overview of the different types of cross-validation schemes for time series forecasting, including expanding and fixed windows\n\n\u2022 Model update and retraining for both direct and recursive multistep forecasts\n\nThe talk is geared toward data scientists that want to systematically evaluate time series forecasting models in varied settings. In addition to gaining an overview of the various aspects, the audience will also learn about the implementation options supported by the two libraries. No prior knowledge about machine learning algorithms for forecasting is needed to attend the talk.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 1836,
"language": "eng",
"recorded": "2024-07-29",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/vermont2024/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/7NXCdfzr5d8/maxresdefault.jpg",
"title": "Abhishek Murthy - Backtesting Time Series Forecasting Algorithms in SKTime and SKForecast",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=7NXCdfzr5d8"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nAlthough tools that address rescaling spatial and temporal datasets separately do exist in Python and R, there is currently a gap in this space for users in the life and social sciences who are not computer programmers. Here we focus on developing improved tools to assist users in incorporating spatiotemporal datasets into their research programs through a new piece of software called Spacetime.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 3804,
"language": "eng",
"recorded": "2024-07-29",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/vermont2024/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/4RBAFtaQfcI/maxresdefault.jpg",
"title": "Alex Burnham - Keynote: Lowering the barrier of entry to spatiotemporal data analysis",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=4RBAFtaQfcI"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nImagine what you could do if you could tune Large Language Models (LLMs) with contributions from across your community. There are many popular open LLMs, just look at Hugging Face. Up until now, if a developer wanted to contribute a change to an open LLM, the only practical option was to publish a fork of the model which results in many forks but no version of the model having all or many of the community\u2019s changes. In this session we learn about a new, novel technique using the InstructLab open source project to enable incremental, community contributions to instruction-tune a LLM.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 1653,
"language": "eng",
"recorded": "2024-07-29",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/vermont2024/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/l-Nq2b8Y-mA/maxresdefault.jpg",
"title": "BJ Hargrave- Open Source Community Instruction-tuning of Large Language Models | PyData Vermont 2024",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=l-Nq2b8Y-mA"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nThis talk provides an overview to Bayesian workflows in the JAX ecosystem, and is aimed at both newcomers and experienced practitioners. We will look at the Bayesian workflow, from model specification and prior selection, to choices for inference like MCMC, optimization, and VI, to posterior analysis and validation. We will discuss practical ways to use popular libraries in building modular, efficient workflows.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 1873,
"language": "eng",
"recorded": "2024-07-29",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/vermont2024/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/b0_D0BxPYLo/maxresdefault.jpg",
"title": "Colin Carroll - The state of Bayesian workflows in JAX | PyData Vermont 2024",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=b0_D0BxPYLo"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nIn this talk, we'll explore effective strategies for scaling pandas workloads using PySpark. We'll delve into techniques such as the Pandas API on Spark, Python UDFs, Pandas UDFs, and Pandas Function APIs. In addition, this talk covers how to manage dependencies and environment setup seamlessly when transitioning to distributed PySpark cluster, providing insights into optimizing performance and leveraging PySpark features for seamless integration with pandas workflows.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 1380,
"language": "eng",
"recorded": "2024-07-29",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/vermont2024/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/Ff78Y6FXIkw/maxresdefault.jpg",
"title": "Hyukjin Kwon - Demystifying pandas with PySpark when scaling out | PyData Vermont 2024",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=Ff78Y6FXIkw"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nLearn about how BETA Technologies leverages Python and AWS to serve trillions of data points to hundreds of engineers on a day-to-day basis.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 1525,
"language": "eng",
"recorded": "2024-07-29",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/vermont2024/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/sSxIRT76DtQ/maxresdefault.jpg",
"title": "Jayce @ BETA - Big Data Engineering With Python and AWS | PyData Vermont 2024",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=sSxIRT76DtQ"
}
]
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
{
"description": "www.pydata.org\n\nData, code, software, people, papers, patents, funders, organizations, institutions, consortiums, models, metrics, applications... all of it is critical in a reality where open science practices are not only the status quo, but encouraged, supported, or even incentivized.\n\nJoin in this exploration of the open movement, from open-source to open funding to open data networks. Learn from the experience of the Open Source Science Initiative in their endeavor to map the entirety of the digital knowledge and research ecosystems. Join the conversation by mapping your corner of the digital realm -- the tools you use, the people with whom you interact, and the institutions, organizations, or companies that support you.\n\nPyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. \n\nPyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.\n\n00:00 Welcome!\n00:10 Help us add time stamps or captions to this video! See the description for details.\n\nWant to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps",
"duration": 1802,
"language": "eng",
"recorded": "2024-07-29",
"related_urls": [
{
"label": "Conference Website",
"url": "https://pydata.org/vermont2024/"
},
{
"label": "https://github.com/numfocus/YouTubeVideoTimestamps",
"url": "https://github.com/numfocus/YouTubeVideoTimestamps"
}
],
"speakers": [
"TODO"
],
"tags": [
"Education",
"Julia",
"NumFOCUS",
"Opensource",
"PyData",
"Python",
"Tutorial",
"coding",
"how to program",
"learn",
"learn to code",
"python 3",
"scientific programming",
"software"
],
"thumbnail_url": "https://i.ytimg.com/vi/7c51njj9JPs/maxresdefault.jpg",
"title": "Jonathan Starr - Mapping the Open Science Ecosystem | PyData Vermont 2024",
"videos": [
{
"type": "youtube",
"url": "https://www.youtube.com/watch?v=7c51njj9JPs"
}
]
}
Loading