ELC commented Jun 19, 2025

No description provided.

ELC added 8 commits June 17, 2025 22:05
Fixes #xxx

Event config:
~~~yaml
repo_dir: W:\Repositories\pyvideo-data

# Copy the event template here and adapt to the event parameters

# Only repo_dir: and events: are loaded

# =============================================================================
events:
  # - title: SciPy 2024
  #   dir: scipy-2024
  #   youtube_list:
  #     - https://www.youtube.com/playlist?list=PL1PbeFStIOoO7rDLs431H-rn0h24Wr80S
  #   related_urls:
  #     - label: Conference Website
  #       url: https://www.scipy2024.scipy.org/
  #   language: eng
  #   dates:
  #     begin: 2024-07-08
  #     end: 2024-07-14
  #     default: 2024-07-08
  #   minimal_download: false
  #   issue: xxx
  #   overwrite:
  #     # all: true # takes precedence over add_new_files and existing_files_fields
  #     add_new_files: true
  #     existing_files_fields:
  #       - duration
  #       - thumbnail_url
  #       - videos
  #       - description
  #       - language
  #       - recorded
  #       - related_urls
  #       - speakers
  #       - tags
  #       - title
  #   tags:

  - title: JupyterCon 2020
    dir: jupytercon-2020
    youtube_list:
      - https://www.youtube.com/playlist?list=PL_1BH3ug7n1KCiM-g0x9ZoWuDNhwuM1kr
    related_urls:
      - label: Conference Website
        url: https://web.archive.org/web/20201030085456/https://jupytercon.github.io/jupytercon2020-website/
    language: eng
    dates:
      begin: 2020-10-05
      end: 2020-10-17
      default: 2020-10-05
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: JupyterCon 2023
    dir: jupytercon-2023
    youtube_list:
      - https://www.youtube.com/playlist?list=PL_1BH3ug7n1Ih_Yy2TmM7MZ2zogSLZvzE
    related_urls:
      - label: Conference Website
        url: https://web.archive.org/web/20230531110007/https://www.jupytercon.com/
    language: eng
    dates:
      begin: 2023-05-10
      end: 2023-05-12
      default: 2023-05-10
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: Py4AI 2024
    dir: py4ai-2024
    youtube_list:
      - https://www.youtube.com/playlist?list=PL0RwQVm3YPu5k9iIaQUehwgh2M1DgKWaT
    related_urls:
      - label: Conference Website
        url: https://web.archive.org/web/20240511071059/https://www.py4ai.com/
    language: eng
    dates:
      begin: 2024-03-16
      end: 2024-03-16
      default: 2024-03-16
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: XtremePython 2024
    dir: xtremepython-2024
    youtube_list:
      - https://www.youtube.com/playlist?list=PL9XJvIlpSqocUf9t_YcRHmNza5WkTACvJ
    related_urls:
      - label: Conference Website
        url: https://xtremepython.dev/2024/
    language: eng
    dates:
      begin: 2024-11-19
      end: 2024-11-19
      default: 2024-11-19
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: XtremePython 2023
    dir: xtremepython-2023
    youtube_list:
      - https://www.youtube.com/playlist?list=PL9XJvIlpSqoeSy5TcB0FGG_-6q3NeQNPl
    related_urls:
      - label: Conference Website
        url: https://xtremepython.dev/2023/
    language: eng
    dates:
      begin: 2023-04-16
      end: 2023-04-16
      default: 2023-04-16
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: XtremePython 2022
    dir: xtremepython-2022
    youtube_list:
      - https://www.youtube.com/playlist?list=PL9XJvIlpSqoc4ZggVIefeet8wjoMDHhms
    related_urls:
      - label: Conference Website
        url: https://xtremepython.dev/2022/
    language: eng
    dates:
      begin: 2022-12-27
      end: 2022-12-27
      default: 2022-12-27
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: XtremePython 2021
    dir: xtremepython-2021
    youtube_list:
      - https://www.youtube.com/playlist?list=PL9XJvIlpSqoeT58uNpmwLE8RA2vkr-klo
    related_urls:
      - label: Conference Website
        url: https://xtremepython.dev/2021/
    language: eng
    dates:
      begin: 2021-11-25
      end: 2021-11-25
      default: 2021-11-25
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: Posit:Conf 2024
    dir: positconf-2024
    youtube_list:
      - https://www.youtube.com/playlist?list=PL9HYL-VRX0oSFkdF4fJeY63eGDvgofcbn
    related_urls:
      - label: Conference Website
        url: https://web.archive.org/web/20240804081019/https://posit.co/conference/
    language: eng
    dates:
      begin: 2024-08-12
      end: 2024-08-14
      default: 2024-08-12
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: Posit:Conf 2023
    dir: positconf-2023
    youtube_list:
      - https://www.youtube.com/playlist?list=PL9HYL-VRX0oRFZslRGHwHuwea7SvAATHp
    related_urls:
      - label: Conference Website
        url: https://web.archive.org/web/20230902080800/https://posit.co/conference
    language: eng
    dates:
      begin: 2023-09-17
      end: 2023-09-20
      default: 2023-09-17
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: PyTorch Day 2025
    dir: pytorchday-2025
    youtube_list:
      - https://www.youtube.com/playlist?list=PL_lsbAsL_o2DBxQBRA5SoqnTL_inXCOLU
    related_urls:
      - label: Conference Website
        url: https://events.linuxfoundation.org/pytorch-day-france/
    language: eng
    dates:
      begin: 2025-05-07
      end: 2025-05-07
      default: 2025-05-07
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: PyTorch Conference 2024
    dir: pytorchconf-2024
    youtube_list:
      - https://www.youtube.com/playlist?list=PL_lsbAsL_o2B_znuvm-pDtV_cRhpqZb8l
    related_urls:
      - label: Conference Website
        url: https://pytorch.org/event/pytorch-conference-2024/
    language: eng
    dates:
      begin: 2024-09-18
      end: 2024-09-19
      default: 2024-09-18
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: PyTorch Conference 2023
    dir: pytorchconf-2023
    youtube_list:
      - https://www.youtube.com/playlist?list=PL_lsbAsL_o2BivkGLiDfHY9VqWlaNoZ2O
    related_urls:
      - label: Conference Website
        url: https://pytorch.org/event/pytorch-conference-2023/
    language: eng
    dates:
      begin: 2023-10-16
      end: 2023-10-17
      default: 2023-10-16
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

  - title: PyTorch Conference 2019
    dir: pytorchconf-2019
    youtube_list:
      - https://www.youtube.com/playlist?list=PL_lsbAsL_o2BY-RrqVDKDcywKnuUTp-f3
    language: eng
    dates:
      begin: 2019-10-16
      end: 2019-10-17
      default: 2019-10-16
    minimal_download: false
    issue: xxx
    overwrite:
      # all: true # takes precedence over add_new_files and existing_files_fields
      add_new_files: true
      existing_files_fields:
        - duration
        - thumbnail_url
        - videos
        - description
        - language
        - recorded
        - related_urls
        - speakers
        - tags
        - title
    tags:

# ISO_639-3 language codes https://en.wikipedia.org/wiki/ISO_639-3

# languages = {
#     'ita': 'Italian',
#     'zho': 'Chinese',
#     'por': 'Portuguese',
#     'ukr': 'Ukrainian',
#     'deu': 'German',
#     'eng': 'English',
#     'rus': 'Russian',
#     'fra': 'French',
#     'spa': 'Spanish',
#     'eus': 'Basque',
#     'cat': 'Catalan',
#     'glg': 'Galician',
#     'kor': 'Korean',
#     'lit': 'Lithuanian',
#     'jpn': 'Japanese',
#     'ces': 'Czech',
#     'pol': 'Polish',
#     'heb': 'Hebrew',
#     'tha': 'Thai',
# }

~~~
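The `overwrite` block in the config above can be read as a small merge policy. The following is a minimal sketch of one plausible interpretation, not the actual pyvideo_scrape implementation: the function name `merge_video` and the exact behaviour of `all` (replace the whole record) are assumptions made for illustration. Only the precedence of `all` over `add_new_files` and `existing_files_fields` comes from the config comment.

```python
from typing import Any, Optional


def merge_video(
    existing: Optional[dict[str, Any]],
    scraped: dict[str, Any],
    overwrite: Optional[dict[str, Any]],
) -> Optional[dict[str, Any]]:
    """Return the record to write, or None when nothing should be written.

    Hypothetical helper: assumes `all` means "replace the whole record".
    """
    policy = overwrite or {}

    # `all` takes precedence over the two finer-grained options.
    if policy.get("all"):
        return dict(scraped)

    if existing is None:
        # No file on disk yet: only create it when add_new_files is set.
        return dict(scraped) if policy.get("add_new_files") else None

    # File already exists: refresh only the listed fields.
    merged = dict(existing)
    for field in policy.get("existing_files_fields") or []:
        if field in scraped:
            merged[field] = scraped[field]
    return merged
```

Under this reading, an event with `add_new_files: true` and a field list creates missing files and updates only the listed fields of files that already exist, leaving any other hand-edited fields untouched.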

Scraped with [pyvideo_scrape](https://github.com/pyvideo/pyvideo_scrape)
… for 50+ files

- Processed panel discussion, startup showcase, and regular session files
- Processed sponsored keynotes and sponsored sessions
- Processed first batch of lightning talks (5 files)
- Extracted speakers from titles and descriptions
- Removed author names and organization info from titles
- Removed title prefixes from descriptions
- Cleaned up Lightning Talk: prefixes from titles
- Extracted speakers from titles and descriptions
- Removed author names and organizations from titles
- Removed title prefixes from descriptions
- Removed Lightning Talk: and Sponsored Keynote: prefixes from titles
- Processed all regular sessions, keynotes, sponsored sessions, lightning talks
- Cleaned up panel discussions, startup showcase, and other special sessions
- Refined descriptions by removing speaker names and organization details.
- Extracted and added speaker names for several talks.
- Cleaned up titles by removing speaker names and unnecessary prefixes.
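The prefix cleanup described in the commit notes above could be scripted roughly as follows. This is a sketch under assumptions, not the procedure actually used in this PR: the helper name `strip_title_prefix` and the sample titles are hypothetical, and only the two prefixes named in the notes (`Lightning Talk:` and `Sponsored Keynote:`) are handled.

```python
import re

# Prefixes named in the commit notes; extend the tuple as needed.
PREFIXES = ("Lightning Talk", "Sponsored Keynote")

# Match one known prefix plus its colon at the start of the title.
_PREFIX_RE = re.compile(
    r"^\s*(?:" + "|".join(re.escape(p) for p in PREFIXES) + r")\s*:\s*",
    re.IGNORECASE,
)


def strip_title_prefix(title: str) -> str:
    """Remove a single leading 'Lightning Talk:' or 'Sponsored Keynote:' prefix."""
    return _PREFIX_RE.sub("", title, count=1).strip()


if __name__ == "__main__":
    # Hypothetical titles, for illustration only.
    for raw in ("Lightning Talk: Fast Inference", "Keynote: Welcome"):
        print(repr(strip_title_prefix(raw)))
```

Titles that do not start with one of the listed prefixes pass through unchanged, so the helper is safe to run over every file in an event directory.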
ELC marked this pull request as ready for review November 22, 2025 00:22
jonafato merged commit 3cee68a into pyvideo:main Nov 24, 2025
1 check passed