NewPipe's core library for extracting data from streaming sites

bandcamp crawler extractor mediaccc newpipe peertube scraper soundcloud youtube
4 Open Issues Need Help Last updated: Jul 6, 2025

Open Issues Need Help

View All on GitHub

AI Summary: Refactor the NewPipe Extractor library's link handling mechanism to improve its simplicity, debuggability, and maintainability. This involves creating a new `Link` interface with methods for retrieving extractors and unique identifiers, and implementing corresponding `Link` classes for each supported site. The goal is to replace the existing complex and inefficient link handler and factory system.

Complexity: 4/5
help wanted multiservice codequality

NewPipe's core library for extracting data from streaming sites

Java
#bandcamp#crawler#extractor#mediaccc#newpipe#peertube#scraper#soundcloud#youtube

AI Summary: Integrate Javadoc checks into the NewPipe Extractor's CI/PR pipeline to prevent faulty documentation from being merged into the dev branch. This involves configuring the CI system (likely GitHub Actions) to run a Javadoc verification step before accepting pull requests, ensuring documentation quality and preventing build failures during the release process.

Complexity: 3/5
enhancement meta good first issue

NewPipe's core library for extracting data from streaming sites

Java
#bandcamp#crawler#extractor#mediaccc#newpipe#peertube#scraper#soundcloud#youtube

AI Summary: The task is to improve the robustness of the YouTube metadata extraction in the NewPipe Extractor library. Currently, the extraction relies on fixed indices, which are prone to breaking when the YouTube layout changes. The solution involves making the extraction logic more resilient by identifying metadata elements (uploader, view count, upload date) based on their inherent properties (e.g., presence of specific strings, attached style/attachment runs) rather than their position.

Complexity: 4/5
enhancement youtube codequality good first issue

NewPipe's core library for extracting data from streaming sites

Java
#bandcamp#crawler#extractor#mediaccc#newpipe#peertube#scraper#soundcloud#youtube

AI Summary: Add @Nullable annotations to the fields in the ListExtractor.java class of the NewPipe Extractor library to explicitly indicate which fields can be null. This will improve code clarity, prevent potential crashes caused by unexpected null values, and align with Kotlin's null safety features.

Complexity: 2/5
help wanted codequality good first issue

NewPipe's core library for extracting data from streaming sites

Java
#bandcamp#crawler#extractor#mediaccc#newpipe#peertube#scraper#soundcloud#youtube