Datasets collection and preprocessings framework for NLP extreme multitask learning
-
Updated
Jul 9, 2025 - Python
Datasets collection and preprocessings framework for NLP extreme multitask learning
Datasets for skin image analysis
A complete imitation learning pipeline for bar alignment using the UR5 robot in NVIDIA Isaac Sim. Includes manual data collection with a game controller, dataset organization for LeRobot, diffusion policy training, and policy deployment through ROS2.
Migrated to pyedmine
SDN Topology Emulation and Development of Dataset for ML-Based Intrusion Detection through the Ryu SDN Framework, Mininet and VirtualBox VMs
A full-stack webapp for collecting and managing speech datasets.
Script automation for Dataset Collection and Building with Tkinter GUI.
DatasetIQ is an automated, structured registry of machine learning datasets with unified metadata, continuous validation, and queryable access for dataset discovery, comparison, and selection.
🌐 Build smarter living solutions with AI_Hanlin, a next-gen AI workstation designed to enhance everyday experiences and streamline daily tasks.
Webots Image Dataset Collection For Computer Vision And Deep Learning
Automated drone flight system for collecting multiview images.
🎥 Transform video files into detailed production documents with CineView AI, an automated tool for filmmakers and content creators, powered by Google Gemini.
Techgium hackathon submission
Collection of country-level business email datasets and B2B contact samples for research, analytics, and data science projects.
A GUI for managing, visualizing, and analyzing competitive programming datasets with a PyQt6 GUI.
CyberClassify - A carefully crafted tool for classifying and organizing malware datasets.
Awesome-matchem-datasets is a curated collection of high-quality datasets for machine learning and data analysis in the field of chemistry. This repository includes various datasets, ranging from molecular structures to experimental results, suitable for both research and educational purposes.
This interactive Python tool enables the recording of bilingual audio samples using PyAudio and ipywidgets. Designed for data collection tasks such as speech datasets, it provides a user-friendly interface to record, save, label, and manage audio files directly within a Jupyter Notebook.
自动搜集获取相关的视频,接管浏览器海量搜集,并自动判别。Automatically collects relevant videos, takes over bulk browser collection, and makes automatic judgments.
Android app for biometric eye image capture with automatic quality evaluation, and dataset collection support.
Add a description, image, and links to the dataset-collection topic page so that developers can more easily learn about it.
To associate your repository with the dataset-collection topic, visit your repo's landing page and select "manage topics."