Skip to content
View guidogerb's full-sized avatar

Highlights

  • Pro

Block or report guidogerb

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
guidogerb/README.md

GuidoGerb Pioneering Labs for Logic & Creativity

An academic inquiry into the frontier where music, artificial intelligence, scientific computing, and the human spirit converge.

Gary Gerber examines and documents the space where jazz guitar pedagogy, large ensemble composition, digital signal processing, and rigorous software engineering meet. This is a scholarly endeavor — a place of disciplined reasoning and imaginative exploration, where logic and creativity are treated as complementary faculties of the human mind. The work here is dedicated to elevating the human experience through careful research, principled craftsmanship, and the celebration of live music performance and human artistic expression.


Areas of Scholarly Inquiry

Gary Gerber investigates and documents research, experiments, and reference works across:

  • Music Information Retrieval & DSP — Audio feature extraction, spectral analysis, and signal processing in service of composition, mixing, jazz guitar pedagogy, and large ensemble writing.
  • AI & Machine Learning for Music and Art — Inquiry at the intersection of machine learning, sound design, and visual artistry — always with the human performer and composer at the center.
  • Pure Web Technology & Rust/WASM Architecture — A dedicated effort porting all web technologies to pure HTML5 and pure JavaScript as lightweight, zero-dependency wrappers around singular Rust WebAssembly components — deeply committed to eradicating supply chain exploits, maximally hardened security postures, and maximally performant platforms.
  • Jazz Guitar Pedagogy & Large Ensemble Composition — Research, notation frameworks, and educational study centered on jazz guitar instruction and orchestral authoring for human performers.

A Broader Human Purpose

This scholarship is animated by a single overarching intent: to promote the human experience. Gary Gerber studies and develops tools that:

  • Support and celebrate live music performance and human artistic expression in all its forms.
  • Enable large populations — students, educators, composers, writers, and citizens — with authoring tools for literature, digital media, and musical works.
  • Empower governments and civic institutions to support the good efforts of people, the health of communities, and the flourishing of society.
  • Are built around the principles of happiness, health, and the traditional atomic family as the foundational organizational unit of global societies.
  • Champion the success of all humankind as a global family — supporting only good ideals and fighting for the dignity, freedom, and flourishing of every person.

A Curated Bibliography

This organization maintains a large and growing collection of public library references and repositories that Gary Gerber has evaluated and finds aligned with these academic and humane values — music technology, open audio standards, scientific computing, Rust/WASM tooling, AI research, and human-centered software design. Consider it an honest intellectual map of the landscape under study.


Deep Gratitude to Anthropic

An enormous and sincere acknowledgment is owed to Anthropic — a true industry leader in AI safety and capability. Their exceeding expertise, principled approach to building beneficial AI, and their commitment to exposing cutting-edge tools to researchers, educators, and creators have made work like this possible. Claude and the broader Anthropic ecosystem have been instrumental in helping Gary Gerber think more clearly, reason more rigorously, and create more ambitiously in service of human flourishing. Thank you, Anthropic, for building with conscience.


GuidoGerb Pioneering Labs for Logic & Creativity — Utah, USA · A scholarly workshop by a musician and programmer, in service of people.


Vision

I’m interested in using open tooling to make it easier for researchers, engineers, and artists to:

  • Prototype and share audio/ML experiments quickly.
  • Reuse models, datasets, and notebooks across projects.
  • Bridge DAWs, live performance tools, and research codebases.

If your work touches music tech, MIR, generative models, creative coding, or applied data science for the arts, you’re in the right place.


Projects You Might Find Here

  • Audio and MIDI preprocessing utilities.
  • Model training / evaluation scripts for music and audio ML.
  • Tools that connect DAWs, live‑coding environments, and ML backends.
  • Notebooks and minimal demos for scientific and artistic experiments.

(Repo‑level READMEs go into install, usage, and citations.)


How to Collaborate

Contributions are very welcome, especially from:

  • Researchers (MIR, audio ML, generative models, HCI for creative tools).
  • Musicians, producers, and sound designers who like to prototype with code.
  • Data/ML engineers interested in open creative tooling.

Ways to get involved:

  1. Open an issue to propose a feature, experiment, or integration.
  2. Fork a repo, create a small, focused PR (bugfix, refactor, new example, or doc).
  3. Share example notebooks, demo projects, or datasets that others can build on.

Please keep contributions:

  • Reproducible (clear environment, minimal config).
  • Well‑documented (short README, comments where non‑obvious).
  • Respectful of licensing for samples, datasets, and models.

Contact

If you’d like to discuss a collaboration, research idea, or integration with your lab, DAW workflow, or art project, feel free to:

  • Open a “discussion” or issue in the most relevant repository.
  • Mention @GuidoGerb on GitHub in a thread you’d like me to see.

Let’s build tools that make science and art talk to each other.

Favorite Projects

Repository Upstream Description
7digital-api raoulmillais/7digital-api Node.js wrapper for the 7digital API
AgentCPM-GUI OpenBMB/AgentCPM-GUI GUI for AgentCPM
agentlego InternLM/agentlego Open toolkit for tool-augmented LLM agents
ai-toolkit ostris/ai-toolkit Various AI scripts and tools
AlchemistCoder InternLM/AlchemistCoder
ampache ampache/ampache A web based audio/video streaming application and file manager
ampache-administrator lachlan-00/ampache-administrator Admin tools for release and build processes
amplify-js aws-amplify/amplify-js A declarative JavaScript library for application development using cloud services
amplify-ui aws-amplify/amplify-ui Amplify UI Components
AnchorWeave guidogerb/AnchorWeave
anus anus-dev/ANUS
api-stub guidogerb/api-stub
app guidogerb/app
apps guidogerb/apps
ArcLight IzzelAliz/Arclight A Bukkit(1.20/1.21) server implementation in modding environment using Mixin. ⚡
asset-manager guidogerb/asset-manager
asset-manager-xfer guidogerb/asset-manager-xfer
AudioX ZeyueT/AudioX [ICLR 2026] Repository of AudioX
audiveris Audiveris/audiveris Open-source Optical Music Recognition
awesome-cursorrules PatrickJS/awesome-cursorrules 📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors
awesome-python vinta/awesome-python An opinionated list of awesome Python frameworks, libraries, software and resources
awesome-saleor saleor/awesome-saleor An opinionated list of awesome Saleor tools, libraries, and resources. Inspired by awesome-python.
blacklist-hosts SnowyYT07/blacklist-hosts
blender blender/blender Official Blender repository mirror
blender-addons blender/blender-addons Blender addons repository
blender-addons-contrib blender/blender-addons-contrib
blender-dev-tools blender/blender-dev-tools
blender-translations blender/blender-translations
blockchainvoting guidogerb/blockchainvoting
bridge-gapp guidogerb/bridge-gapp
camel camel-ai/camel CAMEL: Finding the Scaling Law of Agents
ChatDev OpenBMB/ChatDev Create Customized Software using Natural Language Idea
CogVideo THUDM/CogVideo
ComfyUI comfyanonymous/ComfyUI The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
ComfyUI-KJNodes kijai/ComfyUI-KJNodes Various utility nodes for ComfyUI
ComfyUI-LTXVideo Lightricks/ComfyUI-LTXVideo
ComfyUI-Mickmumpitz-Nodes mickmumpitz/ComfyUI-Mickmumpitz-Nodes
ComfyUI-Qwen-TTS flybirdxx/ComfyUI-Qwen-TTS A Simple Implementation of Qwen3-TTS's ComfyUI
ComfyUI_examples comfyanonymous/ComfyUI_examples
ComfyUI_OmnimatteZero smthemex/ComfyUI_OmnimatteZero Official implementation of OmnimatteZero: Training-Free Video Matting and Compositing via Latent Diffusion Models
ComfyUI_RH_DreamID-V HM-RunningHub/ComfyUI_RH_DreamID-V This is a ComfyUI plug-in for bytedance/DreamID-V
communique guidogerb/communique
configurator guidogerb/configurator
crewAI crewAIInc/crewAI Framework for orchestrating role-playing, autonomous AI agents
CubeComposer TencentARC/CubeComposer [CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
CUDA-Agent BytedTsinghua-SIA/CUDA-Agent CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
cycles blender/cycles The Cycles Render Engine - official mirror
diloco_simple PrimeIntellect-ai/diloco_simple torch implementation of diloco
DreamID-Omni Guoxu1233/DreamID-Omni [ICML 2026] DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
DreamID-V bytedance/DreamID-V DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
fastapi fastapi/fastapi FastAPI framework, high performance, easy to learn, fast to code, ready for production
fastapi-cli fastapi/fastapi-cli FastAPI CLI
fastmcp jlowin/fastmcp The fast, Pythonic way to build MCP servers and clients
FastVMT mayuelala/FastVMT [ICLR 2026] FastVMT: This repo is the official implementation of "FastVMT: Eliminating Redundancy in Video Motion Transfer"
ffmpeg-custom feixiaku/ffmpeg-custom add custom filter/avcodec/avformat
FreeCAD FreeCAD/FreeCAD A free and open source parametric 3D modeler
fsutil guidogerb/fsutil
garygerber-website guidogerb/garygerber-website
ggml ggml-org/ggml Tensor library for machine learning
ggp-design-system guidogerb/ggp-design-system
ggp-mcp-agent guidogerb/ggp-mcp-agent
ggp-midi-to-musicxml guidogerb/ggp-midi-to-musicxml
ggp-python-project guidogerb/ggp-python-project
ggp-react-project guidogerb/ggp-react-project
ggp-studio guidogerb/ggp-studio
gitea go-gitea/gitea Git with a cup of tea! Painless self-hosted all-in-one software development service
GLM-5 zai-org/GLM-5 GLM-5: From Vibe Coding to Agentic Engineering
GLM-OCR zai-org/GLM-OCR GLM-OCR: Accurate × Fast × Comprehensive
gpt-builder Jonathan-Nunes71/gpt-builder bot assistant IA spécialisé en développement et automatisation
gpt-engineer AntonOsika/gpt-engineer Specify what you want it to build, the AI asks for clarification, and then builds it
graffiti-monkey Answers4AWS/graffiti-monkey Goes around tagging things
grok-1 xai-org/grok-1 JAX implementation of the Grok-1 open-weights model
guidogerb-paleontology guidogerb/guidogerb-paleontology
guidogerb-web guidogerb/guidogerb-web
guidogerb-website guidogerb/guidogerb-website
guidolib guidogerb/guidolib
hedgedoc hedgedoc/hedgedoc An open-source, web-based, self-hosted, collaborative markdown editor
Helios guidogerb/Helios
HiFi-Inpaint Correr-Zhou/HiFi-Inpaint [CVPR 2026] Offical implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images".
hivemind learning-at-home/hivemind Decentralized deep learning in PyTorch
hunyuan-text2video-comfyui-workflows trollize/hunyuan-text2video-comfyui-workflows Workflows to make videos with tencent's hunyuan video model...
HunyuanVideo Tencent/HunyuanVideo HunyuanVideo: A Systematic Framework For Large Video Generation Model
HY-WU Tencent-Hunyuan/HY-WU HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
IC-Light lllyasviel/IC-Light IC-Light is a project to manipulate the illumination of images
ids guidogerb/ids
improved-aesthetic-predictor christophschuhmann/improved-aesthetic-predictor Improved aesthetic predictor for LAION
Intern-S1 InternLM/Intern-S1 A Scientific Multimodal Foundation Model
InternLM InternLM/InternLM InternLM is a multilingual language model
InternLM-Math InternLM/InternLM-Math InternLM-Math: Open Math Large Language Models
InternLM-XComposer InternLM/InternLM-XComposer
JanusCoder guidogerb/JanusCoder
kaprekar-thermodynamics guidogerb/kaprekar-thermodynamics
Kiwi-Edit showlab/Kiwi-Edit A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.
lambda-graffiti-monkey guidogerb/lambda-graffiti-monkey
langchain langchain-ai/langchain Build context-aware reasoning applications
langchain-ts-starter domeccleston/langchain-ts-starter Langchain.js template to get started quickly
LavaSR ysharma3501/LavaSR 🌋LavaSR: Fast Speech restoration and enhancement
llama.cpp ggerganov/llama.cpp LLM inference in C/C++
llm-agent-break guidogerb/llm-agent-break
lmdeploy InternLM/lmdeploy A toolkit for deploying large language models
localStorage guidogerb/localStorage
LoRWeB NVlabs/LoRWeB We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enabling flexible and generalizable image edits based on example transformations.
LTX-2 Lightricks/LTX-2 Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
LTX-Desktop Lightricks/LTX-Desktop An open-source desktop app for generating videos with LTX models
LTX-Video Lightricks/LTX-Video Official repository for LTX-Video
markdown-to-pdf BaileyJM02/markdown-to-pdf A GitHub Action to make PDF and HTML files from Markdown
matrix.to matrix-org/matrix.to A simple stateless privacy-protecting URL redirecting service for Matrix
mb3d thargor6/mb3d Mandelbulb3D
mediawiki-bootstrap mtyeh411/mediawiki-bootstrap A customizable responsive Bootstrap MediaWiki skin.
MFLUX-WEBUI CharafChnioune/MFLUX-WEBUI MFLUX-WEBUI using MLX and the FLUX DEV and Schnell models
MindSearch InternLM/MindSearch An LLM-based Multi-agent Framework for Complex Internet-scale Searches
mini-swe-agent SWE-agent/mini-swe-agent The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
MiniCPM OpenBMB/MiniCPM MiniCPM series models
MiniCPM-o OpenBMB/MiniCPM-o
Modernizr Modernizr/Modernizr Modernizr is a JavaScript library that detects HTML5 and CSS3 features in the user’s browser.
MonarchRT guidogerb/MonarchRT
MSongsDB tbertinmahieux/MSongsDB Code for the Million Song Dataset, the dataset contains metadata and audio analysis for a million tracks, a collaboration between The Echo Nest and LabROSA. See website for details.
MuseScore musescore/MuseScore MuseScore is an open source and free music notation software
musicbrainz-docker metabrainz/musicbrainz-docker Run MusicBrainz server with Docker
musicbrainz-server metabrainz/musicbrainz-server The official MusicBrainz server codebase
new-uid-portal guidogerb/new-uid-portal
nopaste guidogerb/nopaste
nucleon-switch guidogerb/nucleon-switch
ollama ollama/ollama Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1, and other large language models
ollama-fargate guidogerb/ollama-fargate
ome guidogerb/ome
omnimatte erikalu/omnimatte Associating Objects and Their Effects in Video
OmnimatteZero dvirsamuel/OmnimatteZero [SiggraphAsia25] OmnimatteZero: Fast Training-free Omnimatte with Pre-trained Video Diffusion Models
OmniXtreme guidogerb/OmniXtreme
open-llms eugeneyan/open-llms A list of open LLMs available for commercial use
openai-node openai/openai-node Official JavaScript / TypeScript library for the OpenAI API
openai-python openai/openai-python The official Python library for the OpenAI API
OpenAOE InternLM/OpenAOE LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。
openclaw guidogerb/openclaw
OpenDiloco PrimeIntellect-ai/OpenDiloco OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
opensheetmusicdisplay opensheetmusicdisplay/opensheetmusicdisplay Open SheetMusic Display - Open Source Music XML Renderer
organize-models guidogerb/organize-models
owl camel-ai/owl OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
paperbanana guidogerb/paperbanana
penpot penpotapp/penpot Penpot: The open-source design tool for design and code collaboration
photoshop guidogerb/photoshop
PhysicEdit liangbingzhao/PhysicEdit [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors
playwright-python microsoft/playwright-python Python version of the Playwright testing and automation library
pojo-gernerator guidogerb/pojo-gernerator
portal-client guidogerb/portal-client
prime guidogerb/prime
Prompt-Engineering-Guide dair-ai/Prompt-Engineering-Guide Guides, papers, lecture, notebooks and resources for prompt engineering
puppeteer puppeteer/puppeteer JavaScript API for Chrome and Firefox
python-sdk modelcontextprotocol/python-sdk The official Python SDK for Model Context Protocol servers and clients
Qwen-Agent QwenLM/Qwen-Agent Agent framework and applications built on top of Qwen
Qwen-Image QwenLM/Qwen-Image Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Qwen3-Coder QwenLM/Qwen3-Coder Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.
Qwen3-Coder-Next-Zeta-GGUF kirillleventcov/Qwen3-Coder-Next-Zeta-GGUF
Qwen3-Omni QwenLM/Qwen3-Omni Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Qwen3-VL QwenLM/Qwen2.5-VL Qwen visual language model
Qwen3.5 guidogerb/Qwen3.5
RealWonder guidogerb/RealWonder
runner-images actions/runner-images GitHub Actions runner images
saleor saleor/saleor Saleor Core: the high performance, composable, headless commerce API
saleor-dashboard saleor/saleor-dashboard Saleor Dashboard is a GraphQL-powered, single-page application
saleor-vatrc blender/saleor-vatrc saleor-vatrc
seleniumish iamawkah/seleniumish This repository is selenium test scripts
SevenDigital.Api.Schema 7digital/SevenDigital.Api.Schema Schema in .Net for the 7digital Public API
SevenDigital.Api.Wrapper 7digital/SevenDigital.Api.Wrapper A fluent c# wrapper for the 7digital API
sgl-cookbook sgl-project/sgl-cookbook Cookbook of SGLang - Recipe
sgl-docs sgl-project/sgl-docs
sgl-learning-materials sgl-project/sgl-learning-materials Materials for learning SGLang
sgl-project.github.io sgl-project/sgl-project.github.io This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang
sglang sgl-project/sglang SGLang: Fast Serving Framework for Large Language and Vision-Language Models
sglang-omni sgl-project/sglang-omni SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
solaris guidogerb/solaris
solaris-engine guidogerb/solaris-engine
SpatialT2I DAGroup-PKU/SpatialT2I [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
Spectrum guidogerb/Spectrum
spksrc SynoCommunity/spksrc Cross-compilation framework to create Synology packages
spring-cli spring-io/spring-cli Spring CLI
spring-data-relational spring-projects/spring-data-relational Spring Data Relational
sqlmodel fastapi/sqlmodel SQL databases in Python, designed for simplicity, compatibility, and robustness
stable-diffusion-webui AUTOMATIC1111/stable-diffusion-webui Stable Diffusion web UI
Stable-Video-Infinity vita-epfl/Stable-Video-Infinity [ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
Step-3.5-Flash stepfun-ai/Step-3.5-Flash Fast, Sharp & Reliable Agentic Intelligence
storefront saleor/storefront Saleor Storefront built using React, Next.js with App Router, TypeScript, GraphQL, and Tailwind CSS.
StoryDiffusion HVision-NKU/StoryDiffusion StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
streamlit streamlit/streamlit Streamlit — A faster way to build and share data apps
surge surge-synthesizer/surge Surge Synthesizer is an open source, hybrid synthesizer, and the result of a large, supportive community of contributors
SWE-agent princeton-nlp/SWE-agent SWE-agent takes a GitHub issue and tries to automatically fix it
SWE-ReX princeton-nlp/SWE-ReX
tasks-tracker guidogerb/tasks-tracker
temp-uid guidogerb/temp-uid
terraform-aws-lambda-scheduler-stop-start diodonfrost/terraform-aws-lambda-scheduler-stop-start Terraform module that creates a Lambda scheduler to stop and start resources on AWS
terraform-ggp guidogerb/terraform-ggp
test guidogerb/test
tiny-aya-tech-report Cohere-Labs/tiny-aya-tech-report
Track4World TencentARC/Track4World Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels
tttLRM cwchenwang/tttLRM [CVPR 2026 Highlight] tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction
UltraRAG OpenBMB/UltraRAG A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Utonia guidogerb/Utonia
VecGlypher guidogerb/VecGlypher
vexchords 0xfe/vexchords Chord diagrams for JavaScript
vexflow 0xfe/vexflow A JavaScript library for rendering music notation and guitar tablature
videomt tue-mps/videomt [CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).
VoxCPM guidogerb/VoxCPM
Wan2GP 6Morpheus6/wan2gp [NVIDIA | AMD] Unified wan2gp installer for all NVIDIA GPU's and AMD with Python 311, Pytorch 2.10 and NVFP4 Support for RTX50. Super Optimized Gradio UI for AI video creation (6GB+ VRAM). Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and OVI and more. (AMD Windows supported by 7900(XT), 7800(XT), 7600(XT), Phoenix, 9070(XT) and Strix Halo)
whisper openai/whisper Robust Speech Recognition via Large-Scale Weak Supervision
whisper.cpp ggerganov/whisper.cpp Port of OpenAI's Whisper model in C/C++
wixy guidogerb/wixy
wordlebuster guidogerb/wordlebuster
Z-Image Tongyi-MAI/Z-Image
zenphoto zenphoto/zenphoto Free, open-source photo and media gallery CMS

Popular repositories Loading

  1. blacklist-hosts blacklist-hosts Public

    Forked from StevenBlack/hosts

    🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories.

    Python

  2. guidogerb guidogerb Public

    Home of the ultimate Generative AI tools on planet earth.

    Python