Hey everyone,
I wanted to share a UI case study layout I put together for a research project that maps three input streams, text, vocal frequencies (Bi-LSTM), and facial micro-expressions (CNN), into dynamic audio layers via Music21.
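For anyone wondering how several streams can end up driving one set of audio layers, here is a rough sketch of one simple option, late fusion. It is not the project's actual code. It assumes each model emits a probability distribution over a shared set of emotion labels. The label set, the stream weights, and the tempo/mode table are all placeholders made up for illustration. In a real pipeline the scores would come from the models, and the chosen parameters would then be passed on to Music21.

```python
from typing import Dict

# Hypothetical shared label set; the real models may use different classes.
EMOTIONS = ("happy", "sad", "angry", "calm")

# Hypothetical mapping from the dominant emotion to audio-layer parameters.
LAYER_PARAMS = {
    "happy": {"tempo_bpm": 128, "mode": "major"},
    "sad": {"tempo_bpm": 70, "mode": "minor"},
    "angry": {"tempo_bpm": 150, "mode": "minor"},
    "calm": {"tempo_bpm": 85, "mode": "major"},
}


def fuse_streams(
    streams: Dict[str, Dict[str, float]], weights: Dict[str, float]
) -> Dict[str, float]:
    """Late fusion: weighted average of per-stream emotion probabilities."""
    total_weight = sum(weights[name] for name in streams)
    fused = {emotion: 0.0 for emotion in EMOTIONS}
    for name, probs in streams.items():
        for emotion in EMOTIONS:
            fused[emotion] += weights[name] * probs.get(emotion, 0.0) / total_weight
    return fused


def pick_layer(fused: Dict[str, float]) -> Dict[str, object]:
    """Select layer parameters for the highest-scoring fused emotion."""
    dominant = max(fused, key=fused.get)
    return {"emotion": dominant, **LAYER_PARAMS[dominant]}


# Made-up scores standing in for the outputs of the three streams.
streams = {
    "text": {"happy": 0.6, "sad": 0.1, "angry": 0.1, "calm": 0.2},
    "voice": {"happy": 0.3, "sad": 0.4, "angry": 0.1, "calm": 0.2},
    "face": {"happy": 0.5, "sad": 0.2, "angry": 0.1, "calm": 0.2},
}
weights = {"text": 1.0, "voice": 1.0, "face": 1.0}

fused = fuse_streams(streams, weights)
layer = pick_layer(fused)
print(layer)
```

Averaging at the score level keeps the three models independent, so a slow or missing stream can be left out of `streams` without touching the others.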
Because the underlying models are too heavy for basic free hosting tiers, I built a lightweight Gradio interface that serves as a zero-click visual production walkthrough and tech-stack overview, with no live inference required.
Would love any feedback on the layout structure or optimization tips for multi-stream pipelines!
Link: Enigma Sound Ai - a Hugging Face Space by ApurvaDev111