- What has changed with ElevenLabs Music v2
- Multi-Genre Coherence Architecture
- Immediate impact for creative agencies and SMEs
- Three concrete use cases for Italian SMEs
- The competitive landscape: where does Music v2 position itself?
- What press releases don't say
- What to do now: three operational priorities
- Outlook: Where AI audio generation is heading in 2027
ElevenLabs has released Music v2, an updated AI-based music generation model. The main novelty is the ability to handle fluid transitions between very different genres—opera, heavy metal, rap—within a single track. Additionally, the model introduces a function for inpainting audioIs it possible to regenerate specific sections of a passage without altering the already approved parts.
Therefore, for creative agencies and SMEs that produce video content, podcasts, or digital campaigns, this update significantly reduces music post-production times. In particular, inpainting eliminates the need to start from scratch every time a single section does not meet the client's expectations. Consequently, the workflow becomes more iterative and controllable.
We of SHM Studio we are closely monitoring the evolution of AI tools applied to content production. In fact, the integration of solutions like ElevenLabs Music v2 into processes such as digital marketing it can generate concrete competitive advantages for Italian SMEs. In summary, this is a relevant update, not a simple version increase.
What has changed with ElevenLabs Music v2
May 28, 2026, ElevenLabs announced the release of Music v2, the second generation of your AI-based music synthesis model. According to The Decoder, the model can handle transitions between musically distant genres—opera, heavy metal, rap—while maintaining the harmonic and structural coherence of the piece.
Additionally, Music v2 introduces a feature of inpainting audio. This mechanism allows you to select a specific section of a generated track and regenerate it in isolation. The remaining parts remain intact. Therefore, the revision process becomes surgical, not destructive.
Previously, any edits.
Multi-genre coherence architecture
The central technical challenge of Music v2 isn't generating individual genres. In fact, prior models—including the first version of ElevenLabs Music—were already capable of producing stylistically coherent tracks within a defined genre. The qualitative leap concerns Gender transitions.
Maintaining musical consistency during a transition from opera to metal involves simultaneously managing variables of timbre, tempo, harmony, and rhythmic structure. However, ElevenLabs claims that Music v2 successfully navigates these transitions without losing the song's narrative thread. This is a non-trivial outcome from a model architecture perspective.
Similarly, the inpainting function requires the model to understand the surrounding musical context before regenerating the selected section. In this sense, the model operates with a form of musical context awareness which brings it closer to the behavior of a human editor.
Immediate impact for creative agencies and SMEs
For agencies that produce video content, commercials, or digital campaign materials, original music often represents a significant fixed cost. In particular, acquiring licenses for commercial tracks or using professional composers impacts production budgets, especially for SMEs.
So, tools like ElevenLabs Music v2 open up a different operational scenario. An agency can generate original music tailored to each format—reels, pre-rolls, podcasts, corporate presentations—without recurring licensing costs. Furthermore, inpainting allows for the rapid adaptation of the same track to different creative variations.
We of SHM Studio We work daily with Italian SMEs that manage multi-channel campaigns. Therefore, reducing audio production time has a direct impact on speed to market. An agency that integrates Music v2 into its workflow can deliver audio variations in hours, not days.
To learn more about how AI integrates into content production processes, it's useful to consult our dedicated pages on AI services and to the SEO copywriting.
Three practical use cases for Italian SMEs
First of all, it's worth identifying the contexts in which Music v2 yields the greatest operational return. Below are three representative scenarios for the Italian B2B and retail market.
- Social video campaign Generating original music for reels and short videos eliminates the risk of copyright claims on platforms like Instagram and YouTube. Furthermore, multi-genre consistency allows for differentiating musical style for different audience segments without resorting to separate tracks.
- Podcast and branded audio content: SMEs that produce institutional podcasts or industry interviews can generate custom jingles and music beds. In particular, inpainting allows for updating individual sections when the brand evolves, without having to redo the entire sonic identity.
- Presentations and event materials: Agencies that organize B2B events or trade shows can produce custom soundtracks for every moment of the day—opening, networking, closing—with a single instrument and reduced turnaround time.
These scenarios connect directly to the activities of digital marketing and all LinkedIn campaign that we manage for our clients. Therefore, audio becomes an element of the content strategy, not an accessory.
The competitive landscape: where does Music v2 position itself?
ElevenLabs isn't the only player in this space. However, the combination of multi-genre transitions and inpainting sets it apart from more direct competitors. Tools like Suno and Udio offer text-based music generation, but with less control over fine-grained section revisions.
According to the analysis of Gartner, generative AI applied to creative media is in a phase of rapid maturation. Consequently, the gap between leading and secondary models tends to widen quickly. Those who adopt the most advanced tools today build an operational advantage that will be difficult to overcome later on.
Moreover, it's worth considering that ElevenLabs already has an established position in the AI voice synthesis market. Therefore, the expansion into music follows a platform logic: a single provider for voice, sound effects, and original music. For agencies, this reduces the complexity of the tool ecosystem.
For those who manage Google Ads campaigns with video components, integrating quality original audio can improve ad engagement metrics. Furthermore, for activities SEO Link to YouTube, original music removes restrictions on monetization and distribution.
What press releases don't say
It is appropriate to maintain a critical reading. ElevenLabs claims that multi-genre transitions occur without losing musical coherence. However, the perceived quality of these transitions depends on the use context and the final audience's expectations.
For content intended for social media platforms or corporate videos, the quality level of Music v2 is likely sufficient. Conversely, for productions requiring a distinctive and refined sound identity—television commercials, soundtracks for high-profile corporate films—direct testing is necessary.
Additionally, the inpainting feature, while promising, introduces a new learning curve. Granular control of an AI-generated song requires basic musical knowledge to be exploited effectively. Nevertheless, for a creative team with even elementary knowledge of musical structure, the operational advantage is clear.
For those who wish to explore the integration of these tools into a structured content strategy, our team is available via the Contact Us. Finally, to stay updated on the most relevant AI developments for Italian SMEs, the SHM Studio Blog publish regular analyses on these topics.
What to do now: three operational priorities
For SMEs and agencies looking to evaluate Music v2 in a structured way, we suggest three immediate priorities.
- Map existing audio touchpoints: Identify all content formats that currently use purchased or licensed music. This census defines the scope of immediate application of the tool.
- Start a pilot on a specific format. Choose a low-risk format, such as social reels, and test Music v2 on a full production cycle. Then, measure the time saved and the customer's perceived quality.
- Integrate audio into your content strategy: treat original music not as a decorative element but as a variable of brand identity. In particular, evaluate the consistency between musical style and brand positioning.
To further explore how to structure a content strategy that includes audio, video, and text coherently, our pages on web services and on digital marketing offer a useful starting point. Additionally, for SMEs who want to understand how AI fits into an editorial plan, the section AI services Describe the adoption paths we follow with our clients.
Outlook: Where AI audio generation is heading in 2027
Music v2 is a directional signal, not a destination. According McKinsey, generative AI applied to creative media has significant automation potential in content production processes. Consequently, in the next 18-24 months, it is reasonable to expect even more controllable and integratable models into agency production pipelines.
In particular, the convergence between speech synthesis, music generation, and AI video production suggests that by 2027-2028, it will be possible to produce complete audiovisual content—voice, music, images—with a single AI-assisted workflow. For Italian SMEs, this scenario redefines the relationship between production budgets and the quality of the final content.
Therefore, adopting operational familiarity with tools like ElevenLabs Music v2 today is not just a tactical choice. It's an investment in the ability to compete in a rapidly reshaping content ecosystem. Those who wait for the market to stabilize risk starting behind.
Related articles
Discover other articles that explore similar topics in depth, selected to give you a more complete and stimulating view. Each piece of content is carefully chosen to enrich your experience.