Examples of prompts generated by ChatGPT-4.0

Description Ours (QA-MDT)
Params: 675M
AudioLDM2
(negative prompt: "low quality")
MusicGEN
Params: 1.5B
A modern synthesizer creating futuristic soundscapes.
Prompt from demo of AudioLDM2
Dynamic, vibrant, and stylish music for advertising soundtracks.
Shoegaze track with dreamy guitars and reverb effects.
Tranquil piano piece for relaxation and mindfulness.
Chillwave track with nostalgic synths and dreamy atmospheres.
Soft music for video dubbing that evokes a sense of vitality and joy.
Vaudeville piece with humorous lyrics and theatrical presentation.
Violin performance with a sentimental and nostalgic atmosphere.
Gothic rock track with dark atmospheres and intense guitars.
J-pop tune with upbeat rhythms and catchy melodies.
Celtic melody with traditional instruments and lilting rhythms.
Dancehall tune with infectious rhythms and party vibes.
Glam metal song with anthemic choruses and flashy solos.
Polka tune with lively accordion and cheerful rhythms.
Smooth, soulful saxophone in a relaxed jazz tune for quiet evenings.
Hip-hop beat with deep bass and sharp electronic samples.
Folk melody with acoustic guitar and harmonica.
Acoustic ballad with heartfelt lyrics and soft piano.
Drum and bass track with rapid beats and energetic bass lines.

Examples of prompts in MusicCaps

Description Ours (QA-MDT)
Params : 675M
AudioLDM2
(negative prompt: "low quality")
Ground Truth
The clip just contains a high pitched synth melody. The motif keeps going higher in pitch, and then eventually a riser comes in. The combination of these two factors creates a sense of build-up to a climactic moment.
This composition contains an upright bass playing softly along to a harp and strings playing a melody while a male deep voice is softly singing a melody sounding like telling a story. The song sounds like it was made for Christmas. This song may be playing at home having dinner with the whole family.
The song is an instrumental. The song is slow tempo with gentle drum brushes, percussion, bass guitar solo, and piano accompaniment gently. The song is groovy and emotional. The song is possibly a Christian worship song or a smooth jazz song. The audio quality is average.
This audio contains someone playing a piece on cello ranging from the low register up into the higher register. This song may be playing during a live performance.
This is a documentary music piece. There is a strings section that is holding a single chord. The bass guitar is repeating an ominous bass line. There is a simple percussion beat in the rhythmic background. The atmosphere has a dramatic feel to it. It feels like a story is about to unfold. The piece suits perfectly as a documentary music piece. It could also fit well in the soundtrack of a mystery movie.
This audio contains people playing rhythms and melodies on bells with sticks. This is an amateur recording.
This is a live performance of an alternative R&B music piece. There is a female vocalist singing in a seductive manner in the lead. Later on, a male vocalist starts rapping. There is a groovy bass line in the background. The DJ is scratching the turntable. The rhythmic background is provided by an R&B acoustic drum beat. The atmosphere is sensual and the sound is urban. Sounds of the crowd cheering can be heard in the recording.
The R&B song features a passionate male vocalist singing over a wide funky electric guitar melody, smooth bass guitar, punchy kick and snare hits, shimmering hi hats, snappy rimshots and soft crash cymbals. The rimshots are present in the first half of the loop, while a more energetic second part of the loop consists of punchy snare hits. It sounds emotional and heartfelt, as the vocal is slightly distorted.
A female vocalist sings this delicate harmony. The tempo is slow with an organ accompaniment. It is soft, mellow, meditative, enigmatic, mysterious, melancholic and haunting. This song is a Modern Classical.
The song track is instrumental. The tempo is medium with temple bells being played to create two distinct tones with long vibrations creating clear overtones. The soundtrack is calming and deeply religious. The audio track quality is poor with ambient environment noises.
This is an instrumental piece with a harmonica as the main lead melodic instrument. High pitched, wobbly sounds are played on the harmonica and contrasted by a deep and full acoustic guitar on which arpeggios are played. There's a sustained pad synth in the background which adds to the overall calming and soothing tone of the song.
This song contains someone strumming chords on an acoustic guitar while playing a harmonica. This song may be playing at a local bar.
Someone is playing a fast melody on a low bansuri flute along with someone playing tablas and a shrutibox in the background. This song may be playing at a live performance.
The pop rock music features a male voice singing. An electric guitar with a distortion effect on plays plays two chords every two measures. The drums play a strong rhythm and together with a synth bass drive the pulse of the music.
The Electro Pop song features a flat female vocal, occasionally supported by wide background female doubling vocals, singing over quiet drums, groovy and boomy bass, arpeggiated synth melody and some sound effects of the airplane and the explosion. In the second part, the drums cut through the mix more, therefore they are more audible, while the new elements appear, including shimmering bells and simple hi hats. Sounds like a low quality recording, especially because of that first part of the loop.
The low quality recording features a flat male vocal singing over acoustic rhythm guitar chords, after which that same vocal is talking. The song sounds passionate, while the recording, overall, is noisy and in mono.
A male singer sings this beautiful melody with backup singers in vocal harmony. The song is medium tempo with a percussive string section, strong bass line, guitar lead, steady drumming rhythm , keyboard accompaniment and various percussion clicks. The song is emotional and romantic. The song is of poor audio quality.
This is a calm type of song which features a flute being intricately played on top of the various instruments. There is a violin creating a sustained tone underneath. The music feels mystical and enchanting.
This is a bluegrass music piece. There is a mandolin playing the main tune as the lead while a banjo and an acoustic guitar are supporting it in the melodic background. There is a joyful feeling to this piece. It could be used in the soundtrack of a movie or a TV show with a rural setting. It could also be used in the background of pastoral social media content.
The male mid to high range voice sings loudly and full of emotions pouring out of his soul while strumming some minor chords on the guitar. This recording is of poor quality. This song may be played in an open mic, poetry bar.
The Rock song features an energetic male vocal singing over repetitive electric guitar melody, groovy bass, punchy kick and snare, shimmering hi hats and energetic crash cymbal. There is a short drum roll that represents a variation in a repetitive loop. It sounds energetic and passionate.
This is a parody of an electronic music song with a chipmunk vocal effect. There is an edited version of the original track playing in the background that is transposed to a very high pitch.
The song is mostly instrumental with a faint male vocal. The song is medium tempo with a slick drumming rhythm , booming bass line, siren tones and a keyboard playing arpeggiated tones. The song is followed by camera flash and click tones. The song is exciting with a lot of fanfare. The song is fading with the end credits superimposed with camera flash tones.
This music is a western classical instrumental. The tempo is slow with a cello solo. The music is soft, rich, deep, mellow, euphonious, pensive, melancholic, emotional and sentimental.
This song contains a plucked string instrument playing a melody in the higher register along with an acoustic guitar strumming chords on the backbeat and an upright bass playing a simple melody along to the lead melody. This song may be playing in a video-presentation.
This is a recording that was done outdoors and later had background music placed over it. There are the sounds of footsteps, and a young male saying something unintelligible. The track placed over it is a high octane rock song with fast electric bass playing, overdriven electric guitar strumming and a rock motif on the electric guitar.