Drag files or click to select
You can convert 3 files up to 10 MB each
Drag files or click to select
You can convert 3 files up to 10 MB each
What AVI to AAC conversion actually does
AVI (Audio Video Interleave) is one of the oldest multimedia containers, designed by Microsoft in 1992 as part of Video for Windows. Files with the .avi extension served for nearly two decades as the primary format for home video, video editing on Windows, digital VHS rips, DVD copies and webcam recordings. AVI is a container based on RIFF (Resource Interchange File Format) capable of storing virtually any video and audio codec of its era: DivX, Xvid, MPEG-4 Part 2, less often H.264 for video, and PCM, MP3, AC3, MPEG-1 Layer II, less often AAC for audio.
AAC stands for Advanced Audio Coding and is a modern audio codec designed as the successor to MP3. Unlike M4A, which is a container with extended capabilities, a file with the .aac extension stores the audio stream in raw form - a sequence of ADTS frames with no extra container wrapping. This brings two clear advantages: minimal file size thanks to the absence of container overhead, and maximum simplicity for systems that read audio in a streaming fashion.
Converting AVI to AAC is the process of separating the audio track from the video and storing it as ADTS. The video stream is discarded entirely, only the audio frames remain, packed into a streaming format that can be read from any position without first parsing container metadata. If the source AVI has no audio track (a silent screen capture or a recording with a broken microphone, for example), the conversion is not performed and the service reports the absence of sound.
The peculiarity of AVI is that it almost never carries sound in AAC. The standard set of AVI audio codecs is PCM (uncompressed, like in WAV), MP3, AC3 and MPEG-1 Layer II. This is tied to the format's history: AVI was popular even before AAC became widespread, and most video compression utilities of that time (DivX, Xvid, VirtualDub) saved sound in MP3 or PCM. So when extracting AAC from AVI the service almost always re encodes the audio: the source stream is decoded and reassembled into AAC.
Technical differences between AVI and AAC
File structure
AVI is a RIFF based container originally designed for streaming short video clips on the limited memory of Windows 3.1. The file holds separate tracks (video and audio), an index table at the end of the file, and a general header describing parameters. The structure is dated: the index has a size limit, which causes problems for long files (over 4 GB), and it copes poorly with modern variable bitrate codecs.
AAC in the form of an ADTS file is fundamentally simpler. It is a sequence of independent frames, each starting with its own synchronisation header of 7 or 9 bytes. The header specifies the sample rate, the channel count and the profile version. No chapters, no cover art, no multilingual tracks, no attachments - only audio data. This structure was designed for streaming broadcasts and embedding into systems that read audio as data arrives.
What usually sits in the AVI audio track
In most real world AVI files the audio is stored in one of the older formats:
- PCM 16-bit (uncompressed audio) - found in webcam archives, VHS digitisations and minimum compression scenarios. The track takes a lot of space (about 10 MB per minute of stereo) but preserves full precision.
- MP3 - the most common variant for DivX/Xvid rips from the late 1990s through the mid 2000s. The bitrate is usually 128 to 192 kbps stereo, sometimes 96 kbps for budget rips.
- AC3 (Dolby Digital) - used in DVD rips, typically 192 to 448 kbps with multichannel sound (5.1).
- MPEG-1 Layer II - found in TV rips and video archives from digital TV tuners.
- AAC - rarely, mostly in later AVI files after 2008.
All of these formats are not directly compatible with the streaming ADTS form (apart from AAC), so when extracting AAC from AVI the service almost always re encodes the audio.
What happens to the sound during conversion
The service decodes the source audio to uncompressed PCM in memory and then encodes it back into AAC at a default bitrate of 192 kbps. Re encoding is performed in a single pass and preserves the source sample rate (44.1 or 48 kHz) and basic channel count (mono or stereo). This is lossy re encoding relative to the source, but the loss is minimal: AAC LC at 192 kbps is subjectively indistinguishable from MP3 192 kbps or AC3 source on consumer equipment and good headphones.
If the source AVI does carry an already compressed AAC stream (uncommon but possible), the service detects that and copies the stream into ADTS without re encoding. In that case quality stays identical to the source: the same frames, the same bitrate, the same sample rate.
What happens to the video stream
The video stream is discarded entirely. This is not compression and not a quality reduction - the video simply does not end up in the output file. To keep both sound and picture, choose conversion between video formats (AVI to MP4) rather than extracting AAC.
Size comparison
| Duration | AVI (DivX/Xvid rip) | AAC (192 kbps) | Reduction |
|---|---|---|---|
| 5 minutes | around 35-70 MB | around 7 MB | 5 to 10x |
| 30 minutes | around 200-400 MB | around 42 MB | 5 to 10x |
| 1 hour | around 400-800 MB | around 85 MB | 5 to 10x |
| 1.5 hour movie | around 700 MB-1.4 GB | around 130 MB | 5 to 10x |
| 2 hour movie | around 1-2 GB | around 170 MB | 6 to 12x |
AVI files of DivX/Xvid rips are typically heavier than MP4 of the same duration thanks to dated codecs and less efficient compression. Once AAC is extracted, the audio file size no longer depends on how big the source AVI was.
When you need to extract AAC from AVI
Home video archives
Many users have stored family videos and travel footage from the late 1990s and 2000s in AVI for years. These are recordings from the digital camcorders of the time, VHS digitisations, DVD rips of home movies. If the interest is the sound (greetings, songs, voices of relatives), extracting AAC produces a compact audio file in the tens of megabytes instead of a gigabyte sized AVI. This is useful for long term storage and sharing with relatives.
Old podcasts and educational video courses
In the mid 2000s many podcasts and educational courses were released as AVI files with DivX video and MP3 audio. If such an archive has survived (on DVDs, external drives, in cloud archives), AAC produces a convenient audio format for repeated listening and for publishing to modern podcast services. For speech and dialogue, 192 kbps AAC delivers transparent quality.
Recordings of lectures, conferences, webinars
For a long time lectures and conferences were recorded as AVI as the universal Windows format. If only the audio matters (the talk, the discussion, audience questions), extracting AAC cuts the file size by a factor of 5 to 10. A one hour lecture turns from a 500 MB AVI into an 85 MB AAC, comfortable to listen on the road, at work, on the move.
Archives of streaming recordings
In the 2000s and early 2010s tools for recording streams and the desktop (early Camtasia Studio, Fraps, FlashBack) often saved the result as AVI. Streamers and instructors who have hundreds of such files in storage can extract just the audio as AAC for separate storage and processing, freeing up disk space.
Sending to transcription APIs
Many automated speech recognition services (for transcribing lectures, interviews, work meetings) accept AAC as input. If the archive holds AVI files with interview or meeting recordings, AAC produces a ready format for delivery to any modern transcription service without extra intermediate conversions.
Preparing content for web radio and streaming
The ADTS form of AAC was originally designed for streaming. Each frame is self contained and carries its own header, so a listener can connect to a stream at any moment and start playback immediately. If you have an episode archive in AVI and you are preparing a feed for an internet radio station or a podcast service, raw AAC delivers minimal latency and stable behaviour during connection drops.
Extracting soundtracks from old rips
Music videos, concert recordings and movie excerpts were often kept in archives as AVI with MP3 or AC3 audio. AAC makes it possible to extract just the audio for adding to a playlist, using in projects, or repeated listening without the video.
Technical details of the extraction
Re encoding as the norm
Unlike MP4 to AAC, where re encoding is rare, in the case of AVI it happens almost every time. This is because AVI is an older container, and most audio codecs typical for AVI (PCM, MP3, AC3, MPEG-1 Layer II) are not directly compatible with the streaming ADTS form. Re encoding is performed in a single pass and introduces no audible artefacts at modern bitrates.
Bitrate and quality
The default 192 kbps is chosen as a sensible compromise. For speech content (lectures, podcasts, interviews) AAC LC at 192 kbps delivers transparent quality with significant headroom. For archives with MP3 192 kbps in the source, conversion to AAC 192 kbps does not lead to audible loss. For AC3 5.1 folded down to stereo, AAC at 192 kbps preserves full dialogue intelligibility and the depth of background music. For audiophiles with expensive headphones 256 kbps is available; going higher makes no sense.
Sample rate and channels
The sample rate is preserved as is: 44.1 kHz for most MP3 sources and webcam archives, 48 kHz for AC3 and DVD rips. Stereo stays stereo. A multichannel AC3 track (5.1, 6.1) is folded down to stereo during re encoding while preserving the balance between front channels and a phantom centre: dialogue stays intelligible, the musical background keeps depth. If the AVI carried a multichannel track and you want to preserve it, choose conversion to M4A or WAV.
Metadata and additional tracks
A raw ADTS stream does not support metadata the way M4A does. Track title, artist, cover art, release year - none of these can be stored inside an AAC file because of the format's design. AVI usually carries only basic metadata (title, description, sometimes author), which is lost during AAC extraction. If metadata matters, choose conversion to M4A or MP3 with ID3 tag support.
AAC profiles and compatibility
AAC LC (Low Complexity) is used by default - the most universal and compatible profile. It is supported by every device, including older car stereos and Smart TVs from previous generations. For archive recordings AAC LC is a safe choice that guarantees playback on any equipment.
Which files work best
AVI to AAC conversion handles any AVI file that carries at least one audio track. This covers practically every real world case:
- DivX and Xvid rips of films, series and educational material
- Digital archives of VHS tapes (home video, concerts, broadcasts)
- Recordings from digital camcorders from 1998 to 2010
- DVD rips with sound in AC3 or MP2
- TV captures from digital TV tuners
- Stream recordings and desktop captures from early tools like Camtasia
- Webcam Skype call recordings from the early 2000s (often AVI)
Files without an audio track (AVI timelapses, screen captures without microphone, test clips) cannot be converted to AAC - the service returns an error explaining there is no audio.
Broken or truncated AVI files. AVI does not handle index table damage well: if the index is corrupted, the file may not open at all. For such files it is worth trying to repair the index in a video utility (for example, in VLC through "fix") and then converting to AAC.
Duration and size. AAC is well suited to recordings of any length. For long lectures and films you get a compact file convenient for sharing through messengers and storing in the cloud.
Why AAC is a strong format
Minimal overhead
An AAC file consists almost entirely of audio data. There are no index tables, no container elements, no header redundancy. On long recordings the difference compared with M4A ranges from 0.5 to 2 percent in favour of AAC. When working with large AVI archives this delivers tangible disk space savings.
Self synchronisation during streaming reception
Each AAC frame begins with a unique sync signature, by which the player instantly locates the boundary of the next frame. If a stream connection breaks, the player automatically resynchronises to the nearest complete frame after recovery and continues playback without restarting the file. This is critical for internet radio and any scenario with an unstable network.
Universal compatibility
AAC is supported by all modern operating systems, browsers and mobile devices. Android plays AAC from the very first version, Windows and macOS have done so for decades. HTML5 audio in the browser decodes AAC natively through the audio tag. iOS has supported AAC at the hardware level since the very first iPod era.
Better quality than MP3 at the same bitrate
AAC is technically superior to MP3: a more accurate psychoacoustic model, more efficient handling of high frequencies, a more precise stereo image. At 128 kbps AAC sounds the way MP3 does at 192 kbps. At 192 kbps the difference from the source is indistinguishable for most listeners, even on quality headphones.
A natural fit for hardware decoders
Many hardware chips (DSPs in smartphones, TVs, car stereos) carry a built in AAC decoder. Playback through a hardware decoder consumes significantly less power than software decoding. This translates into longer battery life when listening on portable devices.
AAC vs the alternatives
| Format | Structure | Metadata | Size | When to choose |
|---|---|---|---|---|
| AAC | streaming ADTS | minimal | baseline | streaming, embedded devices, web radio |
| M4A | MP4 container | full iTunes | plus 1-2% | tagged music, audiobooks with chapters |
| MP3 | streaming | ID3 tags | plus 30% | maximum compatibility with old hardware |
| WAV | RIFF container | limited | 30-50x | mastering, lossless processing |
| FLAC | FLAC container | Vorbis comments | 5-10x | preserving lossless from a PCM AVI source |
| OGG | OGG container | Vorbis comments | plus 5-10% | open ecosystems, Linux |
If your priority is to feed audio into a stream, send it to an API, run it on web radio or on an IoT device, choose AAC. If you need tags, cover art and chapters (an audiobook cut from an AVI archive, for example), choose M4A. If compatibility with older hardware is the priority, MP3 remains the choice. If the source AVI had a PCM track and lossless quality matters, choose FLAC or WAV.
Limits and recommendations
AAC does not preserve the video stream. The video physically does not end up in the output file. If there is any chance the visuals will be needed later, keep the original AVI alongside the AAC.
Multichannel sound. If the AVI carried a 5.1 AC3 track (typical for DVD rips), it is folded down to stereo during AAC re encoding while preserving the balance. A full multichannel mix in a raw ADTS file is formally possible, but not all players handle it correctly. To guarantee that a multichannel mix is preserved, choose conversion to M4A or WAV.
Broken AVI indices. If the AVI file is damaged in the index portion, direct extraction may not work. For such files first try to repair the index in VLC or a specialised AVI utility, then convert.
Metadata. AVI usually holds minimal metadata (title, description, date). During AAC extraction it is lost. If cataloguing matters, choose M4A with tags or MP3 with ID3.
Different AVI versions. AVI exists in several versions (AVI 1.0, OpenDML AVI 2.0). The original AVI 1.0 has a 4 GB file size limit; OpenDML removes this limit. Modern players read both versions without distinction.
Protected content. Very rare AVI files may carry DRM (for example, early ASF certified rip variants). Audio extraction may not work in such cases. Ordinary user AVI files have no restrictions.
What is AVI to AAC conversion used for
Home video archives
Extract sound from family videos and travel footage from the late 1990s and 2000s stored as AVI. Recordings from digital camcorders, VHS digitisations and home movie DVDs turn into compact AAC files for long term storage and sharing with relatives.
Old podcasts and educational courses
Convert podcast and educational course episode archives from AVI to AAC for re publishing on modern podcast services. For speech and dialogue 192 kbps AAC delivers transparent quality.
Lectures, conferences and webinars
Extract the audio portion from lecture and conference recordings made in AVI. A one hour lecture turns from a 500 MB AVI into an 85 MB AAC, comfortable to listen to on the road or at work.
Streaming recordings archives
Free up disk space by extracting just the audio track from desktop and stream AVI recordings made with early Camtasia, Fraps and FlashBack. AAC is stored separately from the bulky video.
Sending to transcription services
Convert AVI with interview and meeting recordings to AAC for delivery to modern automated speech recognition services. AAC is accepted by every popular API without extra intermediate conversions.
Soundtrack extraction
Pull audio tracks from old music videos, concert recordings and movie excerpts in AVI for adding to playlists or using in projects without the video.
Tips for converting AVI to AAC
Be ready for re encoding
Unlike MP4, AVI almost never contains audio in AAC - it is usually MP3, AC3 or PCM. This means a one off re encoding step is performed during AAC conversion. At 192 kbps AAC LC the losses are imperceptible by ear, so this is not something to worry about.
Repair broken AVI files in advance
AVI is sensitive to damage of its index table. If the file does not open in a player, before conversion try repairing the index through VLC: Tools, Preferences, Input/Codecs, Damaged or incomplete AVI file - 'Always fix'.
Match bitrate to the content
For speech (lectures, podcasts, interviews) 128 kbps is enough - the voice sounds clean and the file stays compact. For music and films aim at 192 to 256 kbps. Going above 256 kbps in AAC delivers minimal quality gains at a noticeable size penalty.
Keep the original AVI if in doubt
After extraction the video cannot be recovered, it physically does not end up in the AAC. If you might need the picture later, keep the AVI alongside the AAC. The same applies to multichannel audio and rare metadata.