In today's world, Algebraic code-excited linear prediction is a topic that arouses great interest and debate in different areas of society. Its relevance and diversity of approaches has led to extensive discussion and reflection on its implications. From academic perspectives to the everyday environment, Algebraic code-excited linear prediction has generated endless questions and positions that seek to understand its scope and impact on our reality. In this article, we will delve into a detailed analysis of Algebraic code-excited linear prediction, exploring its different aspects and offering a comprehensive vision to understand its importance and current challenges.
Algebraic code-excited linear prediction (ACELP) is a speech coding algorithm in which a limited set of pulses is distributed as excitation to a linear prediction filter. It is a linear predictive coding (LPC) algorithm that is based on the code-excited linear prediction (CELP) method and has an algebraic structure. ACELP was developed in 1989 by the researchers at the Université de Sherbrooke in Canada.[1]
The ACELP method is widely employed in current speech coding standards such as AMR, EFR, AMR-WB (G.722.2), VMR-WB, EVRC, EVRC-B, SMV, TETRA, PCS 1900, MPEG-4 CELP and ITU-T G-series standards G.729, G.729.1 (first coding stage) and G.723.1.[2][3][4][5] The ACELP algorithm is also used in the proprietary ACELP.net codec.[6] Audible Inc. use a modified version for their speaking books. It is also used in conference-calling software, speech compression tools and has become one of the 3GPP formats.
The ACELP patent expired in 2018 and is now royalty-free.[7]
The main advantage of ACELP is that the algebraic codebook it uses can be made very large (> 50 bits) without running into storage (RAM/ROM) or complexity (CPU time) problems.
The ACELP algorithm is based on that used in code-excited linear prediction (CELP), but ACELP codebooks have a specific algebraic structure imposed upon them.
A 16-bit algebraic codebook shall be used in the innovative codebook search, the aim of which is to find the best innovation and gain parameters. The innovation vector contains, at most, four non-zero pulses.
In ACELP, a block of N speech samples is synthesized by filtering an appropriate innovation sequence from a codebook, scaled by a gain factor g c, through two time-varying filters.
The long-term (pitch) synthesis filter is given by:
The short-term synthesis filter is given by: