Acoustic model structure.
#include <acmod.h>
Data Fields | |
cmd_ln_t * | config |
Configuration. | |
logmath_t * | lmath |
Log-math computation. | |
glist_t | strings |
Temporary acoustic model filenames. | |
fe_t * | fe |
Acoustic feature computation. | |
feat_t * | fcb |
Dynamic feature computation. | |
bin_mdef_t * | mdef |
Model definition. | |
tmat_t * | tmat |
Transition matrices. | |
void * | mgau |
Either s2_semi_mgau_t or ms_mgau_t; to be made more type-safe in the future. | |
void(* | mgau_free )(void *) |
Function to deallocate mgau. | |
frame_eval_t | frame_eval |
Function to compute GMM scores. | |
int16 * | senone_scores |
GMM scores for current frame. | |
bitvec_t * | senone_active_vec |
Active GMMs in current frame. | |
int * | senone_active |
Array of active GMMs. | |
int | n_senone_active |
Number of active GMMs. | |
int | log_zero |
Zero log-probability value. | |
mfcc_t ** | mfc_buf |
Temporary buffer of acoustic features. | |
mfcc_t *** | feat_buf |
Temporary buffer of dynamic features. | |
FILE * | rawfh |
File for writing raw audio data. | |
FILE * | mfcfh |
File for writing acoustic feature data. | |
uint8 | state |
State of utterance processing. | |
uint8 | compallsen |
Compute all senones? | |
uint8 | grow_feat |
Whether to grow feat_buf. | |
uint8 | reserved |
int16 | output_frame |
Index of next frame of dynamic features. | |
int16 | n_mfc_alloc |
Number of frames allocated in mfc_buf. | |
int16 | n_mfc_frame |
Number of frames active in mfc_buf. | |
int16 | mfc_outidx |
Start of active frames in mfc_buf. | |
int16 | n_feat_alloc |
Number of frames allocated in feat_buf. | |
int16 | n_feat_frame |
Number of frames active in feat_buf. | |
int16 | feat_outidx |
Start of active frames in feat_buf. |
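The three bookkeeping fields kept for each buffer (frames allocated, frames active, and start of active frames) are consistent with a circular buffer. The following is a minimal sketch under that assumption; `ring_t`, `ring_push()` and `ring_pop()` are hypothetical names used only for illustration and are not part of acmod.h.

```c
#include <assert.h>

/* Hypothetical stand-in for one buffer's bookkeeping; not the real acmod_s. */
typedef struct {
    int n_alloc;  /* plays the role of n_mfc_alloc: capacity in frames */
    int n_frame;  /* plays the role of n_mfc_frame: frames currently held */
    int outidx;   /* plays the role of mfc_outidx: index of the oldest held frame */
} ring_t;

/* Reserve a slot for one incoming frame.
 * Returns the slot index to write to, or -1 if the buffer is full. */
int ring_push(ring_t *r)
{
    int inidx;

    assert(r->n_alloc > 0);
    if (r->n_frame == r->n_alloc)
        return -1;
    /* The write position follows the last active frame, wrapping around. */
    inidx = (r->outidx + r->n_frame) % r->n_alloc;
    r->n_frame++;
    return inidx;
}

/* Consume the oldest frame.
 * Returns its slot index, or -1 if the buffer is empty. */
int ring_pop(ring_t *r)
{
    int idx;

    if (r->n_frame == 0)
        return -1;
    idx = r->outidx;
    r->outidx = (r->outidx + 1) % r->n_alloc;
    r->n_frame--;
    return idx;
}
```

In this sketch a full buffer simply refuses new frames. A flag such as grow_feat suggests the real structure can instead enlarge feat_buf, which the sketch does not model.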
Acoustic model structure.
This object encapsulates all stages of acoustic processing, from raw audio input to acoustic score output. These modules are grouped together because they must all agree in their parameterizations: the configuration of the acoustic and dynamic feature computation depends entirely on the parameters used to build the original acoustic model (which should by now always be specified in a feat.params file).
Because blocks of input audio and frames of input features do not map one-to-one onto frames of acoustic scores (due to dynamic feature calculation), results may not be available immediately after input, and the output results will not correspond to the last piece of data input.
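The lag described above can be illustrated with a small model. This sketch assumes that dynamic features need a symmetric context window of `win` frames on each side, and that edge frames are replicated at the start and end of an utterance. Both are assumptions made for illustration; the function name `n_feat_available()` is hypothetical and not part of acmod.h.

```c
#include <assert.h>

/* Number of dynamic-feature frames that can be emitted after n_in input
 * frames have been received (hypothetical model, see assumptions above).
 *
 * Mid-utterance, the newest `win` input frames still lack their right-hand
 * context, so they cannot be output yet.  Once the utterance has ended,
 * the missing context is assumed to be padded and every frame is emitted. */
int n_feat_available(int n_in, int win, int end_of_utt)
{
    assert(n_in >= 0 && win >= 0);
    if (end_of_utt)
        return n_in;
    return n_in > win ? n_in - win : 0;
}
```

Under this model, a caller that has fed 10 frames with a window of 3 can score only 7 of them until more input arrives or the utterance ends, so the newest scored frame trails the newest input frame.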
Definition at line 102 of file acmod.h.
cmd_ln_t* acmod_s::config |
Configuration.
feat_t* acmod_s::fcb |
Dynamic feature computation.
Definition at line 110 of file acmod.h.
Referenced by acmod_free(), acmod_init(), acmod_process_cep(), acmod_process_feat(), and ps_get_feat().
fe_t* acmod_s::fe |
Acoustic feature computation.
Definition at line 109 of file acmod.h.
Referenced by acmod_end_utt(), acmod_free(), acmod_init(), acmod_process_raw(), acmod_start_utt(), and ps_get_fe().
mfcc_t*** acmod_s::feat_buf |
Temporary buffer of dynamic features.
Definition at line 130 of file acmod.h.
Referenced by acmod_free(), acmod_init(), acmod_process_cep(), acmod_process_feat(), and acmod_score().
frame_eval_t acmod_s::frame_eval |
Function to compute GMM scores.
Definition at line 121 of file acmod.h.
Referenced by acmod_score().
uint8 acmod_s::grow_feat |
Whether to grow feat_buf.
Definition at line 137 of file acmod.h.
Referenced by acmod_process_cep(), acmod_process_feat(), and acmod_set_grow().
logmath_t* acmod_s::lmath |
Log-math computation.
Definition at line 105 of file acmod.h.
Referenced by acmod_init(), ngram_search_init(), and ps_lattice_init_search().
bin_mdef_t* acmod_s::mdef |
Model definition.
Definition at line 113 of file acmod.h.
Referenced by acmod_clear_active(), acmod_free(), acmod_init(), bin_mdef_read_text(), ngram_search_init(), and ps_reinit().
mfcc_t** acmod_s::mfc_buf |
Temporary buffer of acoustic features.
Definition at line 129 of file acmod.h.
Referenced by acmod_end_utt(), acmod_free(), acmod_init(), and acmod_process_raw().
FILE* acmod_s::mfcfh |
File for writing acoustic feature data.
Definition at line 132 of file acmod.h.
Referenced by acmod_end_utt(), acmod_free(), acmod_process_cep(), and acmod_set_mfcfh().
void* acmod_s::mgau |
Either s2_semi_mgau_t or ms_mgau_t; to be made more type-safe in the future.
Definition at line 115 of file acmod.h.
Referenced by acmod_free(), and acmod_score().
void(* acmod_s::mgau_free)(void *) |
Function to deallocate mgau.
int acmod_s::n_senone_active |
Number of active GMMs.
Definition at line 125 of file acmod.h.
Referenced by acmod_clear_active(), and acmod_score().
int16 acmod_s::output_frame |
Index of next frame of dynamic features.
Definition at line 139 of file acmod.h.
Referenced by acmod_frame_idx(), acmod_rewind(), acmod_score(), acmod_start_utt(), ps_get_n_frames(), and ps_get_utt_time().
FILE* acmod_s::rawfh |
File for writing raw audio data.
Definition at line 131 of file acmod.h.
Referenced by acmod_end_utt(), acmod_free(), acmod_process_raw(), and acmod_set_rawfh().
int* acmod_s::senone_active |
Array of active GMMs.
Definition at line 124 of file acmod.h.
Referenced by acmod_free(), acmod_init(), and acmod_score().
bitvec_t* acmod_s::senone_active_vec |
Active GMMs in current frame.
Definition at line 123 of file acmod.h.
Referenced by acmod_clear_active(), acmod_free(), and acmod_init().
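One way the bit vector, the array of active GMMs, and the active count could relate is sketched below: the set bits of the vector are flattened into a list of indices. This is an illustration only. It uses plain 32-bit words rather than the bitvec_t type, and `collect_active()` is a hypothetical helper, not a function from acmod.h.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Scan a bit vector covering n_sen senones and write the index of every
 * set bit into `active`, in ascending order.
 * `active` must have room for n_sen entries.
 * Returns the number of indices written (the analogue of n_senone_active). */
int collect_active(const uint32_t *vec, int n_sen, int *active)
{
    int i, n = 0;

    assert(vec != NULL && active != NULL);
    for (i = 0; i < n_sen; ++i) {
        if (vec[i / 32] & (UINT32_C(1) << (i % 32)))
            active[n++] = i;
    }
    return n;
}
```

Keeping both forms is a common trade-off: the bit vector makes marking a senone active cheap and idempotent, while the flat list lets the scoring loop visit only the active entries.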
int16* acmod_s::senone_scores |
GMM scores for current frame.
Definition at line 122 of file acmod.h.
Referenced by acmod_free(), acmod_init(), and acmod_score().
uint8 acmod_s::state |
State of utterance processing.
Definition at line 135 of file acmod.h.
Referenced by acmod_end_utt(), acmod_init(), acmod_process_cep(), and acmod_start_utt().
glist_t acmod_s::strings |
Temporary acoustic model filenames.
tmat_t* acmod_s::tmat |
Transition matrices.
Definition at line 114 of file acmod.h.
Referenced by acmod_free(), and ngram_search_init().