Prerequisites

How it works

Compile train graphs using the lexicon graph. This graph is specific to the individual sentences which leads to better performance than using the entire HCLG graph
Use the trained model, train graphs, ivectors, and MFCCs to perform force alignment. This produces exp/1.ali
Calculate the phone sequences from exp/1.ali. And use clean-phones.py to remove position dependency and stress

The output will be in exp/trans_cleaned.txt

There are other parameters that can be set in nnet3-align-to-phones.sh