Scp looks very simple format, but has several powerful features. The number after colon is a starting addressof the object of the file. The first column, uttid1, indicates the utterance id and the second, /some/where/feats.ark:123, is the file path of matrix/vector of kaldi formats. Scp is a text file such as, uttid1 /some/where/feats.ark:123 Unix command can be used as read-specifier and wspecifier copy-feats ark:'gunzip -c |' ark:some.ark indicates standard input stream or output stream. cat test.ark | copy-feats ark:- ark,t:- | less # Show the contents in the ark If you have Kaldi, you can convert it to text format as following # copy-feats Ĭopy-feats is designed to have high affinity with unix command line:Īrk can be flushed to and from unix pipe. This ia an example of ark file of KaldiMatrix: ark file This library mainly support KaldiMatrix/KaldiVector. More detail about the File-IO in Kaldi-asr: Basic of File IO in kaldi: Ark and copy-featsĪrk is an archive format to save any Kaldi objects. In this section, we describe the basic concept of ark and scp. ark and scp are used in in order to archive some objects defined in Kaldi, typically it is Matrix object of Kaldi. Kaldiio is an IO utility implemented in pure Python language for several file formats used in kaldi, which are named as ark and scp. A pure python module for reading and writing kaldi ark files
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |