Navigating an `AlignedTextGrid`

This documentation covers reading in the output from the Montreal Forced Aligner using the Word and Phone classes from aligned_textgrid, but everything will generalize to custom classes.

from aligned_textgrid import AlignedTextGrid
from aligned_textgrid import Word, Phone

Reading in a TextGrid

To read in a one-speaker TextGrid, either give AlignedTextGrid() the path to the file, or a textgrid that has already been read in with praatio.textgrid.openTextgrid().

You also need to specify the sequence classes of each tier in the order they appear. For MFA output, the top tier is Word and the bottom tier is Phone, but if these were reversed, you would have to pass [Phone, Word] to entry_classes. The information about which class is the superset and which is the subset is encoded in the class information, and is automatically handled.

one_speaker = AlignedTextGrid(
    textgrid_path = "../resources/josef-fruehwald_speaker.TextGrid", 
    entry_classes = [Word, Phone]
)

With a two or more speaker TextGrid, you can either pass entry_classes a single list of interval classes to re-use with each speaker (for example [Word, Phone]), or an explicit list of nested classes (for example, [[Word, Phone], [Word, Phone]]).

two_speaker = AlignedTextGrid(
    textgrid_path = "../resources/KY25A_1.TextGrid",
    entry_classes= [Word, Phone]
)

If you have a textgrid a mixture of sequence hierarchies, you have to read it in with then fully nested list of classes.

from aligned_textgrid import custom_classes
Turn = custom_classes("Turn")

multi_hierarchy = AlignedTextGrid(
    textgrid_path = "../resources/KY25A_1_multi.TextGrid",
    entry_classes = [[Word, Phone], [Turn], [Word, Phone], [Turn]]
)

print(multi_hierarchy)

AlignedTextGrid with 4 groups, each with [2, 1, 2, 1] tiers. [['Word', 'Phone'], ['Turn'], ['Word', 'Phone'], ['Turn']]

Navigating the `AlignedTextGrid` object

Every AlignedTextGrid object contains at least one TierGroup, which in turn contains at least one SequenceTier.

This information is available if you print the object:

print(two_speaker)

AlignedTextGrid with 2 groups, each with [2, 2] tiers. [['Word', 'Phone'], ['Word', 'Phone']]

Or if you compare the len() of the one speaker vs two speaker textgrids.”

print(len(one_speaker))
print(len(two_speaker))

1
2

To get the Word tier of the first speaker in one_speaker, we can index it with [0][0]

one_speaker[0][0]

Sequence tier of Word; .superset_class: Top_wp; .subset_class: Phone

If you’d prefer to wrote more verbose but explicit code, you can also access tiers via the .tier_groups and .tier_list attributes as well.

one_speaker.tier_groups[0].tier_list[0]

Sequence tier of Word; .superset_class: Top_wp; .subset_class: Phone

To access the individual sequence intervals in a tier, you can also use indexing.

one_speaker[0][0][3]

Class Word, label: sunlight, .superset_class: Top_wp, .super_instance, None, .subset_class: Phone, .subset_list: ['S', 'AH1', 'N', 'L', 'AY2', 'T']

Tiers are also iterable.

for i in range(5):
    print(one_speaker[0][0][i].label)


when
the
sunlight
strikes

Once you’ve gotten to a sequence interval, indexing goes into its .subset_list

The len() of a tier returns how many sequence intervals it constains.

[len(one_speaker[0][0]), len(one_speaker[0][1])]

[377, 1191]

Get interval at time

The “Get interval at time” functionality from Praat has been implemented for each level of TextGrid representation.

speaker_one = two_speaker[0]
speaker_one_word = speaker_one[0]

speaker_one_word.get_interval_at_time(11)

This is the index for the word that appears at 11 seconds.

speaker_one.get_intervals_at_time(11)

[1, 2]

These are the indices for the word and phone tiers that are at 11 seconds.

two_speaker.get_intervals_at_time(11)

[[1, 2], [39, 96]]

two_speaker.get_intervals_at_time(11)

[[1, 2], [39, 96]]

These are the indices for the word and phone tiers for both speakers at 11 seconds.

Nested indexing

You can use the nested indices returned by .get_intervals_at_time() to get the actual sequence intervals as well.

eleven_seconds = two_speaker.get_intervals_at_time(11)
two_speaker[eleven_seconds]

[[Class Word, label: yeah, .superset_class: Top_wp, .super_instance, None, .subset_class: Phone, .subset_list: ['Y', 'AE1'],
  Class Phone, label: AE1, .superset_class: Word, .super_instance: yeah, .subset_class: Bottom_wp],
 [Class Word, label: after, .superset_class: Top_wp, .super_instance, None, .subset_class: Phone, .subset_list: ['AE1', 'F', 'T', 'ER0'],
  Class Phone, label: F, .superset_class: Word, .super_instance: after, .subset_class: Bottom_wp]]

Reuse

GPLv3

Reading in a TextGrid

Navigating the AlignedTextGrid object

Get interval at time

Nested indexing

Reuse

Navigating the `AlignedTextGrid` object