Add connectionist temporal classification (CTC) loss algorithm #11240

ZJUGuoShuai · 2024-01-13T15:03:08Z

Describe your change:

Connectionist Temporal Classification (CTC) loss is used in speech recognition, handwriting recognition, and other sequence problems. It lets a model be trained without knowing the alignment between the input and the output.
The implementation has been verified to agree with PyTorch's CTCLoss.

Add an algorithm?
Fix a bug or typo in an existing algorithm?
Add or change doctests? -- Note: Please avoid changing both code and tests in a single pull request.
Documentation change?

ZJUGuoShuai · 2024-01-16T02:22:56Z

Hi there, I'd like to thank the maintainers for their hard work on the project. I've submitted a few pull requests over the last few days and would really appreciate it if one of you could take a look and let me know whether anything needs to be fixed before they can be merged.

Thanks again for all your hard work! @cclauss

tianyizheng02 · 2024-06-03T12:40:37Z

machine_learning/loss_functions.py

+    Calculate the connectionist temporal classification (CTC) loss between the given
+    log probabilities and targets.
+
+    CTC loss is used in speech recognition, handwriting recognition and other sequence
+    problems. It's used to get around not knowing the alignment between the input and
+    the output.
+
+    References:
+    - https://en.wikipedia.org/wiki/Connectionist_temporal_classification
+    - https://pytorch.org/docs/stable/generated/torch.nn.CTCLoss.html


A couple of suggestions to clarify the documentation:

PyTorch cites Graves et al. for its implementation. Since your implementation is also based on this paper, please add it as a reference as well.

Please add a short paragraph explaining how the loss is actually calculated. It should answer some general questions about your variables (What is blank? What is alpha, and why is it calculated using dynamic programming?) so that the reader understands what is being calculated. I ask for this because this repository is meant for educational purposes, so we want readers to understand how and why the implementation works.

Also, your implementation calculates alpha with np.logaddexp on log-probabilities rather than working with probabilities directly. Since this differs from the definitions in the Graves et al. paper, please be sure to note this implementation detail in your explanation.
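To make the request concrete, here is a minimal sketch of the kind of explanation I have in mind. It is not the code in this PR: the function name `ctc_loss_sketch`, its signature, and the pure-Python `logaddexp` helper (standing in for np.logaddexp) are assumptions made for illustration, and it handles a single sequence only. The target is extended with a blank before, between, and after every label; alpha[s] holds the log-probability of all alignments of the frames seen so far that end at position s of that extended target. The recursion follows Graves et al., except that sums of probabilities become logaddexp and products become additions, which avoids underflow on long sequences.

```python
import math

NEG_INF = float("-inf")


def logaddexp(a: float, b: float) -> float:
    """Return log(exp(a) + exp(b)) without leaving log space."""
    if a == NEG_INF:
        return b
    if b == NEG_INF:
        return a
    hi, lo = max(a, b), min(a, b)
    return hi + math.log1p(math.exp(lo - hi))


def ctc_loss_sketch(
    log_probs: list[list[float]], target: list[int], blank: int = 0
) -> float:
    """Negative log-likelihood of `target` under CTC for one sequence.

    log_probs[t][k] is the log-probability of class k at time step t.
    (Hypothetical helper for illustration, not the PR's function.)
    """
    # Extended target: a blank before, between, and after every label.
    ext = [blank]
    for label in target:
        ext += [label, blank]
    num_steps, num_states = len(log_probs), len(ext)

    # At t = 0 an alignment may start on the first blank or the first label.
    alpha = [NEG_INF] * num_states
    alpha[0] = log_probs[0][ext[0]]
    if num_states > 1:
        alpha[1] = log_probs[0][ext[1]]

    for t in range(1, num_steps):
        prev = alpha
        alpha = [NEG_INF] * num_states
        for s in range(num_states):
            total = prev[s]  # stay on the same symbol
            if s >= 1:
                total = logaddexp(total, prev[s - 1])  # advance by one
            # Skipping the blank is allowed only between two different labels.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total = logaddexp(total, prev[s - 2])
            alpha[s] = total + log_probs[t][ext[s]]

    # A valid alignment ends on the last label or on the trailing blank.
    total = alpha[-1]
    if num_states > 1:
        total = logaddexp(total, alpha[-2])
    return -total
```

With two frames, uniform probabilities over {blank, label 1}, and target [1], the three valid alignments (1 then blank, blank then 1, 1 then 1) each have probability 0.25, so the loss is -log(0.75). A target of [1, 1] cannot fit in two frames because a blank must separate the repeated label, so the loss is infinite.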

Add connectionist temporal classification (CTC) loss algorithm

95859d3

imSanko approved these changes Jan 14, 2024

algorithms-keeper bot mentioned this pull request Jan 17, 2024

Add sparse categorical cross entropy loss algorithm #11249

Closed

15 tasks

tianyizheng02 requested changes Jun 3, 2024

Merge branch 'master' into add-ctc-loss

265d881
