Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Emoji Normalization Feature #22

Open
mechamobau opened this issue Jan 2, 2024 · 1 comment
Open

Emoji Normalization Feature #22

mechamobau opened this issue Jan 2, 2024 · 1 comment

Comments

@mechamobau
Copy link
Contributor

mechamobau commented Jan 2, 2024

I was thinking about this lib and there's a growing need to handle emojis effectively in text normalization. This feature would convert emojis into their corresponding textual descriptions, making the text more comprehensible and analyzable, especially when processing social media content or informal communications.

Use Case:
Often, emojis are used in texts to convey emotions or actions that are not captured by plain text. Normalizing these into words can aid in sentiment analysis, text-to-speech applications, and in contexts where emojis are not supported or are less meaningful.

Implementation Idea:
We could create a mapping of commonly used emojis to their respective descriptive phrases. The normalization function should then detect these emojis in the text and replace them with the mapped phrases.

It's possible to use Gitmoji project as reference, because their project has the list with all emoji and codes that is possible to use in commit messages, and this feature can adapt with it's own context (e.g they have :bug: as emoji for commits that solves bugs, maybe :insect: or something like that can be used in the place), and Github has it's own text-to-emoji cheatsheet too

Potential Challenges:

  • Ensuring comprehensive coverage of frequently used emojis.
  • Deciding on standardized descriptive text for each emoji, considering cultural and contextual variances.

Benefits:

  • Enhances the utility of text normalization in modern communication contexts.
  • Facilitates better understanding and processing of texts rich in emojis.

I believe this feature would be a valuable addition to the 'normalize-text' project, helping people that want to support apps that receives emoji codes and handles the emoji as needed.

@mechamobau mechamobau changed the title Implement Emoji Normalization Feature Emoji Normalization Feature Jan 2, 2024
@mechamobau
Copy link
Contributor Author

mechamobau commented Jan 2, 2024

What I was thinking about the API is something like that:

normalizeEmoji(`Seek knowledge :alien::rocket:`) // "Seek knowledge 👽🚀"

This is merely a suggestion and might not align perfectly with the project's objectives

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant