Skip to content

Commit ef9319b

Browse files
authored
add arxiv id.
1 parent b6214dd commit ef9319b

File tree

1 file changed

+5
-3
lines changed

1 file changed

+5
-3
lines changed

README.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,8 @@
11
# MaskLID
22

3-
This repository maintains the code for MaskLID: Code-Switching Language Identification through Iterative Masking (ACL-main 2024).
3+
This repository maintains the code for
4+
5+
**MaskLID: Code-Switching Language Identification through Iterative Masking** (ACL 2024): https://arxiv.org/abs/2406.06263
46

57
MaskLID is a simple, yet effective, code-switching (CS) language identification (LID) method. MaskLID does not require any training and is designed to complement current high-performance sentence-level LIDs. Sentence-level LIDs are classifiers trained on monolingual texts to provide single labels, typically using a softmax layer to turn scores into probabilities. However, in cases where a sentence is composed in both L1 and L2 languages, the LID classifier often only returns the dominant label L1. To address this limitation, MaskLID employs a strategy to **mask** text features associated with L1, allowing the LID to classify the text as L2 in the next round. This method uses the LID itself to identify the features that require masking and does not rely on any external resource.
68

@@ -69,7 +71,7 @@ If you find our method and code useful for your research, please cite:
6971
@article{kargaran2024masklid,
7072
title={MaskLID: Code-Switching Language Identification through Iterative Masking},
7173
author={Kargaran, Amir Hossein and Yvon, Fran{\c{c}}ois and Sch{\"u}tze, Hinrich},
72-
journal={arXiv preprint},
74+
journal={arXiv preprint arXiv:2406.06263},
7375
year={2024}
7476
}
75-
``
77+
```

0 commit comments

Comments
 (0)