This repository provides the PyTorch implementation for the paper Slot Induction via Pre-trained Language Model Probing and Multi-level Contrastive Learning (SIGDIAL 2023).
The simplest way to set up the environment is to run our prepared BASH script as follows (NOTE: Anaconda needs to be installed before running the script):
```bash
bash setup_env.sh
```
Your own virtual environment: make sure you have python==3.9.12 and PyTorch==1.12.1 properly installed, then use pip to install the remaining required packages:
```bash
pip install -r requirements.txt
```
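For reference, a minimal sketch of the manual setup with conda (the environment name is a placeholder; pick the PyTorch 1.12.1 build matching your CUDA version):

```bash
# Sketch of a manual setup; the environment name is a placeholder
conda create -n slot-induction python==3.9.12 -y
conda activate slot-induction
# PyTorch 1.12.1 as required above; choose the wheel matching your CUDA setup
pip install torch==1.12.1
pip install -r requirements.txt
```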
We provide our prepared split of the P1 and P2 datasets for both SNIPS and ATIS under './dataset/' (see our paper for further details).
The main arguments (configurable within the corresponding run_model.sh script) are listed below. We recommend tuning the hyperparameters before use, as results are sensitive to hardware architecture; see the example invocation after the configuration lists.
- --ckpt_dir: directory for saving checkpoints
- --epoch: number of training epochs
- --lr: training learning rate
- --dataset: dataset to train/evaluate on (i.e. SNIPS_P1 / ATIS_P1)
Tuning hyperparameters
- seg_level: depth level of the segmentation tree used to extract semantic segments ($d$)
- sent_temp: SentCL temperature ($\tau_{d}$)
- seg_temp: SegCL temperature ($\tau_{s}$)
- sent_coeff: coefficient for the SentCL loss ($\gamma$)
- seg_coeff: coefficient for the SegCL loss ($\delta$)
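As a plausible reading of how these coefficients enter training (our assumption; see the paper for the exact objective), the two contrastive terms would be added to the base loss as a weighted sum, with $\tau_{d}$ and $\tau_{s}$ acting as the temperatures inside the SentCL and SegCL objectives, respectively:

$$\mathcal{L} = \mathcal{L}_{\text{base}} + \gamma \, \mathcal{L}_{\text{SentCL}} + \delta \, \mathcal{L}_{\text{SegCL}}$$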
Optional Configuration
- ratio_seg: ratio of segments for cropping (augmentations)
- mask_type: whether or not to apply augmentations (i.e. no_mask or mask_seg)
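For illustration, here is one hypothetical way these flags might be set inside run_model.sh. The entry point main.py and every value below are placeholder assumptions, not the repository's actual defaults; the real settings live in the provided run_model.sh scripts.

```bash
# Hypothetical sketch only: the entry point and all values are placeholders
python main.py \
  --ckpt_dir ./ckpt/snips_p1 \
  --epoch 30 \
  --lr 5e-5 \
  --dataset SNIPS_P1 \
  --seg_level 2 \
  --sent_temp 0.1 \
  --seg_temp 0.1 \
  --sent_coeff 0.5 \
  --seg_coeff 0.5 \
  --ratio_seg 0.2 \
  --mask_type mask_seg
```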
The following scripts are for Slot Induction training and evaluation (P1).
SNIPS
```bash
cd ./code/script/trad/
bash run_model.sh
```
ATIS
```bash
cd ./code/script/atis/
bash run_model.sh
```
If you find our ideas, code or dataset helpful, please consider citing our work as follows:
```bibtex
@article{nguyen2023slot,
  title={Slot Induction via Pre-trained Language Model Probing and Multi-level Contrastive Learning},
  author={Nguyen, Hoang H and Zhang, Chenwei and Liu, Ye and Yu, Philip S},
  journal={arXiv preprint arXiv:2308.04712},
  year={2023}
}
```
Our UPL implementation is adapted from Perturbed Masking
Our dataset is adapted from Capsule-NLU