[EPIC] Sparsification for KPI extraction question answering task #231
Labels
enhancement
New feature or request
sparsification
Indicates that the issue exists to achieve model sparsification.
The current KPI extraction question answering model is large (1.7 GB) and takes about 7 minutes in total to run inference on a single PDF. We want to find a smaller, faster version of the model that achieves similar performance.
Overall, we want to investigate model pruning and use tools such as NeuralMagic to measure the performance impact of different levels of pruning. This EPIC is the first step toward that goal.
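
To make "different levels of pruning" concrete, here is a minimal sketch of unstructured magnitude pruning on a toy weight list, using only the Python standard library. It is not the NeuralMagic API and not the project's pipeline: the function names `magnitude_prune` and `measured_sparsity`, the random weights, and the sparsity levels are all assumptions chosen for illustration. In the real investigation, each level would be paired with the QA model's evaluation metric and inference time rather than a sparsity count alone.

```python
import random


def magnitude_prune(weights, sparsity):
    """Return a copy of `weights` with the smallest-magnitude fraction set to zero.

    Hypothetical helper for illustration; `sparsity` is the fraction in [0, 1]
    of weights to zero out.
    """
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be in [0, 1]")
    n_prune = int(len(weights) * sparsity)
    # Indices ordered by absolute value, smallest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def measured_sparsity(weights):
    """Fraction of weights that are exactly zero."""
    return sum(1 for w in weights if w == 0.0) / len(weights)


# Toy stand-in for one layer's weights (assumption: not the real model).
random.seed(0)
weights = [random.gauss(0.0, 1.0) for _ in range(1000)]

# Sweep a few pruning levels, as one might when comparing accuracy vs. size.
levels = [0.0, 0.5, 0.8, 0.9]
report = {s: measured_sparsity(magnitude_prune(weights, s)) for s in levels}
for s, actual in report.items():
    print(f"target sparsity {s:.0%} -> measured {actual:.0%}")
```

A sweep like this only shows how many weights are removed at each level; the open question for this EPIC is how much extraction quality and inference time change at each level on the actual model.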