Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data

Oct 12, 2022

Automatically detecting the intent of an utterance is important for various downstream natural language processing tasks. This task, known as Dialogue Act Classification (DAC), has primarily been studied on spoken one-to-one conversations. The rise of social media makes it an interesting data source for DAC, although it comes with difficulties: non-standard form, a variety of language types (across and within platforms), and quickly evolving norms. In this paper, we therefore investigate the robustness of DAC on social media data. More concretely, we provide a benchmark that includes cross-domain data splits, and we evaluate a variety of improvements over our transformer-based baseline. Our experiments show that lexical normalization is not beneficial in this setup, that balancing the labels through resampling is beneficial in some cases, and that incorporating context is crucial for this task, leading to the highest performance improvements (~7 F1 percentage points in-domain and ~20 cross-domain).
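
To make "balancing the labels through resampling" concrete, here is a minimal sketch of one common variant, random oversampling of the less frequent labels. The abstract does not say which resampling scheme the paper uses, so this is an assumption for illustration only; the function name `oversample_to_balance` and the example labels are hypothetical.

```python
import random
from collections import Counter, defaultdict


def oversample_to_balance(examples, seed=0):
    """Randomly oversample so every label is as frequent as the most common one.

    `examples` is a list of (utterance, label) pairs. This is an illustrative
    assumption, not necessarily the resampling scheme used in the paper.
    """
    if not examples:
        return []

    rng = random.Random(seed)

    # Group the examples by their dialogue act label.
    by_label = defaultdict(list)
    for utterance, label in examples:
        by_label[label].append((utterance, label))

    # Every label is topped up to the size of the largest group.
    target = max(len(items) for items in by_label.values())

    balanced = []
    for items in by_label.values():
        balanced.extend(items)
        # Draw the missing examples with replacement from the same label.
        balanced.extend(rng.choices(items, k=target - len(items)))

    rng.shuffle(balanced)
    return balanced


# Hypothetical toy data: three questions, one greeting.
data = [
    ("what time is it?", "question"),
    ("are you coming?", "question"),
    ("why though?", "question"),
    ("hey everyone", "greeting"),
]
balanced = oversample_to_balance(data)
label_counts = Counter(label for _, label in balanced)
```

After the call, `label_counts` has the same count for both labels. Resampling of this kind is normally applied to the training split only, so that evaluation still reflects the original label distribution.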
