Large Language Models
SummLlama: Faithful Summarization Model [Link]
Ext2Gen: Robust Generation Model for RAG [Link]
Benchmark Datasets
UniSumEval: Benchmark Data for Text Summarization [Link]
ToFuEval: Hallucination Benchmark Data [Link]
ANIMAL-10N Data: Real-world Data with Noisy Labels [Link]
Algorithms
Retrieval for RAG
Word2Passage: Accurate Retrieval Method [Link]
Fast Autoregressive Decoding [Link]
DisCal: Calibrated Distillation [Link]
Robust Deep Learning
Awesome Noisy Labels [Link]
SELFIE: Robust Training against Noisy Labels [Link]
Prune4Rel: Robust Data Pruning [Link]
MQNet: Open-set Active Learning [Link]
FedRN: Federated Learning with Noisy Labels [Link]
Continual Learning
SDP: Scheduled Data Prior for Continual Learning [Link]