Publications

I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token
Knowledge Acquisition through Continued Pretraining is Difficult: A Case Study on r/AskHistorians
Efficient Parallelization Layouts for Large-Scale Distributed Model Training
Art Creation with Multi-Conditional StyleGANs
Generation of Bots Based on Observed Behavior