Massive pile of books in a warehouse with a computer

Part 5

What Are LLMs? Guide – [Bibliography]

Resources are referenced when appropriate throughout the Beginner’s Guide to LLMs. However, there isn’t always a logical place to insert each piece of research that’s been influential in the creation of this guide. 

—The LLM Guide stands on the shoulders of Giants who’ve been instrumental in my education on machine learning, transformers, LLMs, AI, etc.

Highly recommend exploring the brilliant work referenced below. 

 

Bibliography

Bender, E.M. et al. (2021) “On the dangers of stochastic parrots,” Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency [Preprint]. Available at: https://doi.org/10.1145/3442188.3445922.

Brown, T.B. et al. (2020) Language models are few-shot learners, arXiv.org. Available at: https://arxiv.org/abs/2005.14165 (Accessed: March 25, 2023).

Bubeck, S. et al. (2023) Sparks of artificial general intelligence: Early experiments with GPT-4, arXiv.org. Available at: https://arxiv.org/abs/2303.12712 (Accessed: March 25, 2023).

Dynabench: Rethinking benchmarking in NLP – ACL Anthology (no date). Available at: https://aclanthology.org/2021.naacl-main.324.pdf (Accessed: March 25, 2023).

Face recognition technology (FERET) (2017) NIST. Available at: https://www.nist.gov/programs-projects/face-recognition-technology-feret (Accessed: March 25, 2023).

Leffer, L. (2022) Dall-e Mini is obsessed with women in saris, and no one knows why, Gizmodo. Gizmodo. Available at: https://gizmodo.com/dall-e-mini-women-in-saris-mystery-1849099921 (Accessed: March 25, 2023).

Maatta, T. (2022) Natural language processing (NLP), Medium. Medium. Available at: https://tmmtt.medium.com/natural-language-processing-nlp-dc2c1d8d4110 (Accessed: March 25, 2023).

Mitchell, M. (2023) Did chatgpt really pass graduate-level exams?, Did ChatGPT Really Pass Graduate-Level Exams? AI: A Guide for Thinking Humans. Available at: https://aiguide.substack.com/p/did-chatgpt-really-pass-graduate (Accessed: March 25, 2023).

Bender, E.M. and Hanna, A. (2023) Mystery Ai Hype Theater 3000, videos.trom.tf. Available at: https://videos.trom.tf/w/p/4gykGcMrmHHs7bG2Y6qK9W?playlistPosition=1&resume=true (Accessed: March 25, 2023). 

Quach, K. (2020) Ai me to the moon… carbon footprint for ‘training GPT-3’ same as driving to our natural satellite and back, The Register® – Biting the hand that feeds IT. The Register. Available at: https://www.theregister.com/2020/11/04/gpt3_carbon_footprint_estimate/ (Accessed: March 25, 2023).

Raji, I.D. et al. (no date) AI and the everything in the whole wide world benchmark – neurips. Available at: https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/file/084b6fbb10729ed4da8c3d3f5a3ae7c9-Paper-round2.pdf (Accessed: March 25, 2023).

Schuchmann, S. (2020) History of the first AI Winter, Medium. Towards Data Science. Available at: https://towardsdatascience.com/history-of-the-first-ai-winter-6f8c2186f80b (Accessed: March 25, 2023).

Stochastic parrots reading / viewing list (no date) Google Docs. Google. Available at: https://docs.google.com/document/d/1bG0yIdawiUvwh7m0AnXV5W6JHkK9xwXemuVjSU5tbhQ/preview (Accessed: March 25, 2023).

Strubell, E., Ganesh, A. and McCallum, A. (2019) Energy and policy considerations for Deep Learning in NLP, arXiv.org. Available at: https://arxiv.org/abs/1906.02243 (Accessed: March 25, 2023). 

Deep learning with python (2017). New York: Manning Publications.

FOSTER, D.A.V.I.D. (2023) Generative deep learning: Teaching machines to paint, write, compose, and play. S.l.: O’REILLY MEDIA.

Huang, S. et al. (2023) Language is not all you need: Aligning perception with language models, arXiv.org. Available at: https://arxiv.org/abs/2302.14045 (Accessed: March 25, 2023).

Tunstall, L., Werra, L.von and Wolf, T. (2022) Natural language processing with transformers: Building language applications with hugging face. Sebastopol, CA: O’Reilly.

What is CHATGPT, how does it work, and how is it impacting academia … (no date). Available at: https://www.wwu.edu/node/27967 (Accessed: March 25, 2023).

Wolfram, S. (2023) What is chatgpt doing … and why does it work?, Stephen Wolfram Writings RSS. Available at: https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/ (Accessed: March 25, 2023).

Wu, C. et al. (2023) Visual chatgpt: Talking, drawing and editing with Visual Foundation models, arXiv.org. Available at: https://arxiv.org/abs/2303.04671 (Accessed: March 25, 2023). 

Wang, A. et al. (2020) Superglue: A stickier benchmark for general-purpose language understanding systems, arXiv.org. Available at: https://arxiv.org/abs/1905.00537 (Accessed: March 25, 2023). 

Weil, E. (2023) You are not a parrot, Intelligencer. Available at: https://nymag.com/intelligencer/article/ai-artificial-intelligence-chatbots-emily-m-bender.html (Accessed: March 25, 2023). 

Carino, Meghan McCarty, and Emily Bender. “Do We Have an AI Hype Problem?” Marketplace Tech Podcast, 3 Apr. 2023, https://www.marketplace.org/shows/marketplace-tech/do-we-have-an-ai-hype-problem.

Conover, Adam, and Emily Bender. “The Real Problem with A.I. with Emily Bender.” Earwolf – Factually Podcast, https://www.earwolf.com/episode/the-real-problem-with-a-i-with-emily-bender/.

Dickson, Ben Dickson. “Abductive Inference: The Blind Spot of Artificial Intelligence.” TechTalks, 20 Sept. 2021, https://bdtechtalks.com/2021/09/20/myth-of-artificial-intelligence-erik-larson/.

Dickson, Ben. “Why Exams Intended for Humans Might Not Be Good Benchmarks for Llms like GPT-4.” VentureBeat, VentureBeat, 29 Mar. 2023, https://venturebeat.com/ai/why-exams-intended-for-humans-might-not-be-good-benchmarks-for-gpt-4/.

Gebru, Timnit, et al. “Datasheets for Datasets.” ArXiv.org, 1 Dec. 2021, https://arxiv.org/abs/1803.09010.

Metz, Rachel. “CHATGPT, Bing and Bard Don’t Hallucinate. They Fabricate.” Bloomberg.com, Bloomberg, 3 Apr. 2023, https://www.bloomberg.com/news/newsletters/2023-04-03/chatgpt-bing-and-bard-don-t-hallucinate-they-fabricate.

Mitchell, Margaret, et al. “Model Cards for Model Reporting.” ArXiv.org, 14 Jan. 2019, https://arxiv.org/abs/1810.03993.

—Will work to keep this list up to date as the Guide goes through updates and add-ons.

Level Up Your Data Science Skills

Learn

Get in nerd, we’re going learning!

Achieve

Gain confidence by building stuff!

Apply Your Skills

Go forth & apply your skills in meaningful ways.

Join today!