AI Language Models
karpathy/nn-zero-to-hero: Neural Networks: Zero to Hero
karpathy/makemore: An autoregressive character-level language model for making more things
karpathy/nanoGPT: The simplest, fastest repository for training/finetuning medium-sized GPTs.
(91) The spelled-out intro to neural networks and backpropagation: building micrograd - YouTube
(91) Andrej Karpathy - YouTube
(5) Andrej Karpathy (@karpathy) / Twitter
ChatGPT: Optimizing Language Models for Dialogue
Aligning Language Models to Follow Instructions
Recurrent neural network - Wikipedia
Transformer (machine learning model) - Wikipedia
Model index for researchers - OpenAI API
New GPT-3 Capabilities: Edit & Insert
OpenAI Codex Live Demo - YouTube
OpenAI has solved the XY problem | Hacker News
Building A Virtual Machine inside ChatGPT
ChatGPT Experiments - a Collection by Team CodePen on CodePen
Responding to recruiter emails with GPT-3 | Matt’s programming blog
Using GPT-3 to explain how code works
Competitive programming with AlphaCode
Show HN: If VS Code had a data-centric IDE sibling, what would that look like? | Hacker News
PromptLayer - The first platform built for prompt engineers
Facebook’s five pillars of Responsible AI
DocuChat - It’s time to talk to your documents
Vector Database for Vector Search | Pinecone
Welcome to GPT Index! — GPT Index documentation
Starter Tutorial — GPT Index documentation
A Primer to using GPT Index — GPT Index documentation
Defining LLMs — GPT Index documentation
Cohere | Building the Future of AI | Cohere
EleutherAI - text generation testing UI
bigscience/bloom · Hugging Face
bigscience/bloom-7b1 · Hugging Face
NouamaneTazi/bloomz.cpp: C++ implementation for BLOOM
Petals – Decentralized platform for running 100B+ language models
yandex/YaLM-100B: Pretrained language model with 100B parameters
salesforce/ctrl: Conditional Transformer Language Model for Controllable Generation
GitHub Copilot litigation · Joseph Saveri Law Firm & Matthew Butterick
Google AI updates: Bard and new AI features in Search
re:tune | the missing frontend for GPT-3
Adept: Useful General Intelligence
Simon Willison: “It’s increasingly apparent tha…” - Mastodon
Twitter pranksters derail GPT-3 bot with newly discovered “prompt injection” hack | Ars Technica
Relatedly, my company discovered the same issue and published this paper preprin… | Hacker News
[1908.07125] Universal Adversarial Triggers for Attacking and Analyzing NLP
[2212.03551] Talking About Large Language Models
Refined ChatGPT UI with extra features - ChatKit
Be My Eyes - See the world together
Introducing ChatGPT and Whisper APIs
No, DALL-E doesn’t have a secret language | Hacker News
xenova/transformers.js: Run 🤗 Transformers in your browser!
antimatter15/alpaca.cpp: Locally run an Instruction-Tuned Chat-Style LLM
Replicate – Run open-source machine learning models with a cloud API
Train and run Stanford Alpaca on your own machine - Replicate – Replicate
Fine-tune LLaMA to speak like Homer Simpson - Replicate – Replicate
Don’t trust AI to talk accurately about itself: Bard wasn’t trained on Gmail
Is the AI spell-casting metaphor harmful or helpful?
Simon Willison on promptengineering
Simon Willison on generativeai
An infinite number of monkeys eventually wrote this blog post (Interconnected)
(76) GPT-4 : Napkin Developer - YouTube
TECHNOLOGICAL SINGULARITY by Vernor Vinge
The Unpredictable Abilities Emerging From Large AI Models | Quanta Magazine
ChatGPT DAN 5.0 Jailbreak | Know Your Meme
ChatGPT Is Nothing Like a Human, Says Linguist Emily Bender
These engineers are being hired to get the most out of AI tools without coding | CBC Radio
Prompt Injections are bad, mkay?
Machine Learning: The High Interest Credit Card of Technical Debt – Google Research
Hallucination (artificial intelligence) - Wikiwand
GPT/ChatGPT Experiments - a Collection by Team CodePen on CodePen
prompts/JACK—GPT4-Prompt-Injection at main · abilzerian/prompts · GitHub
GitHub Copilot X: The AI-powered developer experience | The GitHub Blog
fast.ai - fast.ai—Making neural nets uncool again
teelinsan/camoscio: Camoscio: An Italian instruction-tuned LLaMA
22-hours/cabrita: Finetuning InstructLLaMA with portuguese data
tloen/alpaca-lora: Instruct-tune LLaMA on consumer hardware
Prompt Engineering Guide | Prompt Engineering Guide
spindas | Who needs a backend? ChatGPT as the universal Redux reducer
Here’s a cool demo of CRDTs: - Metaphor
mckaywrigley/chatbot-ui: An open source ChatGPT UI.
mckaywrigley/paul-graham-gpt: AI search & chat for all of Paul Graham’s essays.
mckaywrigley/wait-but-why-gpt: AI search & chat for all Wait But Why posts.
Extrapolate - Transform your face with Artificial Intelligence
OpenGPT - Create ChatGpt Application in seconds | OpenGPT
Roboflow: Give your software the power to see objects in images and video
Rerun — Visualize computer vision
hpcaitech/ColossalAI: Making large AI models cheaper, faster and more accessible
plasma-umass/ChatDBG: ChatDBG - AI-assisted debugging. Uses AI to answer ‘why’
A Mathematical Framework for Transformer Circuits
Roots Search Tool - a Hugging Face Space by bigscience-data
Closed AI Models Make Bad Baselines - Hacking semantics
Fixed issue when .env file does not yet exist · nat/openplayground@7e7d804
Prefect | The New Standard in Dataflow Automation - Prefect
context-labs/autodoc: Experimental toolkit for auto-generating codebase documentation using LLMs
https://the-algorithm.onrender.com/
https://the-algorithm-ml.onrender.com/
lm-sys/FastChat: The release repo for “Vicuna: An Open Chatbot Impressing GPT-4”
The Berkeley Artificial Intelligence Research Blog
project-baize/baize-chatbot: Let ChatGPT teach your own chatbot in hours with a single GPU!
THUDM/ChatGLM-6B: ChatGLM-6B:开源双语对话语言模型 | An Open Bilingual Dialogue Language Model
NolanoOrg/smol-gpt: Smol but mighty language model
NolanoOrg/cformers: SoTA Transformers with C-backend for fast inference on your CPU.
NolanoOrg/InstructLLaMa.cpp: Fast inference of Instruct tuned LLaMa on your personal devices.
abetlen/llama-cpp-python: Python bindings for llama.cpp
saharNooby/rwkv.cpp: INT4 and FP16 inference on CPU for RWKV language model
ggerganov/ggml: Tensor library for machine learning
(1) w̸͕͂͂a̷͔̗͐t̴̙͗e̵̬̔̕r̴̰̓̊m̵͙͖̓̽a̵̢̗̓͒r̸̲̽ķ̷͔́͝ (@anthrupad) / Twitter
(4) janus (@repligate) / Twitter
(4) tetraspace 💎 on Twitter: “@lovetheusers https://t.co/wRVhjjSt0Y” / Twitter
(5) Adept (@AdeptAILabs) / Twitter
(1) max drake (⨍) (@max__drake) / Twitter
(1) Joscha Bach (@Plinz) / Twitter
(1) Kerim Safa (@kerimsafa) / Twitter
(1) janus (@repligate) / Twitter
(1) Anthropic (@AnthropicAI) / Twitter
(1) @AJSturrock/AI Leaders / Twitter
ezelikman/parsel: Code for Parsel 🐍 - generate complex programs with language models
Welcome to LlamaIndex 🦙 (GPT Index)! — LlamaIndex documentation
Open Assistant | Open Assistant
ShreyaR/guardrails: Adding guardrails to large language models.
simonw/llm: Access large language models from the command-line
luchris429/purejaxrl: Really Fast End-to-End Jax RL Implementations
ArXiv Chat: Chat with the latest Arxiv papers
(1) Metal (@Metal_io) / Twitter
(1) Brian Roemmele (@BrianRoemmele) / Twitter
(2) Harrison Chase (@hwchase17) / Twitter
jart/sectorlisp: Bootstrapping LISP in a Boot Sector
Simon Willison: “So many highlights in this pap…” - Mastodon
@ReadMultiplex – multiplex-past, present, future technology research + insights ☂️
The future, soon: what I learned from Bing’s AI
A prosthesis for imagination: Using AI to boost your creativity
PsyArXiv Preprints | Analogy as a catalyst for cumulative cultural evolution
acheong08/ChatGPT-Proxy-V4: Cloudflare Bypass for OpenAI based on puid
MemoryGPT - ChatGPT with longterm memory
MemoryGPT is like ChatGPT with long-term memory
Storing and querying for embeddings with Redis – baeke.info
databricks/dolly-v2-12b · Hugging Face
[2304.01373] Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
github/codespaces-jupyter: Explore machine learning and data science with Codespaces
Prompt injection attack on ChatGPT steals chat data | System Weakness
greshake/llm-security: New ways of breaking app-integrated LLMs
Running Dolly 2.0 on Paperspace | Simon Willison’s TILs
Creating desktop backgrounds using Midjourney | Simon Willison’s TILs
sips: Scriptable image processing system | Simon Willison’s TILs
GPT-4 for API design research | Simon Willison’s TILs
Thoughts on AI safety in this era of increasingly powerful open source LLMs
Running Python micro-benchmarks using the ChatGPT Code Interpreter alpha
The Changelog podcast: LLMs break the internet
We need to tell people ChatGPT will lie to them, not debate linguistics
Replacing my best friends with an LLM trained on 500,000 group chat messages
The AI singularity is here | InfoWorld
Building LLM applications for production
Database “sharding” came from UO? – Raph’s Website
LMQL: Programming Large Language Models
mckaywrigley/prompts: My favorite AI prompts.
xtekky/chatgpt-clone: ChatGPT interface with better UI + running on free gpt api’s
StampyAI/stampy-ui: AI Safety Q&A
200 Concrete Problems In Interpretability Spreadsheet - Google Sheets
Toolkit - Create and discover AI plugins
The best way to build web apps without code | Bubble
teknium1/character-cards: A collection of character cards for use in AI Roleplaying
GPT-3 token encoder and decoder / Simon Willison | Observable
GPT-4 Week 4. The rise of Agents and the beginning of the Simulation era : ChatGPT
[2302.00093] Large Language Models Can Be Easily Distracted by Irrelevant Context
(7) AK on Twitter: “AI music is taking off, here are some companies working in the space” / Twitter
templeofninpo/templeofninpo.github.io
StoicAI v12: stopped trying to be fancy, made it fancier
oneil512/INSIGHT: INSIGHT is an autonomous AI that can do medical research!
Activeloop | Deep Lake | Data Lake for Deep Learning
kroll-software/babyagi4all: BabyAGI to run with GPT4All
Vector Search Database | Qdrant Cloud
CozoDB: embedded Datalog, performant graphs
vespa-engine/vespa: The open big data serving engine. https://vespa.ai
Vespa - the big data serving engine
Welcome | Weaviate - vector database
replit/ReplitLM: Inference code and configs for the ReplitLM model family
replit/replit-code-v1-3b · Hugging Face
mosaicml/examples: Fast and flexible reference benchmarks
BigCode - Open and responsible development of LLMs for code
bigcode-project/Megatron-LM: Ongoing research training transformer models at scale
bigcode/starcoder · Hugging Face
bigcode/ta-prompt · Datasets at Hugging Face
nbardy/SuperPrompt: Mutimodal LLM Lisp
RedPajama-INCITE-3B, an LLM for everyone — TOGETHER
togethercomputer/redpajama.cpp: Extend the original llama.cpp repo to support redpajama model.
CarperAI/stable-vicuna-13b-delta · Hugging Face
Available models in Generative AI Studio | Vertex AI | Google Cloud
Vertex AI – Google Play Android… – Google Cloud console
Dante | Build an AI chatbot trained on your data
Introducing speech-to-text, text-to-speech, and more for 1,100+ languages
keon/awesome-nlp: A curated list of resources dedicated to Natural Language Processing (NLP)
zhengzangw/awesome-huge-models: A collection of AWESOME things about HUGE AI models.
kyaiooiayk/Awesome-LLM-Large-Language-Models-Notes: What can I do with a LLM model?