Twitter
Simon Willison’s Weblog
Predictable updating about AI risk - Joe Carlsmith
Strong Evidence is Common
On the Universal Distribution - Joe Carlsmith
Toolbox-thinking and Law-thinking - LessWrong
Playing the training game
How might we align transformative AI if it’s developed very soon? - LessWrong
(9) Eliezer Yudkowsky on Twitter: “Another way of breaking loose of ‘arguments’: Any time somebody manages to persuade you of something via much hard work, do not neglect to remember that you would, if you had been smarter, probably have been persuadable by the empty string.” / Twitter
[1906.01820] Risks from Learned Optimization in Advanced Machine Learning Systems
[2210.01790] Goal Misgeneralization: Why Correct Specifications Aren’t Enough For Correct Goals
AI Safety Seems Hard to Measure - EA Forum
This GPT-4 answer speaks for itself - Marginal REVOLUTION
Kokotajlo Review of JC Alignment Report - Google Docs
Is Power-Seeking AI an Existential Risk? - 2206.13353.pdf
AI Could Defeat All Of Us Combined
AI Wins IMO Gold Medal | Metaculus
Yes Requires the Possibility of No - LessWrong
Exaggerating the risks (Part 6: Introducing the Carlsmith report) - Reflective altruism
Mistakes with Conservation of Expected Evidence - LessWrong
Rational predictions often update predictably* - EA Forum
Conservation of Expected Evidence - LessWrong
Rational predictions often update predictably* - EA Forum
Mantic Monday 3/14/22 - by Scott Alexander
Believing in things you cannot see - Joe Carlsmith
Thoughts on being mortal - Joe Carlsmith
Noticing Confusion - Sequence
Simulation arguments - simulation_arguments_revised.pdf
List of Cognitive Biases and Heuristics - The Decision Lab
The Missing Moods - Econlib
Mental Health and the Alignment Problem: A Compilation of Resources (updated April 2023) - LessWrong
This Can’t Go On
Tom Murphy 7’s Invincible Web Page
Tom 7 Radar
Dynamic import is not enabled in this context (deno-deploy) · Issue #40 · pluvial/svelte-adapter-deno
Building a Signal Analyzer with Modern Web Tech - Casey Primozic’s Homepage
(10) John A De Goes on Twitter: “Boss: Why should I let you choose programming language>? Go: So the developers you hire when I rage-quit over lack of proper generics and result enums can instantly take over and maintain my code. Haskell: So we can intimidate our intellectually weak competitors who dared…” / Twitter
Geoff Rich on Twitter: “Stumbled on some very strange (AI-generated?) tweets about SvelteKit. So weird, there’s tons of these if you search for “SvelteKit” and sort by latest. https://t.co/4MCIGer93z” / Twitter
(10) Cameron R. Wolfe on Twitter: “We all know that LLMs tend to make errors, whether it be simple mistakes (e.g., improper arithmetic), hallucinations, or something else. But, studying the statistics of mistakes that LLMs make shows us something that we might not intuitively expect. Background: One way to study… https://t.co/MaqYYqhrWA” / Twitter
(10) Talia Ringer on Twitter: “New preprint just dropped! “Can Transformers Learn to Solve Problems Recursively?” With @dylanszzhang, @CurtTigges, @BlancheMinerva, @mraginsky, and @TaliaRinger. https://t.co/D13mD2Q7aq https://t.co/wqM2FPQEQ4” / Twitter
Aran Komatsuzaki on Twitter: “SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning Outperforms all SotA RL baselines, trained for 1M steps, without any training. https://t.co/SJbnVHaAdf https://t.co/9s8p2sBe1d” / Twitter
Kelly Vaughn on Twitter: “My friends. Think before you use Chat GPT during the interview process. https://t.co/7yMp8CGyBu” / Twitter
(10) gfodor.id on Twitter: “My intuition as a software engineer and shoggoth whisperer is that the flailing AutoGPT situation is misleading. Some schizo is going to find a niche where something meaningful breaks through for some bizarre and inexplicable reason and then the Carmacks of the world flood in.” / Twitter
tiiuae/falcon-40b · Hugging Face
Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4
Chatbot Arena Leaderboard Updates (Week 4) | LMSYS Org
artidoro/qlora: QLoRA: Efficient Finetuning of Quantized LLMs
Hugging Face – Blog
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA
Guanaco Playground Tgi - a Hugging Face Space by uwnlp
Stability-AI/StableStudio: Community interface for generative AI
Reimagine XL
Async/await and promises | DeviceScript
microsoft/devicescript: TypeScript for Tiny IoT Devices
Diagram · Design tools from the future.
naviserver / naviserver — Bitbucket
aolserver/aolserver: AOLserver is America Online’s Open-Source web server. AOLserver is the backbone of the largest and busiest production environments in the world. AOLserver is a multithreaded, Tcl-enabled web server used for large scale, dynamic web sites.
(10) AK on Twitter: “Open AI releases paper + dataset Let’s Verify Step by Step trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply rewarding the correct final answer (“outcome… https://t.co/M6PwkqBLL9” / Twitter
(10) Roberto Nickson on Twitter: “Adam Mosseri (Instagram CEO) just explained exactly how the Instagram algorithm works, and how they rank content in stories, feed, reels & explore. The last slides are most important. It’s how I’ve added 500K followers to my accounts in 4 months. Here’s what you need to know:” / Twitter
Matt Shumer on Twitter: “Is anyone else noticing significantly downgraded GPT-4 capabilities today? Seems like OpenAI updated the model, and results aren’t as good as before.” / Twitter
Ask HN: Is it just me or GPT-4’s quality has significantly deteriorated lately? | Hacker News
(10) Ate-a-Pi on Twitter: “This OpenAI paper might as well have been titled “Moving Away From PaperClip Maxxing” So good - they took a base GPT-4, fine tuned it a bit on math so that it understood the language as well as the output format - then no RL. Instead they trained and compared two reward… https://t.co/3Tdi2kep98” / Twitter
(10) Carter Anderson on Twitter: “I use Rust because there is literally nothing else on the market like it. The character of other “similar” languages (Zig, Go, C++, Jai, etc) is completely different. The developer community is the best I have ever been in, despite the hiccups here and there.” / Twitter
(10) bowser on Twitter: “i wonder how many people know that camera sensor noise is unique so if you post pictures from the same camera on main and alt they can be linked” / Twitter
Tau: Accurate AI Software Generation with Guaranteed AI Safety