Microsoft open-sources "the earliest DOS source code discovered to date"

upvote

Microsoft open-sources "the earliest DOS source code discovered to date"

(arstechnica.com)

380 points

by DamnInteresting17 hours ago |

upvote

by jmward0115 hours ago|

[-]

It is rare that I say this but, thanks MS! Arguably just as, if not more, important is the BASIC that they wrote. That was what they actually wanted to do. DOS just got them the contract with IBM. For decades MS was really a developer tools company with a side biz of writing operating systems and other misc software. They also open sourced that BASIC code too [1].

[1] https://opensource.microsoft.com/blog/2025/09/03/microsoft-o...

reply

upvote

by ramon15610 hours ago|

[-]

I dont think I've ever seen a commit that says "49 years ago". Damn.

reply

upvote

by RobotToaster2 hours ago|

[-]

Not quite as old, but brl-cad is still in active development and has commits from 1983. https://github.com/BRL-CAD/brlcad/graphs/contributors?all=1

reply

upvote

by pletnes7 hours ago|

[-]

1968 https://www.netlib.org/go/fft.f

reply

upvote

by formerly_proven10 hours ago|

[-]

https://github.com/dspinellis/unix-history-repo

reply

upvote

by 2 hours ago|

[-]

deleted

reply

upvote

by steve197710 hours ago|

[-]

I remember when I realized I had been using Microsoft all along through my Commodore 64.

reply

upvote

by vee-kay10 hours ago|

[-]

What's interesting is that Microsoft BASIC itself was derived from BASIC-PLUS which itself was derived from Dartmouth BASIC (which evolved into a structured programming language called SBASIC (Structured BASIC). But the popularity of Microsoft BASIC, actually halted the standardisation of SBASIC as an ANSI standard.

https://en.wikipedia.org/wiki/Microsoft_BASIC

The Altair BASIC interpreter was developed by Microsoft founders Paul Allen and Bill Gates using a self-written Intel 8080 emulator running on a PDP-10 minicomputer.[1] The MS dialect is patterned on Digital Equipment Corporation's BASIC-PLUS on the PDP-10, which Gates had used in high school.

https://en.wikipedia.org/wiki/Dartmouth_BASIC

Dartmouth BASIC is the original version of the BASIC programming language. It was designed by two professors at Dartmouth College, John G. Kemeny and Thomas E. Kurtz. With the underlying Dartmouth Time-Sharing System (DTSS), it offered an interactive programming environment to all undergraduates as well as the larger university community.

Dartmouth also introduced a dramatically updated version known as Structured BASIC (or SBASIC) in 1975, which added various structured programming concepts. SBASIC formed the basis of the American National Standards Institute (ANSI) "Standard BASIC" efforts in the early 1980s.

In contrast to the Dartmouth compilers, most other BASICs were written as interpreters. This decision allowed them to run in the limited main memory of early microcomputers. Microsoft's Altair BASIC is one example: it was designed to run in only 4 KB of memory (interestingly, it was delivered on paper tape).

Kemeny became involved in an effort to produce an ANSI standard BASIC in an attempt to bring together the many small variations of the language that had developed through the late 1960s and early 1970s. This effort initially focused on a system known as Minimal BASIC that was similar to earliest versions of Dartmouth BASIC, while later work was aimed at a Full BASIC that was essentially SBASIC with various extensions.

But by the late 1980s, tens of millions of home computers were running some variant of the MS BASIC interpreter. It had become the de facto standard for BASIC, which eventually led to the abandonment of the ANSI SBASIC efforts.

Kemeny and Kurtz, however, decided to continue their efforts to introduce the concepts from SBASIC and the ANSI Standard BASIC efforts. This became True BASIC.

https://en.wikipedia.org/wiki/True_BASIC

There are versions of the True BASIC compiler for MS-DOS, Microsoft Windows, and Classic Mac OS. At one time, versions for TRS-80 Color Computer, Amiga and Atari ST computers were offered, as well as a UNIX command-line compiler.

After several years of inactivity, as of February 2026, the TrueBASIC website is officially closed.

reply

upvote

by dboreham2 hours ago|

[-]

Nit: the pdp-10 is generally considered a mainframe not a minicomputer.

reply

upvote

by BobbyTables23 hours ago|

[-]

Ah, the good ol’ “Embrace, Extend, … Extinguish”

reply

upvote

by nananana99 hours ago|

[-]

I cannot describe to you how jealous I am of the fact that back then writing a few thousand lines of assembly was what it took to launch a successful software company.

reply

upvote

by curiousObject8 hours ago|

[-]

>writing a few thousand lines of assembly was what it took to launch a successful software company.

Yes, but that assembly was not DOS, and it wasn’t easy.

Microsoft purchased the DOS code, they didn’t write it. Of course, they did develop and modify DOS. But that was a clever (and lucky) business deal, not a technological accomplishment.

The real beginning of Microsoft was earlier, with Allen, Gates and Davidoff writing the Altair BASIC interpreter. That was a serious achievement.

They had never seen the computer they were writing that assembly code for. They did not even own any computers. It took them 8 weeks on a university computer they were not supposed to be using for that

“Altair agreed to meet them to possibly buy a BASIC interpreter… Gates and Allen had neither a BASIC interpreter nor even an Altair system on which to develop and test one. However, Allen had written an Intel 8008 emulator that ran on a PDP-10 time-sharing computer. Allen adapted this emulator based on the Altair programmer guide, and they developed and tested the interpreter on Harvard's PDP-10.

The finished interpreter, including its own I/O system and line editor, fit in only four kilobytes of memory, leaving plenty of room for the interpreted program. In preparation for the demo, they stored the finished interpreter on a punched tape that the Altair could read, and Paul Allen flew to Albuquerque to meet with Altair…

While on final approach into the Albuquerque airport, Allen realized that they had forgotten to write a bootloader to read the tape into memory. Writing in 8080 machine language, Allen finished the program before the plane landed. Only when they loaded the program onto an Altair and saw a prompt asking for the system's memory size did Gates and Allen know that their interpreter worked on the Altair hardware.”

https://en.wikipedia.org/wiki/Altair_BASIC

reply

upvote

by BobbyTables23 hours ago|

[-]

Imagine if the University had sued for their share of the IP and that was created using their resources…

It’s funny because I thought Jobs/Wozinak got their initial funding from selling phreaking boxes. And more recently, Anthropic engaged in criminal copyright violations with only a slap on the wrist.

Feels like a common theme of every “great” company having its origins from a “boost” resulting from criminal activity. (After all, that’s where the money is!)

Just imagine the criminal penalties possible for pirating and selling one copy of a movie or making one long distance phone call with phreaking.

reply

upvote

by areweai1 hours ago|

[-]

In the case of Microsoft, I'm not seeing it.

Being born into a 1% household and understanding the asymmetric upside that having the money and the time to speculate is far more significant than the civil and criminal legal violations on the way.

The most common way to go from one-percenter rich to .001% rich is to already have enough wealthy people generating capital in your personal network that you can raise capital on sweetheart terms to buy the labor of people who don't.

Then you sell it at a massive premium and repeat.

I think it's empirically dubious to identify the UW mainframes as the secret sauce instead of "being able to ask your mom for a meeting with the chairman of IBM followed by asking her for 80,000 dollars ASAP."

If the original creators of DOS were born into a wealthy family and on a first name basis with the chairman of IBM, do you think they would've sold it to Gates?

Trying to attribute the tech business "founding crime" feels like displacement for what is perfectly legal and accepted cultural practice.

reply

upvote

by dboreham2 hours ago|

[-]

See also Airbnb and Uber.

reply

upvote

by yokoprime8 hours ago|

[-]

To be fair, i think you needed a cutthroat businessman leading the company. Which i guess is more or less the same today

reply

upvote

by themafia31 minutes ago|

[-]

> a cutthroat businessman leading the company

I'm sure his family connections aided him significantly.

reply

upvote

by justsomehnguy6 hours ago|

[-]

This too but early MS to their employs was closer to a hipster SV vibe coding in a coffee shop a decade ago.

reply

upvote

by greenbit8 hours ago|

[-]

And for such simple processors and systems no less! No descriptor tables to deal with, no memory management to configure. These days it takes a little processor inside the main processor, just to get things started. Those were golden times.

reply

upvote

by 8 hours ago|

[-]

deleted

reply

upvote

by embedding-shape8 hours ago|

[-]

Replace Assembly with TypeScript/Rust/Go/whatever and as long as the idea is good and useful, same thing applies today.

reply

upvote

by risyachka7 hours ago|

[-]

Except the competition was essentially non existent and no one would copy your product with llm in a day

reply

upvote

by uluyol7 hours ago|

[-]

What makes you think there was no competition?

reply

upvote

by embedding-shape7 hours ago|

[-]

The "competition" never been just a different codebase, that's one of the smallest pieces you'd have to actually build if you want to build a product people actually want to buy and use. The magic is basically all around it, multiplied by the code, but you really must have every else down pretty tight before the codebase even start mattering. But once it does matter, it matters a lot, hence the difficult balancing.

reply

upvote

by avadodin8 hours ago|

[-]

More than a few people would rather die in poverty than put in the effort today even if you offered to time-machine them back with their finished product.

reply

upvote

by gnabgib17 hours ago|

[-]

Discussion, on the source, at the time (79 points, 24 days ago, 19 comments) https://news.ycombinator.com/item?id=47957494

Or on the GitHub clone (162 points, 15 comments) https://news.ycombinator.com/item?id=47946813

reply

upvote

by locusofself16 hours ago|

[-]

wow, they had to OCR it back in from paper printouts

> This source code is old enough that it hadn’t been stored digitally. “A dedicated team of historians and preservationists led by Yufeng Gao and Rich Cini,” calling itself the “DOS Disassembly Group,” painstakingly transcribed and scanned in code from paper printouts provided by Paterson. This process was made even more difficult because modern OCR software struggled with the quality of the decades-old printout.

reply

upvote

by FarmerPotato15 hours ago|

[-]

I'd like to hear more about what works in OCR of dot-matrix fonts.

I've been able to OCR letter-quality printer output to 97% (mostly Os and Xs problems).

But it seems that machine-learning text-recognition is also now biased to reject computer code because it doesn't look like human language.

reply

upvote

by ndiddy5 hours ago|

[-]

There's a writeup here from one of the people on the team about the work it took to go from the listings to source code. http://cini.classiccmp.org/recoveryblog.htm

> With less-than-satisfactory OCR output, I resorted to a process I used many years ago when converting scans made of old Commodore ROM dumps printed on a Commodore 1515 dot-matrix printer. The process relies on the ASCII OCR output having the same repetitive errors. "B" and "8", "S" and "5" are good examples, as are "l" and "1", and "O" and "0". There are many other similar single-character errors and, when working with x86 code, there are similar errors with instructions like "MOV". This process naturally works better if the output file is monolithic rather than single-page OCR conversions because you can do substitutions across the entire converted printout and not 75 separate files.

> The next formatting hassle was the spacing. This required repetitive substitutions of a descending numbers of spaces to tabs (i.e., replace 8 spaces with a tab, 7, 6, etc.). Then if you want to return it to fixed spaces (which is likely how the original printer printed it -- spaces and not vertical tabs), you can. For pure re-creation work, spaces produce absolute column formatting while tabs can move around depending on the program displaying the file.

> Once you run thought the 15 or so common global substitutions and tab conversion, it's a lot easier to work with the file to fix formatting and perform other cleanup. This is then followed by a line-by-line comparison against the original printouts. Overall I'd say the conversion output quality with this method is very good.

reply

upvote

by FarmerPotato3 hours ago|

[-]

Hmm, doesn't say anything about what OCR tools they used.

I've got a 4" stack of wide-carriage COBOL. I guess it's two revisions of the same system so I only need to scan the newer half. Its probably from a TI Omni 810.

On the other hand, I've got 100 pages of code printed in compressed font by someone wanting to make sure that 80+ char lines fit within margins. So a lot of words just don't come out at all. A frequent error is "A" becomes "H", "O" becomes "U" because the top dots aren't "attached".

And columns of line numbers starting with 0001, or hex? The most confounding thing is OCR that thinks 00 is a sideways 8, and that dominates the uniform block, so it tries to interpret the whole column as sideways text. In another situation, it interprets two stacked lines (each starting with 0) as one line starting with 8 and it just goes off the rails.

So I've been working with automatic skew correction, then clipping it into rows, in order to get each line of text isolated from the surrounding context. When I do that, I get better results, but it is not great either.

I'm considering going all-in on training a new recognizer on snippets. For that, I'll be constructing "The Set of All As" and so on.

reply

upvote

by accrual2 hours ago|

[-]

Pretty interesting. I wonder if a whitelist against certain columns in the output could help, e.g. this column can only contain valid x86 instructions (e.g. MOV is allowed, M0V is not), this column can only contain hexadecimal (1 is allowed but never "l"), etc. Probably more work than it's worth given the final line-by-line comparison that happens anyway.

reply

upvote

by embedding-shape8 hours ago|

[-]

Boring reply perhaps, but I've had wild success with adding even a tiny LLM afterwards to do "fixups" over OCRd text, works great for the typical O/0 issues and similar, just pass it the scrambled OCRd text together with the text around it, and even dumb and tiny 7b models running on CPU do a pretty fine job.

reply

upvote

by bob7788 hours ago|

[-]

ABBYY has a specific module for dot matrix printouts so I’m surprised it was a struggle for them but every document is different

reply

upvote

by WalterBright11 hours ago|

[-]

I've recovered some ancient software I wrote via scanning in listings I found among my dad's papers.

reply

upvote

by SoftTalker16 hours ago|

[-]

Yet another case where text printed on paper outlived any digital storage.

reply

upvote

by jshier16 hours ago|

[-]

Seems like it was never digitally stored in the first place, and the printed text was barely readable due to age. Not really a big win for paper.

reply

upvote

by SoftTalker15 hours ago|

[-]

Well it had to have been on disk or tape at some point. It wasn't all typed in by hand every time they needed to build a new version.

reply

upvote

by debesyla12 hours ago|

[-]

unless they used punch cards

reply

upvote

by Sharlin9 hours ago|

[-]

Punch cards are still a form of digital storage, mind.

reply

upvote

by wongarsu8 hours ago|

[-]

Also a form of storing things on paper

reply

upvote

by accrual2 hours ago|

[-]

Reminds me of an old fortune cookie message or meme, something like "digital data is made from analog parts".

reply

upvote

by WalterBright11 hours ago|

[-]

I threw out all my punch cards. Wish I'd kept at least a listing!

reply

upvote

by genxy53 minutes ago|

[-]

I find punch cards being used in old engineering books I buy from the 60s.

Maybe write them again?

reply

upvote

by andsoitis11 hours ago|

[-]

> unless they used punch cards

For MS-DOS?

reply

upvote

by WalterBright11 hours ago|

[-]

Not likely. Punch cards disappeared around the end of 1976.

reply

upvote

by SoftTalker2 hours ago|

[-]

My firt job out of college in the early 1990s was at an equipment manufacturer who was still using them. They had a big chart on the wall titled "punch-card elimination" and a line trending down, but it wasn't at zero yet.

My work there was all new code and didn't involve any of that, however.

reply

upvote

by greenbit7 hours ago|

[-]

I remember seeing stacks of cards being carried into/out of the university "computing center" in the mid 1980s, on more than a couple of occasions. Though in retrospect, these were probably just old programs that had been in various professors offices since the mid 70s, being taken to get read into some disk in the mainframe.

reply

upvote

by MomsAVoxell8 hours ago|

[-]

We still learned how to use them in the 80’s high school computer classes, mostly because we had a balance of CP/M plus card-reader/early DOS machines, eventually .. in the labs. Rich kid schools had Apples though, and some of them also had card readers for BASIC ..

reply

upvote

by greenbit7 hours ago|

[-]

"[..] card readers for BASIC"

Finally, a sensible use case for BASIC's "READ" and "DATA" commands. Learning BASIC as a kid on a micro, it always struck me as an odd way to get input into a program. Sure, with INPUT, you'd have to hand enter your input every time, but baking into the program meant that you'd have to edit your program any time you wanted to change anything.

But with a card reader, you could "cut the deck". Keep the program cards, and then just stack on whatever set of data cards you wanted.

From this vantage point, in the 21st century with our flying cars and what not, it seems really quirky that back then, even your data could be a tangible thing.

reply

upvote

by MomsAVoxell7 hours ago|

[-]

Indeed, we still pay homage to the era with terms such as the stack, pushing and popping, and all kinds of things .. i remember we had fun inserting random infinite loops in other students cards on occasion until we all realized we could just have marked “finished” stacks with an X across the spine, and also to ease sorting, and so on .. i would mark certain sub-routines with different color markers on the spine too, just to see a budget for how much computing time i expected to be billed for, and so on and on .. lots of valuable hands on came from the card-based computing, its a lost art ..

reply

upvote

by fortran775 hours ago|

[-]

My college used them for PL/I and IBM Assembly language programming classes until 1982. Cards were used well into the mid-80s.

reply

upvote

by Anonyneko8 hours ago|

[-]

We still used them in the university as late as in 2010...

...as writing paper.

reply

upvote

by zargon15 hours ago|

[-]

The idea that it never existed digitally is obviously untrue. Likely poor wording in the author's part. They probably meant something like, so old that a printout is all that survived (which sounds vaguely like not being digital to someone in an era so far removed from a time when programs were/could realistically be printed.)

reply

upvote

by WalterBright11 hours ago|

[-]

Having printouts were necessary when:

1. you were using a DECwriter dot matrix printer as a terminal

2. using an ASR-33 teletype as a terminal

3. using punch cards or paper tape

4. using a glass tty that could only display 24 lines

5. when you did not have a remote terminal, and wanted to spread your code out on a table and debug it

reply

upvote

by tankenmate10 hours ago|

[-]

Brings back memories of desk checking

reply

upvote

by fc417fc80213 hours ago|

[-]

> a time when programs were/could realistically be printed

Really depends on the program. Source code is often quite manageable. Even artifacts aren't always as large as you might expect. Busybox on my system weighs in at 1.9 MiB or alternatively 928 KiB with zstd maxed out.

But I don't really see a point to printing any of it. A situation that might require the printouts is likely to largely preclude the continued existence of modern electronics, the ability to replace batteries, or even a connection to a reliable electrical grid.

reply

upvote

by zargon13 hours ago|

[-]

Yeah, that's why I tried to include both categories. Even for programs that are small enough to be printed, we just don't do it any more. I could have worded that part better myself.

reply

upvote

by onion2k11 hours ago|

[-]

Early versions of some things, MS Basic being one example I think, were baked into ROM. One of the best innovations that Paul Allen came up with was adding software hooks to the code so bugs that were found later could still be patched.

reply

upvote

by irishcoffee13 hours ago|

[-]

How did they print it then, I wonder?

reply

upvote

by bryanrasmussen12 hours ago|

[-]

They had some old German guy with a big beard, and two interns, running some sort of big contraption that looked like a medieval torture instrument, and the interns would run and put letters in a row and then the old guy move a massive letter and in the end out came a bit of paper with source code on it.

reply

upvote

by eipi10_hn9 hours ago|

[-]

Where can I buy this printer?

reply

upvote

by wheybags8 hours ago|

[-]

Humbrechthof, Mainz, Germany ofc.

(https://en.wikipedia.org/wiki/Humbrechthof)

reply

upvote

by 7bit9 hours ago|

[-]

One has to be pretty ignorant and dismissive to claim that this is not "a big win for paper".

First of all, that comment is weirdly out of place. The quality and longevity of paper is not the topic.

Secondly, there are fragments of paper with writing as old as 2,000 years.

Thirdly, paper you look at and see the writing. With digital documents, you need the technology to read the medium and then you need to know how the information was encoded onto the medium, before you even arrive at the same level with paper, where you can start to decide the actual writing.

Paper has brought us where we are today, and given us what we know about the past. Don't be so ignorant and dismissive.

reply

upvote

by petcat15 hours ago|

[-]

> struggled with the quality of the decades-old printout.

barely

It sounds like this printout has deteriorated badly and was barely readable.

reply

upvote

by Sharlin9 hours ago|

[-]

If it was your standard issue cheap dot-matrix printout, it may not been particularly legible even back then.

reply

upvote

by justsomehnguy5 hours ago|

[-]

Even if the printer itself was fine it doesn't imply the ribbon was wet enough.

reply

upvote

by 9 hours ago|

[-]

deleted

reply

upvote

by acomjean5 hours ago|

[-]

Interesting story of how MS got into the operating system business. IBM wanted the CPM operating system, but Digital Research wouldn’t sign ibms NDA… really a pivot point in computing history.

From “Triumph of the Nerds” tv transcript:

https://www.pbs.org/nerds/part2.html

Jack Sams (IBM) was looking for a package from Microsoft containing both the BASIC computer language and an Operating System. But IBM hadn't done their homework.

Steve Ballmer: They thought we had an operating system. Because we had this Soft Card product that had CPM on it, they thought we could licence them CPM for this new personal computer they told us they wanted to do, and we said well, no, we're not in that business.

Jack Sams (IBM); When we discovered we didn't have - he didn't have the rights to do that and that it was not...he said but I think it's ready, I think that Gary's got it ready to go. So I said well, there's no time like the present, call up Gary.

Steve Ballmer: And so Bill right there with them in the room called Gary Kildall at Digital Research and said Gary, I'm sending some guys down…. Treat them right, they're important guys.

reply

upvote

by bragr4 hours ago|

[-]

Eh, basically all facts in this story are disputed by all sides. Aside from general gist that there was some meeting that didn't go well.

reply

upvote

by chuckadams4 hours ago|

[-]

Whether Kildall actually blew IBM off at that meeting or not, what was definitely the case was that CP/M didn't have a 16-bit version ready to meet IBM's schedule, and that's what ultimately took them out of the running.

reply

upvote

by danborn2648 minutes ago|

[-]

Looking through the source is a great reminder of how constrained early computing was. It's amazing how much of this architecture still influences modern systems.

reply

upvote

by userbinator17 hours ago|

[-]

I wonder how long it'll be before they release the source for the earliest Windows versions. The fact that they still have the source for this very old DOS at least gives hope that they also do for old Windows.

reply

upvote

by GaryBluto14 hours ago|

[-]

The day they would make Windows 2000 codebase open source (or source available) would be the day I could die happy (although I'd probably be long dead anyways by the time there's a glimmerof chance of it happening). What a beautiful, smooth-running operating system it was.

reply

upvote

by ndiddy5 hours ago|

[-]

They will never release the code for anything that new because at that point, there's tons of licensed third-party code and the codebase is so large that going through everything to verify ownership would not be feasible. The code to NT 4 and XP have been leaked though.

reply

upvote

by optymizer14 hours ago|

[-]

Agreed. It's still my favorite Windows version.

reply

upvote

by greenbit7 hours ago|

[-]

Except for "the hive". Remember the hive? Sort of an alternate registry, in addition to the actual registry. Granted, it was pretty invisible, until it got corrupted.

I had a win2k machine that was my daily (at home) that was fine until idk about 2006, at which point something happened (muons?) and it would go into some kind of panic state just after bringing up the desktop. Hive corruption. I tried on and off for a couple of years to repair it, no luck. It wasn't just about the files on the HD, it was easy enough to transplant the drive and read/write anything, it was that I really liked the way I had the environment configured. Sure, it was all kind of moot, but it became a kind of personal windmill to resurrect this old thing. In the end, I booted an XP CD in it, and selected 'upgrade', and voila, it was Duncan Idaho, back from the dead.

Anyway.. loved win2k, but not a fan of the hive.

reply

upvote

by chuckadams4 hours ago|

[-]

The registry is a collection of individual database files known as hives.

reply

upvote

by justsomehnguy5 hours ago|

[-]

I think you are mistaking the registry for the registry.

https://devblogs.microsoft.com/oldnewthing/20030808-00/?p=42...

reply

upvote

by NitpickLawyer12 hours ago|

[-]

Wasn't there a 2000 source leak a while ago? I remember some exploits coming out after the leak.

reply

upvote

by toyg9 hours ago|

[-]

Yes but it could not be legally used by anything.

reply

upvote

by avadodin8 hours ago|

[-]

OP said source available was acceptable. not even asking for compiler access which is also widely available.

Windows has always been more than modular enough for any repurposing and there were licenses that were not tied to specific hardware so you could use them even today.

Which is to say no one is stopping you from building a COPILOT.VBX for VisualBasic 3.0.

reply

upvote

by londons_explore14 hours ago|

[-]

There is a mostly complete leak of it...

reply

upvote

by WalterBright10 hours ago|

[-]

It shouldn't be hard to disassemble it.

reply

upvote

by protocolture13 hours ago|

[-]

I imagine its not far off. I get the impression they are almost done with windows as a platform.

reply

upvote

by teamsolid16 hours ago|

[-]

I am sure that there is a lot good material to take inspiration and learning even from the early Windows 3.11.

reply

upvote

by mycall16 hours ago|

[-]

Do a deep dive into how OS/360 formalized to having DOS.

reply

upvote

by SoftTalker16 hours ago|

[-]

/s ?

reply

upvote

by AlecSchueler13 hours ago|

[-]

Pretty sure it's a bot or simple karma farming operation.

reply

upvote

by throwaway2744815 hours ago|

[-]

They waited a couple decades too long for this to be of interest.

reply

upvote

by dang17 hours ago|

[-]

Recent and related:

Microsoft open sources DOS 1.00 on 45th anniversary - https://news.ycombinator.com/item?id=47957494 - April 2026 (19 comments)

reply

upvote

by jug8 hours ago|

[-]

While oldest source of it, note that the 86-DOS v0.1-C binaries are even earlier (and v0.34 has also been found) than this v1.00 source and can be downloaded and used in an emulator. :-)

https://arstechnica.com/gadgets/2024/01/the-oldest-known-ver...

reply

upvote

by teamsolid16 hours ago|

[-]

It is wonderful how early years of modern computing was brilliant. We treated machines as they really are: machines. Performance, creativity, science..., all possible to make a 386 machine work. Nowadays is all about libraries, virtualization, [bad] code over [bad] code over [bad] code..., I dont like it.

reply

upvote

by dhosek15 hours ago|

[-]

I sometimes think that my mental model of a computer is still an Apple ][+ with 48K of RAM leads to my writing better code.

reply

upvote

by WalterBright10 hours ago|

[-]

While I did a few 10 line programs in BASIC in high school on punch cards, when things really started was a freshman class on semiconductors. The class started with diodes and quantum mechanics, then onto transistors, then flip flops, then registers, then ALUs. Then it was on to designing/building a digital clock (which never worked right), and later designing/building/programming single board computers (6802 chip).

It was fun knowing everything about a computer. That's long gone!

reply

upvote

by stevesimmons12 hours ago|

[-]

And mine is a Commodore Vic-20 circa 1981, with 3583 bytes of free RAM. Programmed in 6502 assembler. Can't get much closer to the CPU than that.

reply

upvote

by aenis12 hours ago|

[-]

For a very long while now, we had programmers who never understood any low level concepts at all. They have started with js or python, and never looked 'down'. There are no limits to monstrosities they will consider normal.

Linus Torvalds, a few months ago, said something to this effect when discussing AI coding tools. That his (also, mine) generation was lucky to have started with low level stuff and managed to retain the understanding of the whole stack - and kids these days don't get that. Good luck acquiring this level of feel for computers, algorithms, data structures today, when a kid's first experience with coding will be a seemingly genius chatbot.

reply

upvote

by charcircuit9 hours ago|

[-]

>and managed to retain the understanding of the whole stack

No one understands the whole stack. There is too much specialized information.

reply

upvote

by Sharlin9 hours ago|

[-]

Even assembly is a high-level language relative to what’s actually going on inside a modern CPU.

reply

upvote

by goodpoint4 hours ago|

[-]

DOS and brilliant in the same sentence...

reply

upvote

by 9dev4 hours ago|

[-]

At some point, we'll probably have a new field in history for digital archeology, and I'm really envious for those future historians! They'll be getting to sleuth around old datasets, trying to reconstruct the history of computing, understand long-forgotten file formats to preserve data, use statistical methods to analyse binary backups, and trace for specific documentation versions to crack old encryption formats...

reply

upvote

by EvanAnderson1 hours ago|

[-]

The term "programmer-archaeologist" was coined by the author Vernor Vinge in his 1999 "A Deepness in the Sky"[0] (a pretty great read and definitely recommended) and the field is arguably a real thing now[1].

[0] https://en.wikipedia.org/wiki/A_Deepness_in_the_Sky

[1] https://en.wikipedia.org/wiki/Software_archaeology

reply

upvote

by giobox2 hours ago|

[-]

This field already is alive and well in the gaming community. Games companies are notorious for not spending money on keeping their old code around, which is why it's been at the forefront of digital archaeology efforts a lot of the time to preserve the industry's history.

I'd also throw the wayback when machine and the internet archive into this bucket.

reply

upvote

by dang3 hours ago|

[-]

Related ongoing thread:

Microsoft's 6502 BASIC is now Open Source (2025) - https://news.ycombinator.com/item?id=48257058

reply

upvote

by lesser-shadow54 minutes ago|

[-]

[dead]

reply

upvote

by danborn268 hours ago|

[-]

Fascinating piece of computing history. Preserving early DOS source code gives a lot of context to the structural choices that stuck around in x86 architecture for decades.

reply

upvote

by imoverclocked16 hours ago|

[-]

Time to find vulnerabilities!

I remember in the naughts, coming across a dos machine that was quite out of time… even for the university basement it was living in next to a pile of lead brick. Its only job was to run an instrument via an home-built ISA card and write data out to 5.25” floppies.

What uses would this code have in 2026?

reply

upvote

by yjftsjthsd-h12 hours ago|

[-]

It's a single user OS that runs everything in ring zero by design. I'm not sure, definitionally, that it can have security vulnerabilities. I... guess maybe code execution on exposure to an untrusted floppy disk filesystem?

reply

upvote

by greenbit7 hours ago|

[-]

Look closely, you'll notice there's no network interface. The only vulnerability in a system like that is physical access by malicious individuals.

About the worst mal-ware it can have is a boot sector that installs a "terminate, stay resident" (TSR) that copies itself onto any floppy that gets inserted.

reply

upvote

by FarmerPotato15 hours ago|

[-]

To see what decisions they made. Like any historical document. Aim to understand the people of the time.

reply

upvote

by gxd2 hours ago|

[-]

THANK YOU!

Can we now have all the Infocom games owned by Activision (which is yours) now? Pretty please? I know the source is available, but we'd like them with a MIT license (including the manuals, artwork etc).

PS: a couple of them could be harder, like Shogun, but it's okay to skip these.

reply

upvote

by rvnx6 hours ago|

[-]

I’m sure this is better software than Windows Millenium Edition

reply

upvote

by okandship10 hours ago|

[-]

readable plain text plus boring metadata still ages better than most clever archival systems

reply

upvote

by xandrius8 hours ago|

[-]

In this case a paper printout.

reply

upvote

by 11 hours ago|

[-]

deleted

reply

upvote

by hackerqwe9 hours ago|

[-]

More code that copilot can be trained on.

reply

upvote

by gnarlouse11 hours ago|

[-]

How about Microsoft fixes npm, github, and vscode

reply

upvote

by 15 hours ago|

[-]

deleted

reply

upvote

by 15 hours ago|

[-]

deleted

reply

upvote

by 11 hours ago|

[-]

deleted

reply

upvote

by froyooh16 hours ago|

[-]

Back when it was all written by hand and optimized well.

reply

upvote

by xuzhenpeng15 hours ago|

[-]

[flagged]

reply

upvote

by patrickndaye9198 hours ago|

[-]

[dead]

reply

upvote

by embirdating8 hours ago|

[-]

[dead]

reply

upvote

by Tanayk0713 hours ago|

[-]

[flagged]

reply

upvote

by dooosss15 hours ago|

[-]

Too little, too late.

reply

upvote

by signa1116 hours ago|

[-]

in the words of mr. mitch-hedburg “here, you throw this away“

reply

upvote

by TedDoesntTalk14 hours ago|

[-]

He could have sold those printouts instead of giving them away.

reply

upvote

by theanonymousone10 hours ago|

[-]

I'm wondering whether ReactOS can exploit Claude et. al. to their fullest and "recreate" Windows 2000/95. I may donate some tokens for that cause.

reply

upvote

by leobuskin10 hours ago|

[-]

I've used Claude to fix/reconstruct & build leaked Win2k3 on Linux with original toolchain via Wine. This approach included full gdi sources reconstruction. I just don't know what to do with this, it's kinda difficult to "wash" on this scale

reply

upvote

by CursedSilicon10 hours ago|

[-]

That sounds like a terrifying legal minefield that they would not want to tread

reply

upvote

by theanonymousone10 hours ago|

[-]

Is it not safe to assume Window source code is not present in the LLM training data?

reply

upvote

by stavros10 hours ago|

[-]

No: https://archive.org/download/windows-source-code

reply

upvote

by xandrius8 hours ago|

[-]

Slap a fair use on it and call it a day.

reply

upvote

by rvnx6 hours ago|

[-]

> Anthropic offers a formal copyright indemnification policy for its enterprise customers using the Claude API. The policy protects businesses from copyright infringement claims arising from authorized use of Claude or its generated outputs

So just claim it is Claude

reply

upvote

by greenbit7 hours ago|

[-]

What's that phrase, "derivative work" or something?

reply

upvote

by leni5369 hours ago|

[-]

But surely anything the LLM outputs is clear of licensing requirements /s

Or would Microsoft like to argue otherwise in court?

reply