Hacker Newsnew | past | comments | ask | show | jobs | submit | robbrown451's commentslogin

I was built with itself, and is essentially optimized for apps like itself.

It started off slow and as the system got better, it sped up its own development, basically exponentially.

Sometimes it got a bit weird, where I would be improving the protocol the LLM uses to save edits, but it would assume the changes that it was actively sending were already in place.


" There's no reason to think a super intelligence would be totally fine being a slave to apes."

Sure there is. Intelligence doesn't give us our selfish motivations, natural selection does. We have similar motivations to C elegans, that has all of 302 neurons. Stay alive and have sex.

Honeybees don't though. They are about halfway between humans and C elegans when it comes to cognitive power. But they are not selfish because they don't reproduce directly (I'm talking about the worker bees). So they will sting even though it kills them. All their behavior is consistant with this.


Kinda lame that people are downvoting this.

I've had the same perspective for quite a while now, but hadn't been able to phrase it this cleverly.

Our neocortex is, by any definition, vastly more "intelligent" than the rest of our brain. Yet it doesn't attack the cerebellum. In fact, it takes orders from the older "lizard brain"!


Heh, yeah that's a clever analogy as well. (and thanks!)

Do code harnesses that build themselves count as recursive self improvement, or does it need to be the AI itself to qualify for the term?

I always was fascinated (obsessed?) by robots that build robots, or even things like this that can contribute a lot to making the next version of itself: https://buildyourcnc.com/products/cnc-machine-blacktoe-v4-2x... (cnc router that cuts plywood, and is made out of cnc-router cut plywood)

This is my own effort at an AI assisted coding environment optimized for building itself: https://recursi.dev/ (just launching it, hope its ok to mention it, it is free/open source.... here is the HN link that has gotten no love yet: https://news.ycombinator.com/item?id=48401022 )

Personally I think harnesses are as important as the AI itself, and have this crazytheory that even if the models stopped improving today we could still have massive advances in the harnesses alone.


I think harnesses would count, AI != LLMs. Any piece of code that helps the computer reason for itself is AI, the harnesses are AI in a sense.

By that interpretation, neither the harness nor the LLM is the AI. The computer (or system of computers) taken as a whole is the AI. You can't remove any piece and still have an intelligent system.

Does this extend to power generation then, too?

Sure, why not. When you plug a device into the power network, it becomes one big system.

People are specifically talking about the engine itself and not the tools used.

We wouldn't call humans creating a calculator "recursive self improvement".


I wouldn't call the harness an AI, but I might call a tool that plays a major role in creating another one like it "recursive self improvement." For instance in the industrial revolution a metal lathe and a milling machine were instrumental in creating the next generation of themselves. Same thing with a robot that is fabricated by similar (i.e. older model of the same) robots. All of them lead to exponential improvement.


If you want to get out ahead of what's coming, it'll be small models that bootstrap the harness rather than anything else.

I used to think that, but ended up going the other direction, partly because I don't have the wherewithall to build a model but then I realized, with existing models that can take more than a tiny amount of context, you can just let any model bootstrap itself with a good prompt sent by the system.

There's a ton of other tricks to it, but mostly keeping the protocol simple for the AI so it can concentrate on coding logic and not stuff like managing BS boilerplate, dependencies, etc. (for instance I make extensive use of things like abstract syntax tree library to help with surgical edits from the LLM)

That said, I would be very open to collaborating with someone who builds such small models, I don't think the system strictly needs it, but it also could have some extra power if it had it.


I'm aware we're not there yet, but think of something like https://chatjimmy.ai/ ; at some point, you're going to be able to dynamically build the harness so it creates the necessary consistency & dynamicism at a speed unheard of.

But yes, I'm aware no ones got anywhere near there, mostly because most of the focus is on exploding the context and parameters. I'm saying that phase is done.


I'm not sure what I am looking at with chatjimmy.... what is special about it? Speed?

I'm also not sure what you mean by "we aren't there yet." Where?

Sorry, not trying to be difficult or dense, I'm just not sure what you are referring to.

> mostly because most of the focus is on exploding the context and parameters.

Large context allows a surprising amount of "learning" to happen at inference time rather than training time. I think that is relatively unexplored. As long as the model itself has passed a certain threshold of smarts, and the context is large enough (Gemini and its million token context being WAY past that point) you are not really limited by the model, you are only limited by how good the stuff you feed into that context is.

That's what happened when, nearly a year ago, I saw a major leap in capabilities that happened entirely on my end.... not in the AI, but in code written by the AI. I found it genuinely frighting to be honest. I think OpenClaw tapped into something similar, which seemed to surprise a lot of people. There were latent capabilities in the AI that were unknown until brought out by a clever harness.


image a streamlined model whose only job is to build then execute the harness at the speed youre seeing in chat jimmy.

Speed isn't really a big deal for me. I want good quality code. It's already able to generate code 10-100X as fast as I could code it myself.

Anyway, are you speaking of the harness? The harness on mine isn't AI, so speed just isn't an issue.


> Generated in 0.008s • 14,293 tok/s

Chat Jimmy runs ~300X faster than the ~50 tok/s you are used to. What could you do differently when you are able to generate code 3,000 - 30,000X as fast as you could code it yourself? What if it was all good quality code? What would you do differently if it were 100,000X faster? mtok/s? gtok/s?


refine that to: what if your harness grew to encompass a larger, slower model and adapted to both the model and the project. thats where i expect the harness to go.

use the big models to code an adaptive small model. train it to use and build tools. give it a standard temple language for any project and bake it into a chip.

right now, LLMs are great because they dont need much data pruning, but once they break through to the functional components, the first thing to do is train a well scoped harness builder.


> mine also makes extensive use of things like abstract syntax tree library to help with surgical edits from the LLM

Tell me more! This takes me way back. I did one like this in the GPT-4 days! (8k context window)


Start off with my video!!! You can also try it with zero setup (you can code right there on the static web page, it will save your edits in the browser indexed DB, and hotpatch them back into the code before it runs it.... also you can grant permission to the browser to read/write to a local directory)

recursi.dev

Seriously, I'm looking for collaborators.

There's upwards of 80,000 lines of code in the editor system, a lot to it to make sure that even newbies don't get stuck.... so that's kind of proof the system works since it doesn't break down when the codebase grows large.


remove the images from the url, i stared at the page for 3 secs waiting for something to show up (gigabit connection)

yes? the future for any verifiable task is the model attempts to verify initial state and a goal then decomposes its tasks in to every smaller verifiable subtasks, with /memory being the persistence between runs and then /dreaming on the results of those memory files + run data to introduce new ideas.

i think thats the path to async agi these labs are imagining. The only limit is that sensor data you have on the world or your system, how long your willing to wait, and how much you're willing to spend to parallelize it.

maybe once you start building out these verified workflows you can feed that back into training and hte model starts to get a feel for the world to the point that it can intuit things since it has these sub paths built.

my personal agi test is can a model, trained on video of someone knocking on a door and then open it encounter a microwave for the first time and open it when the foods done without knocking.


You ought to include a canary string if you are going to disclose your evals like that!

You need the AI eventually building another AI for the name to apply. This page is just bullshit. They vibe-code their harnesses, and yes, it shows.

Anyway, what does recursive self-improvement even means for neural-network based AIs? It's not clear it's possible at all.


Recursive self-improvement would be the model helping with the model research program. Coming up with hypotheses for training and architecture improvements, running experiments, interpreting the results, figuring out how to incorporate the best stuff into the next version, etc.

Where do you see evidence of vibe coding the harness? (and who are you talking about, Anthropic or the link I shared?)

It seems odd to complain about a AI coding tool being coded with AI. That's just eating your own dog food. In my opinion it makes it better, because the tool is very well tested.


> and who are you talking about, Anthropic or the link I shared?

About Anthropic.


> Do code harnesses that build themselves count as recursive self improvement, or does it need to be the AI itself to qualify for the term?

Shhh just let the marketing slop wash over you.


Rob Brown here, the developer of this app and ecosystem. I've poured my heart and soul into this project for over a year. I think its crazy powerful and cool and fun. I hope you'll check it out!!!! I'm here for questions.

When in history has being idle not been a problem?

If AI and robots are able to do all the jobs, being idle isn't the negative it has always been.

All through history, you needed lots of non-idle people to do all the work that needed to be done. This is a new situation we are coming upon.


If they are doing all the jobs, who is going to receive economic opportunities? Will we no longer be able to participate in the economy?


In what way do you want to participate when there's no economic value in any of it? Just do whatever you want for yourself; you're free.


The freedom you’re describing is the freedom of a domesticated animal, by the way. With the same outcome if you become a nuisance


Well we're animals and "domesticated" is synonymous with "civilized", so no problem there. And I can't see why anyone would make themselves a "nuisance" when literally all their needs - and most of their desires - are being met, so whatever outcome you're referring to is extremely unlikely.


I'm in their time zone, and was just planning to stop with my bad habit of staying up working till 4 am and waking up at noon.

So much for that plan.


I'm having trouble understanding what they want to "upskill" those people to do.

What skills won't be replaced? The only ones I can think of either have a large physical component, or are only doable by a tiny fraction of the current workforce.

As for the ones with a physical component (plumbers being the most cited), the cognitive parts of the job (the "skilled" part of skilled labor) can be replaced while having the person just following directions demonstrated onscreen for them. And of course, the robots aren't far behind, since the main hard part of making a capable robot is the AI part.


'main hard part of making a capable robot is the AI part'

Robots are far behind.

Mechanical hands with human equivalent performance is as hard as the AI part.

Strong, fast, durable, tough, touch and temp sensitive, dexterous, light, water-proof, energy efficient, non-overheating.

Muscles and tendons in human hands and forearms self-heal and grow stronger with more use.

Mechanical tendons stretch and break. Small motors have plenty of issues of their own.


And your claim is that those will never be solved?

As a professional robotics engineer I can tell you for a fact they are coming soon.


There's nothing in that post claiming those problems will never be solved. I understand the claim as "the hardware conponent of robotics needs more work and this will take some time, compared to AI capabilities/software" Or soemthing like that.

Maybe you could clarify what your experience on the matter is, how the state of th art looks to you, and most of all what timelines you imagine?


Just look up “fine dexterous manipulation with pressure feedback” to see the SOTA for dexterous manipulation

There’s at least a half dozen products, two recently from Unitree and Allegro announced.

Rodney Brooks wrote about the challenges - but frankly it was a submarine piece for his work

https://rodneybrooks.com/why-todays-humanoids-wont-learn-dex...


you are talking high cost emvironments, at least for the moment?

Come on... show me a robot that can run a farm that grows organic produce at an affordable price. It is the lowest wage job out there. Automating it would make prices far out of range for the 99% - but the billionaires could care less?


You need an AI to do that, affordable robots are already here but the intelligence is not.


For most things they don't need to be "human equivalent." I'd be willing to be the current crop of robots we're seeing could do most tasks like vacuuming, cooking, picking up clutter, folding laundry and putting it aways, making beds, touch up painting, gardening etc. It seems to be getting better very fast. And if mechanical tendons break, you replace them. Big deal. You don't even need a person to do the repair.


I don't think "replaced" is a good word here.. augmented and expanded. With AI we are expanding our activities, users expect more, competition forces companies to do more.

But AI can't be held liable for its actions, that is one role. It has no direct access to the context it is working in, so it needs humans as a bridge. In the end AI produce outcomes in the same local context, which is for the user. So from intent to guidance to outcomes they are all user based, costs and risks too.

I find it pessimistic to take that static view on work, as if "that's it, all we needed is invented", and now we are fighting for positions like musical chairs


> I don't think "replaced" is a good word here.. augmented and expanded. With AI we are expanding our activities, users expect more, competition forces companies to do more.

Daily reminder that the vast majority of value generated by productivity boost brought by technology in the last 50 years doesn't benefit the workers

https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSG4s-x...


Agree for almost all jobs, but some, like my fathers, was about crawling inside huge metal pieces to do precision machining. For unique piecework, it might not be economical to train AI. Surely equivalents to this exist elsewhere


I'd suggest enjoying that vindication while it lasts.

From my perspective, your perspective is like a horse and buggy driver feeling vindicated when a "horseless carriage" driver accidentally drives one into a tree. The cars will get easier to drive and safer in crashes, and the drivers will learn to pay attention in certain ways they previously didn't have to.

Will there still be occasional problems? Sure, but that doesn't mean that tying your career to horses would have been a wise move. Same here.

(Also, this article is about "poisoned ChatGPT-like tools." Which says very little about using the tools that most developers are using)

I'm always reminded of this: "Logged onto the World Wide Web, I hunt for the date of the Battle of Trafalgar. Hundreds of files show up, and it takes 15 minutes to unravel them—one's a biography written by an eighth grader, the second is a computer game that doesn't work and the third is an image of a London monument. None answers my question, and my search is periodically interrupted by messages like, "Too many connections, try again later."" -- Cliff Stoll, 1995


Counter-analogy: This is ultimately people copy-pasting the first answer they see from a web-forum. That's been bad advice for decades, and the same underlying problems remain because most of them involve human foibles and allocating attention.

What these tools change is making the process much faster and adding a (rather questionable) imprimatur of quality from a vendor that may not actually be a good curator of code-samples.


Re: your apparent derision of Cliff Stoll's writings, the OP results seem to speak to a trend he was among the first to point out in the book you cite from: people overwhelmingly bias towards the easiest to obtain information, even when they know that information is of worse quality than other information that's available but harder to get.


It was cited from a Newsweek article, and Cliff said this about it later: "Of my many mistakes, flubs, and howlers, few have been as public as my 1995 howler ... Now, whenever I think I know what's happening, I temper my thoughts: Might be wrong, Cliff ..."

You may be right about humans biasing toward easiest to obtain information, but that doesn't say "don't use AI assistance", it says "use care when using AI assistance".

Also, Cliff wasn't saying the information was easier to use, since in his case, it was actually harder to use than just looking it up in a printed encyclopedia or the like. But none of the problems he mentioned were inherent problems with the internet, they were because it was a brand new medium still working out its kinks. AI may well be harder to use for coding right now, at least for many use cases. However, a look at the bigger picture strongly suggests it is the future, just as a look at the bigger picture in 1995 would have suggested that the internet was the future, at least for answering questions like "when was the battle of Trafalgar?"

This is consistent with my horse/car analogy: the car wasn't the problem, the problem was people who assumed cars were going to keep themselves on the road like a horse would naturally do. You can get a huge gain, but you have to be smart about how you use it.


Imagine if a regular for profit startup did that. It gets 60 million in initial funding, and later their valuation goes up to 100 billion. Of course they can't just give the 60 million back.

This is different and has a lot of complications that are basically things we've never seen before, but still, just giving the 60 million back doesn't make any sense at all. They would've never achieved what they've achieved without his 60 million.


I don't see how opening it makes it safer. It's very different from security things, where some "white hat" can find a security, and they can then fix it so instances don't get hacked. Sure, a bad person could run the software without fixing the bug, but that isn't going to harm anyone but themselves.

That isn't the case here. If some well meaning person discovers a way that you can create a pandemic causing superbug, they can't just "fix" the AI to make that impossible. Not if it is open source. Very different thing.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: