Enhance README #421

Anakin100100 · 2025-11-21T19:26:08Z

I recently reanimated my laptop with an Nvidia GPU and installed Ubuntu there. I noticed this project and wanted to play with it but the documentation is quite sparse on the setup which can deter potential contributors and users.

This PR enhances the README with a step by step instructions on which software needs to be installed and how to use PufferLib to train and export a neural net. I tested the setup on a fresh container. The expected observations and comments on the training process are based on my limited knowledge of RL so I'd welcome your guidance there.

The setup instructions are centered on Ubuntu but provide alternative instructions for all major pieces of software that are required for other Linux flavors.

build_ocean.sh and build_simple.sh are converted to interactive scripts to enable alias loading for uses who need to alias specific binaries e.g. clang

When there isn't enough arguments to execute a command, a command is invalid or the user tries to access a help section for a command without specifying en environment which isn't defined a global HELP_MESSAGE is displayed instead of a massive error which can confuse new users.

It would be nice to rewrite the cli to typer https://typer.tiangolo.com/ to enable autocomplete and nicer documentation. Is this something that you think would be useful?

I really like the feeling of seeing the neural net train itself only from playing with itself. I wanted to write the environment for tic tac toe but I can see that there is already an open pr for this. Can you suggest a similarly difficult environment to contribute, potentially with a guide to writing one, similar to the readme?

leanke · 2025-11-21T19:32:00Z

https://puffer.ai/docs.html heres a link to the docs 😉

Anakin100100 · 2025-11-21T19:47:27Z

I know, I read them. It's just that the setup instructions and step by step instructions on training, exporting and using an env are missing. It could be a good idea to copy this to the docs and enhance the guide for writing a new env because this one is somewhat confusing because it doesn't provide a guide on writing a new env it just points to a quite simple one and then shows how to perform some tasks after it has already been written without explaining the process of designing the env, agent evaluation etc.

Having the end to end walkthrough in readme is imho important because a lot of people find this first so having it in both places would be useful.

elevatorguy · 2025-11-22T16:26:20Z

scripts/build_ocean.sh

@@ -1,4 +1,4 @@
-#!/bin/bash
+#!/bin/bash -i


Pufferlib aside, using dash by default - instead of bash - and running a compile or build script that relies upon .bashrc, for example oneapi initialization; thanks.

Anakin100100 added 7 commits November 21, 2025 17:50

save draft

3143ada

update the readme

45aa4ac

update installation instructions

5cbe943

remove target bin file

eb89ceb

display help message when --help used as env_name

7d4fa2e

make help message more helpful

03dabe2

fix typo

dccba1e

elevatorguy reviewed Nov 22, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enhance README #421

Enhance README #421

Uh oh!

Anakin100100 commented Nov 21, 2025

Uh oh!

leanke commented Nov 21, 2025

Uh oh!

Anakin100100 commented Nov 21, 2025 •

edited

Loading

Uh oh!

elevatorguy Nov 22, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -1,4 +1,4 @@
		#!/bin/bash
		#!/bin/bash -i

Enhance README #421

Are you sure you want to change the base?

Enhance README #421

Uh oh!

Conversation

Anakin100100 commented Nov 21, 2025

Uh oh!

leanke commented Nov 21, 2025

Uh oh!

Anakin100100 commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elevatorguy Nov 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Anakin100100 commented Nov 21, 2025 •

edited

Loading

elevatorguy Nov 22, 2025 •

edited

Loading