From 810b5a0cc23e36fe0f120c6071b50b6d33a28c28 Mon Sep 17 00:00:00 2001 From: austinvhuang Date: Fri, 15 Mar 2024 14:10:24 -0400 Subject: [PATCH] Update README with more details on contributing code, add experimental/ directory, add READMEs for subdirectories, clean up DEVELOPER notes --- DEVELOPERS.md | 15 ++++++++------- README.md | 7 ++++++- examples/README.md | 7 +++++++ experimental/.gitkeep | 0 experimental/README.md | 3 +++ 5 files changed, 24 insertions(+), 8 deletions(-) create mode 100644 examples/README.md create mode 100644 experimental/.gitkeep create mode 100644 experimental/README.md diff --git a/DEVELOPERS.md b/DEVELOPERS.md index 43b3187..4e104b9 100644 --- a/DEVELOPERS.md +++ b/DEVELOPERS.md @@ -118,8 +118,7 @@ jax / pytorch / keras for NN deployments. ### Gemma struct contains all the state of the inference engine - tokenizer, weights, and activations -`Gemma(...)` - constructor, creates a gemma model object, which is a wrapper -around 3 things - the tokenizer object, weights, activations, and KV Cache. +`Gemma(...)` - constructor, creates a gemma model object. In a standard LLM chat app, you'll probably use a Gemma object directly, in more exotic data processing or research applications, you might decompose @@ -129,11 +128,13 @@ only using a Gemma object. ### Use the tokenizer in the Gemma object (or interact with the Tokenizer object directly) -You pretty much only do things with the tokenizer, call `Encode()` to go from -string prompts to token id vectors, or `Decode()` to go from token id vector -outputs from the model back to strings. +The Gemma object contains a pointer to a Tokenizer object. The main +operations performed on the tokenizer are to load the tokenizer model from a +file (usually `tokenizer.spm`), call `Encode()` to go from string prompts to +token id vectors, or call `Decode()` to go from token id vector outputs from the +model back to strings.
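To make the `Encode()`/`Decode()` round trip described in the hunk above concrete, here is a minimal standalone sketch. The toy class below merely splits on whitespace and assigns integer ids; the real gemma.cpp tokenizer wraps a SentencePiece model loaded from `tokenizer.spm`, so this class, its name, and its behavior are purely illustrative assumptions, not the library's API.

```cpp
#include <cassert>
#include <map>
#include <sstream>
#include <string>
#include <vector>

// Toy whitespace "tokenizer" illustrating only the Encode()/Decode()
// round-trip shape: string prompt -> token id vector -> string.
// (Hypothetical stand-in; gemma.cpp uses a SentencePiece model.)
class ToyTokenizer {
 public:
  // Map each whitespace-separated word to an integer id,
  // assigning fresh ids to previously unseen words.
  std::vector<int> Encode(const std::string& text) {
    std::vector<int> ids;
    std::istringstream in(text);
    std::string word;
    while (in >> word) {
      auto it = word_to_id_.find(word);
      if (it == word_to_id_.end()) {
        const int id = static_cast<int>(id_to_word_.size());
        word_to_id_[word] = id;
        id_to_word_.push_back(word);
        ids.push_back(id);
      } else {
        ids.push_back(it->second);
      }
    }
    return ids;
  }

  // Map ids back to words, rejoining with single spaces.
  std::string Decode(const std::vector<int>& ids) const {
    std::string out;
    for (size_t i = 0; i < ids.size(); ++i) {
      if (i > 0) out += ' ';
      out += id_to_word_[ids[i]];
    }
    return out;
  }

 private:
  std::map<std::string, int> word_to_id_;
  std::vector<std::string> id_to_word_;
};
```

The same usage shape applies with the real tokenizer: encode the prompt once before generation, decode emitted token ids as they stream back.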
-### The main entrypoint for generation is `GenerateGemma()` +### `GenerateGemma()` is the entrypoint for token generation Calling into `GenerateGemma` with a tokenized prompt will 1) mutate the activation values in `model` and 2) invoke StreamFunc - a lambda callback for @@ -150,7 +151,7 @@ constrained decoding type of use cases where you want to force the generation to fit a grammar. If you're not doing this, you can send an empty lambda as a no-op which is what `run.cc` does. -### If you want to invoke the neural network forward function directly call the `Transformer()` function +### `Transformer()` implements the inference (i.e. the `forward()` method in PyTorch or JAX) computation of the neural network For high-level applications, you might only call `GenerateGemma()` and never interact directly with the neural network, but if you're doing something a bit diff --git a/README.md b/README.md index b0b9aad..5b0df18 100644 --- a/README.md +++ b/README.md @@ -36,7 +36,12 @@ For production-oriented edge deployments we recommend standard deployment pathways using Python frameworks like JAX, Keras, PyTorch, and Transformers ([all model variations here](https://www.kaggle.com/models/google/gemma)). -Community contributions large and small are welcome. This project follows +## Contributing + +Community contributions large and small are welcome. See +[DEVELOPERS.md](https://github.com/google/gemma.cpp/blob/main/DEVELOPERS.md) +for additional notes for contributing developers and [join the Discord by following +this invite link](https://discord.gg/H5jCBAWxAe). This project follows [Google's Open Source Community Guidelines](https://opensource.google.com/conduct/). diff --git a/examples/README.md b/examples/README.md new file mode 100644 index 0000000..87eb54d --- /dev/null +++ b/examples/README.md @@ -0,0 +1,7 @@ +# Examples + +In this directory are some simple examples illustrating usage of `gemma.cpp` as +a library beyond the interactive `gemma` app implemented in `run.cc`.
+ +- `hello_world/` - minimal/template project for using `gemma.cpp` as a library. + It sets up the model state and generates text for a single hardcoded prompt. diff --git a/experimental/.gitkeep b/experimental/.gitkeep new file mode 100644 index 0000000..e69de29 diff --git a/experimental/README.md b/experimental/README.md new file mode 100644 index 0000000..2b6ff83 --- /dev/null +++ b/experimental/README.md @@ -0,0 +1,3 @@ +# Experimental + +This directory is for experimental code and features.
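The `GenerateGemma()` control flow described in the DEVELOPERS.md hunks above (a per-token stream callback plus an accept_token predicate for constrained decoding) can be sketched in isolation. The generator below is a mock loop over pre-chosen candidate tokens; the type aliases and the function signature are assumptions for illustration, not the actual gemma.cpp API.

```cpp
#include <cassert>
#include <functional>
#include <vector>

// StreamFunc: invoked once per emitted token; returning false stops
// generation early. AcceptFunc: consulted before emitting a candidate;
// returning false rejects it (constrained decoding). Both names mirror
// the callbacks discussed in DEVELOPERS.md but the signatures here are
// hypothetical.
using StreamFunc = std::function<bool(int token)>;
using AcceptFunc = std::function<bool(int token)>;

// Mock generator: walks a fixed candidate list instead of running the
// neural network, applying the accept/stream callbacks as described.
std::vector<int> GenerateSketch(const std::vector<int>& candidates,
                                const StreamFunc& stream,
                                const AcceptFunc& accept) {
  std::vector<int> out;
  for (int token : candidates) {
    if (!accept(token)) continue;  // constrained decoding: skip rejects
    out.push_back(token);
    if (!stream(token)) break;     // callback asked us to stop early
  }
  return out;
}
```

Passing `[](int) { return true; }` as the accept predicate makes it a no-op, analogous to the empty lambda `run.cc` sends when constrained decoding is not needed.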