diff --git a/docs/source/_static/img/llama_ios_app.mp4 b/docs/source/_static/img/llama_ios_app.mp4
new file mode 100644
index 0000000000..fead47644d
Binary files /dev/null and b/docs/source/_static/img/llama_ios_app.mp4 differ
diff --git a/docs/source/_static/img/llama_ios_app.png b/docs/source/_static/img/llama_ios_app.png
new file mode 100644
index 0000000000..4f9020efb8
Binary files /dev/null and b/docs/source/_static/img/llama_ios_app.png differ
diff --git a/docs/source/demo-apps-ios.md b/docs/source/demo-apps-ios.md
index e04b6cae68..d68b1309e2 100644
--- a/docs/source/demo-apps-ios.md
+++ b/docs/source/demo-apps-ios.md
@@ -1 +1 @@
-```{include} ../../examples/demo-apps/apple_ios/README.md
+```{include} ../../examples/demo-apps/apple_ios/ExecuTorchDemo/README.md
diff --git a/docs/source/llm/getting-started.md b/docs/source/llm/getting-started.md
index a50ccb447e..def86568a8 100644
--- a/docs/source/llm/getting-started.md
+++ b/docs/source/llm/getting-started.md
@@ -758,6 +758,6 @@ The remaining steps are the same as the normal flow. Now you can run this module
## How to build Mobile Apps
You can execute an LLM using ExecuTorch on iOS and Android.
-**For iOS details see the [iOS Sample App](https://github.com/pytorch/executorch/tree/main/examples/demo-apps/apple_ios).**
+**For iOS see the [iLLaMA App](https://pytorch.org/executorch/main/llm/llama-demo-ios.html).**
**For Android, see the [Android Sample App](https://pytorch.org/executorch/main/llm/llama-demo-android.html).**
diff --git a/docs/source/llm/llama-demo-ios.md b/docs/source/llm/llama-demo-ios.md
new file mode 100644
index 0000000000..cc25a24f33
--- /dev/null
+++ b/docs/source/llm/llama-demo-ios.md
@@ -0,0 +1,2 @@
+```{include} ../../../examples/demo-apps/apple_ios/LLaMA/README.md
+```
\ No newline at end of file
diff --git a/examples/demo-apps/apple_ios/README.md b/examples/demo-apps/apple_ios/ExecuTorchDemo/README.md
similarity index 100%
rename from examples/demo-apps/apple_ios/README.md
rename to examples/demo-apps/apple_ios/ExecuTorchDemo/README.md
diff --git a/examples/demo-apps/apple_ios/LLaMA/README.md b/examples/demo-apps/apple_ios/LLaMA/README.md
new file mode 100644
index 0000000000..04a6eaef67
--- /dev/null
+++ b/examples/demo-apps/apple_ios/LLaMA/README.md
@@ -0,0 +1,37 @@
+# Building ExecuTorch LLaMA iOS Demo App
+
+This app demonstrates the use of the LLaMA chat app demonstrating local inference use case with ExecuTorch.
+
+data:image/s3,"s3://crabby-images/b330a/b330aac1c1be6a19e75f8f05c69286ad43534c84" alt="iOS LLaMA App"
+
+## Prerequisites
+* [Xcode 15](https://developer.apple.com/xcode).
+* [iOS 17 SDK](https://developer.apple.com/ios).
+* Set up your ExecuTorch repo and environment if you haven’t done so by following the [Setting up ExecuTorch](https://pytorch.org/executorch/stable/getting-started-setup) to set up the repo and dev environment.
+
+## Exporting models
+Please refer to the [ExecuTorch Llama2 docs](https://github.com/pytorch/executorch/blob/main/examples/models/llama2/README.md) to export the model.
+
+## Run the App
+
+1. Open the [project](https://github.com/pytorch/executorch/blob/main/examples/demo-apps/apple_ios/LLaMA/LLaMA.xcodeproj) in Xcode.
+2. Run the app (cmd+R).
+3. In app UI pick a model and tokenizer to use, type a prompt and tap the arrow buton as on the [video](../_static/img/llama_ios_app.mp4).
+
+```{note}
+ExecuTorch runtime is distributed as a Swift package providing some .xcframework as prebuilt binary targets. Xcode will dowload and cache the package on the first run, which will take some time.
+```
+
+## Copy the model to Simulator
+
+1. Drag&drop the model and tokenizer files onto the Simulator window and save them somewhere inside the iLLaMA folder.
+2. Pick the files in the app dialog, type a prompt and click the arrow-up button.
+
+## Copy the model to Device
+
+1. Wire-connect the device and open the contents in Finder.
+2. Navigate to the Files tab and drag&drop the model and tokenizer files onto the iLLaMA folder.
+3. Wait until the files are copied.
+
+## Reporting Issues
+If you encountered any bugs or issues following this tutorial please file a bug/issue here on [Github](https://github.com/pytorch/executorch/issues/new).
diff --git a/examples/models/llama2/README.md b/examples/models/llama2/README.md
index 4fefc37780..17abc9f5bc 100644
--- a/examples/models/llama2/README.md
+++ b/examples/models/llama2/README.md
@@ -218,11 +218,13 @@ adb push cmake-out-android/examples/models/llama2/llama_main /data/local/tmp/lla
```
adb shell "cd /data/local/tmp/llama && ./llama_main --model_path --tokenizer_path --prompt "Once upon a time" --seq_len 120
```
-## Step 6: Build iOS and/or Android apps
+## Step 6: Build Mobile apps
-TODO
+### iOS
+
+Please refer to [this tutorial](https://pytorch.org/executorch/main/llm/llama-demo-ios.html) to for full instructions on building the iOS LLAMA Demo App.
-### Android app
+### Android
Please refer to [this tutorial](https://pytorch.org/executorch/main/llm/llama-demo-android.html) to for full instructions on building the Android LLAMA Demo App.
# What is coming next?