diff --git a/.gitignore b/.gitignore index 1bffc02..2ad4d73 100644 --- a/.gitignore +++ b/.gitignore @@ -1,4 +1,5 @@ **/__pycache__ +.DS_Store dist *.egg-info diff --git a/README.md b/README.md index adad6d0..793b6ea 100644 --- a/README.md +++ b/README.md @@ -1,21 +1,35 @@ +
+<!-- banner image: assets/banner.svg -->
+
 # HandyLLM
 
-[![GitHub](https://img.shields.io/badge/github-HandyLLM-blue?logo=github)](https://github.com/atomiechen/HandyLLM) [![PyPI](https://img.shields.io/pypi/v/HandyLLM?logo=pypi&logoColor=white)](https://pypi.org/project/HandyLLM/)
+[![GitHub](https://img.shields.io/badge/github-HandyLLM-blue?logo=github)](https://github.com/atomiechen/HandyLLM) [![PyPI](https://img.shields.io/pypi/v/HandyLLM?logo=pypi&logoColor=white)](https://pypi.org/project/HandyLLM/) [![vsmarketplace](https://vsmarketplacebadges.dev/version-short/atomiechen.handyllm.svg)](https://marketplace.visualstudio.com/items?itemName=atomiechen.handyllm)
 
-A handy toolkit for using LLM.
+A handy toolkit for using LLMs, with both [***development support***](https://pypi.org/project/HandyLLM/) and [***VSCode editor support***](https://marketplace.visualstudio.com/items?itemName=atomiechen.handyllm).
 
 
 ## 🌟 Why HandyLLM?
 
-☯️ Both sync and async APIs supported with unified design
+📃 **Handy Prompt**: self-contained prompts in a human-friendly mark-up format `.hprompt`.
+
+- **Easy to write**: mark-up format, placeholder variables, request arguments, output logs... and, most importantly, VSCode syntax highlighting! (A minimal example is sketched right after this list.)
+- **Easy to run**: both CLI and APIs are available for parsing and running; run prompts with the CLI tool *WITHOUT* writing any code!
+- **Easy to chain**: chain `.hprompt` files to construct dynamic logic.
+
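For illustration, here is a minimal `.hprompt` sketch modeled on the `tests/assets/magic.hprompt` asset added in this change: the YAML frontmatter carries request arguments plus a `meta` block, role turns are marked with `$role$` lines, and `%...%` placeholders are filled from a variable map at run time (`%task%` is a hypothetical placeholder name used only for this example):

```
---
model: gpt-3.5-turbo
temperature: 0.2
meta:
  var_map_path: substitute.txt
---

$system$
You are a helpful assistant.

$user$
Please help me with %task%.
```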

+ +**Other features:** + +☯️ Unified API design with both sync and async support 🍡 OpenAI and Azure APIs all in one ☕️ Easy life with API endpoint management -📃 Writing chat prompt in a human-friendly mark-up format - ## Installation @@ -30,245 +44,11 @@ or, install from the Github repo to get latest updates: pip3 install git+https://github.com/atomiechen/handyllm.git ``` +Please check [HandyLLM VSCode extension](https://marketplace.visualstudio.com/items?itemName=atomiechen.handyllm) for editor support. -## Usage - -More example scripts are placed in [tests](./tests) folder. - -### Using OpenAIClient - -Each API method of `OpenAIClient` returns a `Requestor`, and you can execute its `call()` or `acall()` to get synchronous or asynchronous API calls. - -Synchronous API usage: - -```python -from handyllm import OpenAIClient -with OpenAIClient(api_key='') as client: - response = client.chat( - model="gpt-4-turbo", - messages=[{"role": "user", "content": "please tell me a joke"}] - ).call() ## note .call() here - print(response['choices'][0]['message']['content']) -``` - -Asynchronous API usage: - -```python -async with OpenAIClient('async', api_key='') as client_async: - response = await client_async.chat( - model="gpt-4-turbo", - messages=[{"role": "user", "content": "please tell me a joke"}] - ).acall() ## note .acall() here - print(response['choices'][0]['message']['content']) -``` - -You can instantiate a client that supports both modes: - -```python -client = OpenAIClient('sync') ## only supports sync APIs -client = OpenAIClient('async') ## only supports async APIs -client = OpenAIClient('both') ## supports both versions -``` - - - -### Legacy: Using OpenAIAPI proxy - -> [!IMPORTANT] -> This is not recommended anymore. Use `OpenAIClient` instead. - -Under the hood it connects to a module client and only provides **synchronous** APIs, **without** `call()`. - -```python -from handyllm import OpenAIAPI -OpenAIAPI.api_key = '' -response = OpenAIAPI.chat( - model="gpt-4-turbo", - messages=[{"role": "user", "content": "please tell me a joke"}] -) ## no .call() here -print(response['choices'][0]['message']['content']) -``` - - - -## OpenAI API Request - -### Endpoints - -Each API request will connect to an endpoint along with some API configurations, which include: - -| | Description | Value | -| ---------------- | ------------------------------------------------------------ | ----------------------- | -| api_type | API type. Defaults to `openai`. | str: `openai` / `azure` | -| api_base | API base url. Defaults to OpenAI base url. | str | -| api_key | API key. | str | -| organization | Organization. | str | -| api_version | API version. **Must be provided for Azure end-points.** | str | -| model_engine_map | Map model name to engine name. Useful for Azure end-points if you have custom model names. | dict | - -An `Endpoint` object contains these information. An `EndpointManager` acts like a list and can be used to rotate the next endpoint. See [test_endpoint.py](./tests/test_endpoint.py). 
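Although this endpoint documentation is moving out of the README, a brief sketch of the rotation idea may help. The `endpoint_manager` keyword is documented in the configuration table below; the `Endpoint` constructor keywords and the list-style `append` are assumptions here, based on the configuration table and the "acts like a list" description (see [test_endpoint.py](./tests/test_endpoint.py) for the actual usage):

```python
from handyllm import OpenAIClient, Endpoint, EndpointManager

# Assumed keyword arguments, mirroring the configuration table in this section;
# all values are placeholders.
ep_main = Endpoint(api_type="openai", api_key="sk-xxx")
ep_azure = Endpoint(
    api_type="azure",
    api_base="https://my-resource.openai.azure.com",  # hypothetical resource URL
    api_key="xxx",
    api_version="2023-05-15",
)

manager = EndpointManager()
manager.append(ep_main)   # assumed list-style API, per "acts like a list"
manager.append(ep_azure)

with OpenAIClient() as client:
    response = client.chat(
        model="gpt-4-turbo",
        messages=[{"role": "user", "content": "please tell me a joke"}],
        endpoint_manager=manager,  # each call rotates to the next endpoint
    ).call()
```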
- -Methods for configuring endpoint info (values will be inferred in **top-town** order): - -| Configuration method | Description | -| -------------------------------------------------- | ------------------------------------------------------------ | -| API keyword parameters | e.g.: `chat(api_key='xxx', ...)` | -| API `endpoint` keyword parameter | Providing an `Endpoint`, e.g.: `chat(endpoint=MyEndpoint)` | -| API `endpoint_manager` keyword parameter | Providing an `EndpointManager`, e.g.: `chat(endpoint_manager=MyEndpointManager)` | -| `OpenAIClient` instance (or `OpenAIAPI`) variables | e.g.: `client.api_key = 'xxx'` / `OpenAIAPI.api_key = 'xxx'` | -| Environment variables | `OPENAI_API_KEY`, `OPENAI_ORGANIZATION`/`OPENAI_ORG_ID`, `OPENAI_API_BASE`, `OPENAI_API_TYPE`, `OPENAI_API_VERSION`, `MODEL_ENGINE_MAP`. | - -> [!TIP] -> -> **Azure OpenAI APIs are supported:** Specify `api_type='azure'`, and set `api_base` and `api_key` accordingly. Set `model_engine_map` if you want to use `model` parameter instead of `engine`/`deployment_id`. See [test_azure.py](./tests/test_azure.py). Please refer to [Azure OpenAI Service Documentation](https://learn.microsoft.com/en-us/azure/ai-services/openai/) for details. - -### Logger - -You can pass custom `logger` and `log_marks` (a string or a collection of strings) to `chat`/`completions` to get input and output logging. - -### Timeout control -This toolkit supports client-side `timeout` control: +## Documentation -```python -from handyllm import OpenAIClient -client = OpenAIClient() -prompt = [{ - "role": "user", - "content": "please tell me a joke" - }] -response = client.chat( - model="gpt-3.5-turbo", - messages=prompt, - timeout=10 - ).call() -print(response['choices'][0]['message']['content']) -``` - -### Stream response - -Stream response of `chat`/`completions`/`finetunes_list_events` can be achieved using `steam` parameter: - -```python -from handyllm import OpenAIClient, stream_chat - -client = OpenAIClient() -response = client.chat( - model="gpt-3.5-turbo", - messages=prompt, - timeout=10, - stream=True - ).call() - -# you can use this to stream the response text -for text in stream_chat(response): - print(text, end='') - -# or you can use this to get the whole response -# for chunk in response: -# if 'content' in chunk['choices'][0]['delta']: -# print(chunk['choices'][0]['delta']['content'], end='') -``` - -### Supported APIs - -- chat -- completions -- edits -- embeddings -- models_list -- models_retrieve -- moderations -- images_generations -- images_edits -- images_variations -- audio_transcriptions -- audtio_translations -- files_list -- files_upload -- files_delete -- files_retrieve -- files_retrieve_content -- finetunes_create -- finetunes_list -- finetunes_retrieve -- finetunes_cancel -- finetunes_list_events -- finetunes_delete_model - -Please refer to [OpenAI official API reference](https://platform.openai.com/docs/api-reference) for details. - - - -## Chat Prompt - -### Prompt Conversion - -`PromptConverter` can convert this text file `prompt.txt` into a structured prompt for chat API calls: - -``` -$system$ -You are a helpful assistant. - -$user$ -Please help me merge the following two JSON documents into one. - -$assistant$ -Sure, please give me the two JSON documents. - -$user$ -{ - "item1": "It is really a good day." -} -{ - "item2": "Indeed." 
-} -%output_format% -%misc1% -%misc2% -``` - -```python -from handyllm import PromptConverter -converter = PromptConverter() - -# chat can be used as the message parameter for OpenAI API -chat = converter.rawfile2chat('prompt.txt') - -# variables wrapped in %s can be replaced at runtime -new_chat = converter.chat_replace_variables( - chat, - { - r'%misc1%': 'Note1: do not use any bad word.', - r'%misc2%': 'Note2: be optimistic.', - } -) -``` - -> [!IMPORTANT] -> -> About the prompt format, each role key (e.g. `$system$` / `$user$` / `$assistant`) should be placed in a separate line. - -### Substitute - -`PromptConverter` can also substitute placeholder variables like `%output_format%` stored in text files to make multiple prompts modular. A substitute map `substitute.txt` looks like this: - -``` -%output_format% -Please output a SINGLE JSON object that contains all items from the two input JSON objects. - -%variable1% -Placeholder text. - -%variable2% -Placeholder text. -``` - -```python -from handyllm import PromptConverter -converter = PromptConverter() -converter.read_substitute_content('substitute.txt') # read substitute map -chat = converter.rawfile2chat('prompt.txt') # variables are substituted already -``` +Please check out our [wiki](https://github.com/atomiechen/HandyLLM/wiki) for comprehensive guides ^_^ diff --git a/assets/banner.svg b/assets/banner.svg new file mode 100644 index 0000000..c1d64a6 --- /dev/null +++ b/assets/banner.svg @@ -0,0 +1,39 @@ + + + + + + + + + diff --git a/pyproject.toml b/pyproject.toml index 4d70e99..16426d5 100644 --- a/pyproject.toml +++ b/pyproject.toml @@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta" [project] name = "HandyLLM" -version = "0.6.3" +version = "0.7.0" authors = [ { name="Atomie CHEN", email="atomic_cwh@163.com" }, ] @@ -19,12 +19,16 @@ classifiers = [ keywords = ["LLM", "Large Language Model", "Prompt", "OpenAI", "API"] dependencies = [ "requests", - "httpx" + "httpx", + "python-frontmatter", + "mergedeep", + "python-dotenv", + "PyYAML", ] [project.urls] "Homepage" = "https://github.com/atomiechen/HandyLLM" "Bug Tracker" = "https://github.com/atomiechen/HandyLLM/issues" -[project.optional-dependencies] -test = ["python-dotenv"] +[project.scripts] +handyllm = "handyllm.__main__:cli" diff --git a/src/handyllm/__init__.py b/src/handyllm/__init__.py index b2ee3c9..c95b646 100644 --- a/src/handyllm/__init__.py +++ b/src/handyllm/__init__.py @@ -4,3 +4,4 @@ from .endpoint_manager import Endpoint, EndpointManager from .prompt_converter import PromptConverter from .utils import * +from .hprompt import * diff --git a/src/handyllm/__main__.py b/src/handyllm/__main__.py new file mode 100644 index 0000000..491a4d1 --- /dev/null +++ b/src/handyllm/__main__.py @@ -0,0 +1,5 @@ +from .cli import cli + + +if __name__ == "__main__": + cli() diff --git a/src/handyllm/cli.py b/src/handyllm/cli.py new file mode 100644 index 0000000..7fa651f --- /dev/null +++ b/src/handyllm/cli.py @@ -0,0 +1,60 @@ +import argparse + + +def register_hprompt_command(subparsers: argparse._SubParsersAction): + parser_hprompt = subparsers.add_parser( + 'hprompt', + help="Run hprompt files", + description="Run hprompt files", + formatter_class=argparse.ArgumentDefaultsHelpFormatter, + ) + parser_hprompt.add_argument("path", nargs='+', help="Path(s) to hprompt file") + parser_hprompt.add_argument("-o", "--output", help="Output path; if not provided, output to stderr") + parser_hprompt.add_argument("-v", "--verbose", action="store_true", default=False, help="Verbose 
output") + parser_hprompt.add_argument("-vm", "--var-map", help="Variable map in the format key1=value1|key2=value2") + parser_hprompt.add_argument("-vmp", "--var-map-path", help="Variable map file path") + +def hprompt_command(args): + import sys + from handyllm import hprompt + + run_config = hprompt.RunConfig() + if args.var_map: + var_map = {} + for pair in args.var_map.split("|"): + key, value = pair.split("=", maxsplit=1) + var_map[key.strip()] = value.strip() + run_config.var_map = var_map + if args.var_map_path: + run_config.var_map_path = args.var_map_path + if args.output: + run_config.output_path = args.output + else: + run_config.output_fd = sys.stderr + if args.verbose: + run_config.verbose = True + print(f"Input paths: {args.path}", file=sys.stderr) + prompt = hprompt.load_from(args.path[0]) + result_prompt = prompt.run(run_config=run_config) + for next_path in args.path[1:]: + prompt += result_prompt + prompt += hprompt.load_from(next_path) + result_prompt = prompt.run(run_config=run_config) + +def cli(): + """Main entry point for the handyllm CLI.""" + parser = argparse.ArgumentParser( + prog="handyllm", + description="HandyLLM CLI", + formatter_class=argparse.ArgumentDefaultsHelpFormatter, + ) + subparsers = parser.add_subparsers(dest="command") + register_hprompt_command(subparsers) + args = parser.parse_args() + if args.command == "hprompt": + return hprompt_command(args) + + +if __name__ == "__main__": + cli() + diff --git a/src/handyllm/hprompt.py b/src/handyllm/hprompt.py new file mode 100644 index 0000000..333b6ac --- /dev/null +++ b/src/handyllm/hprompt.py @@ -0,0 +1,822 @@ +from __future__ import annotations +__all__ = [ + "HandyPrompt", + "ChatPrompt", + "CompletionsPrompt", + "loads", + "load", + "load_from", + "dumps", + "dump", + "dump_to", + "load_var_map", + "RunConfig", + "RecordRequestMode", +] + +import json +import re +import copy +import io +import os +import sys +from pathlib import Path +from datetime import datetime +from typing import Optional, Union, TypeVar +from enum import Enum, auto +from abc import abstractmethod, ABC +from dataclasses import dataclass, asdict, fields, replace + +import yaml +import frontmatter +from mergedeep import merge, Strategy +from dotenv import load_dotenv + +from .prompt_converter import PromptConverter +from .openai_client import ClientMode, OpenAIClient +from .utils import ( + astream_chat_with_role, astream_completions, + stream_chat_with_role, stream_completions, +) + + +PromptType = TypeVar('PromptType', bound='HandyPrompt') +PathType = Union[str, os.PathLike[str]] + +converter = PromptConverter() +handler = frontmatter.YAMLHandler() +p_var_map = re.compile(r'(%\w+%)') + +DEFAULT_BLACKLIST = ( + "api_key", "organization", "api_base", "api_type", "api_version", + "endpoint_manager", "endpoint", "engine", "deployment_id", + "model_engine_map", "dest_url", +) + + +def loads( + text: str, + encoding: str = "utf-8", + base_path: Optional[PathType] = None +) -> HandyPrompt: + if handler.detect(text): + metadata, content = frontmatter.parse(text, encoding, handler) + meta = metadata.pop("meta", None) or {} + request = metadata + else: + content = text + request = {} + meta = {} + api: str = meta.get("api", "") + if api: + api = api.lower() + if api.startswith("completion"): + api = "completions" + else: + api = "chat" + else: + if converter.detect(content): + api = "chat" + else: + api = "completions" + if api == "completions": + return CompletionsPrompt(content, request, meta, base_path) + else: + chat = 
converter.raw2chat(content) + return ChatPrompt(chat, request, meta, base_path) + +def load( + fd: io.IOBase, + encoding: str = "utf-8", + base_path: Optional[PathType] = None +) -> HandyPrompt: + text = fd.read() + return loads(text, encoding, base_path=base_path) + +def load_from( + path: PathType, + encoding: str = "utf-8" +) -> HandyPrompt: + with open(path, "r", encoding=encoding) as fd: + return load(fd, encoding, base_path=Path(path).parent.resolve()) + +def dumps( + prompt: HandyPrompt, + base_path: Optional[PathType] = None +) -> str: + return prompt.dumps(base_path) + +def dump( + prompt: HandyPrompt, + fd: io.IOBase, + base_path: Optional[PathType] = None +) -> None: + return prompt.dump(fd, base_path) + +def dump_to( + prompt: HandyPrompt, + path: PathType +) -> None: + return prompt.dump_to(path) + +def load_var_map(path: PathType) -> dict[str, str]: + # read all content that needs to be replaced in the prompt from a text file + with open(path, 'r', encoding='utf-8') as fin: + content = fin.read() + substitute_map = {} + blocks = p_var_map.split(content) + for idx in range(1, len(blocks), 2): + key = blocks[idx] + value = blocks[idx+1] + substitute_map[key] = value.strip() + return substitute_map + + +class RecordRequestMode(Enum): + BLACKLIST = auto() # record all request arguments except specified ones + WHITELIST = auto() # record only specified request arguments + NONE = auto() # record no request arguments + ALL = auto() # record all request arguments + + +@dataclass +class RunConfig: + # record request arguments + record_request: Optional[RecordRequestMode] = None # default: RecordRequestMode.BLACKLIST + record_blacklist: Optional[list[str]] = None # default: DEFAULT_BLACKLIST + record_whitelist: Optional[list[str]] = None + # variable map + var_map: Optional[dict[str, str]] = None + # variable map file path + var_map_path: Optional[PathType] = None + # output the result to a file or a file descriptor + output_path: Optional[PathType] = None + output_fd: Optional[io.IOBase] = None + # output the evaluated prompt to a file or a file descriptor + output_evaled_prompt_path: Optional[PathType] = None + output_evaled_prompt_fd: Optional[io.IOBase] = None + # credential file path + credential_path: Optional[PathType] = None + # credential type: env, json, yaml + # if env, load environment variables from the credential file + # if json or yaml, load the content of the file as request arguments + credential_type: Optional[str] = None # default: guess from the file extension + + # verbose output to stderr + verbose: Optional[bool] = None # default: False + + def __len__(self): + return len([f for f in fields(self) if getattr(self, f.name) is not None]) + + @classmethod + def from_dict(cls, obj: dict, base_path: Optional[PathType] = None): + input_kwargs = {} + for field in fields(cls): + if field.name in obj: + input_kwargs[field.name] = obj[field.name] + # convert string to Enum + record_str = input_kwargs.get("record_request") + if record_str is not None: + input_kwargs["record_request"] = RecordRequestMode[record_str.upper()] + # add base_path to path fields and convert to resolved path + if base_path: + for path_field in ("output_path", "output_evaled_prompt_path", "var_map_path", "credential_path"): + if path_field in input_kwargs: + org_path = input_kwargs[path_field] + new_path = str(Path(base_path, org_path).resolve()) + # retain trailing slash + if org_path.endswith(('/')): + new_path += '/' + input_kwargs[path_field] = new_path + return cls(**input_kwargs) + + def 
pretty_print(self, file=sys.stderr): + print("RunConfig:", file=file) + for field in fields(self): + value = getattr(self, field.name) + if value is not None: + print(f" {field.name}: {value}", file=file) + + def to_dict(self, retain_fd=False, base_path: Optional[PathType] = None) -> dict: + # record and remove file descriptors + tmp_output_fd = self.output_fd + tmp_output_evaled_prompt_fd = self.output_evaled_prompt_fd + self.output_fd = None + self.output_evaled_prompt_fd = None + # convert to dict + obj = asdict(self, dict_factory=lambda x: { k: v for k, v in x if v is not None }) + # restore file descriptors + self.output_fd = tmp_output_fd + self.output_evaled_prompt_fd = tmp_output_evaled_prompt_fd + if retain_fd: + # keep file descriptors + obj["output_fd"] = self.output_fd + obj["output_evaled_prompt_fd"] = self.output_evaled_prompt_fd + # convert Enum to string + record_enum = obj.get("record_request") + if record_enum is not None: + obj["record_request"] = obj["record_request"].name + # convert path to relative path + if base_path: + for path_field in ("output_path", "output_evaled_prompt_path", "var_map_path", "credential_path"): + if path_field in obj: + org_path = obj[path_field] + try: + new_path = str(Path(org_path).relative_to(base_path)) + obj[path_field] = new_path + except ValueError: + # org_path is not under base_path, keep the original path + pass + return obj + + def merge(self, other: RunConfig, inplace=False) -> RunConfig: + # merge the RunConfig object with another RunConfig object + # return a new RunConfig object if inplace is False + if not inplace: + new_run_config = replace(self) + else: + new_run_config = self + for field in fields(new_run_config): + v = getattr(other, field.name) + if v is not None: + setattr(new_run_config, field.name, v) + return new_run_config + + +DEFAULT_CONFIG = RunConfig() + + +class HandyPrompt(ABC): + + TEMPLATE_OUTPUT_FILENAME = "result.%Y%m%d-%H%M%S.hprompt" + TEMPLATE_OUTPUT_EVAL_FILENAME = "evaled.%Y%m%d-%H%M%S.hprompt" + + def __init__( + self, data: Union[str, list], request: Optional[dict] = None, + meta: Optional[Union[dict, RunConfig]] = None, + base_path: Optional[PathType] = None): + self.data = data + self.request = request or {} + # parse meta to run_config + if isinstance(meta, RunConfig): + self.run_config = meta + else: + self.run_config = RunConfig.from_dict(meta or {}, base_path=base_path) + self.base_path = base_path + + def __str__(self) -> str: + return str(self.data) + + def __repr__(self) -> str: + return "{}({}, {}, {})".format( + self.__class__.__name__, + repr(self.data), + repr(self.request), + repr(self.run_config) + ) + + @property + def result_str(self) -> str: + return str(self.data) + + def _serialize_data(self, data) -> str: + ''' + Serialize the data to a string. + This method can be overridden by subclasses. 
+ ''' + return str(data) + + @staticmethod + def _dumps_frontmatter(request: dict, run_config: RunConfig, base_path: Optional[PathType] = None) -> str: + # dump frontmatter + if not run_config and not request: + return "" + front_data = copy.deepcopy(request) + if run_config: + front_data['meta'] = run_config.to_dict(retain_fd=False, base_path=base_path) + post = frontmatter.Post("", None, **front_data) + return frontmatter.dumps(post, handler).strip() + "\n\n" + + @classmethod + def _dumps(cls, request, run_config: RunConfig, content: str, base_path: Optional[PathType] = None) -> str: + return cls._dumps_frontmatter(request, run_config, base_path) + content + + def dumps(self, base_path: Optional[PathType] = None) -> str: + serialized_data = self._serialize_data(self.data) + base_path = base_path or self.base_path + return self._dumps(self.request, self.run_config, serialized_data, base_path) + + def dump(self, fd: io.IOBase, base_path: Optional[PathType] = None) -> None: + text = self.dumps(base_path=base_path) + fd.write(text) + + def dump_to(self, path: PathType) -> None: + with open(path, "w", encoding="utf-8") as fd: + self.dump(fd, base_path=Path(path).parent.resolve()) + + @abstractmethod + def _eval_data(self: PromptType, run_config: RunConfig) -> Union[str, list]: + ... + + def eval(self: PromptType, run_config: RunConfig) -> PromptType: + new_data = self._eval_data(run_config) + if new_data != self.data: + return self.__class__( + new_data, + copy.deepcopy(self.request), + replace(self.run_config), + ) + return self + + def eval_run_config( + self: PromptType, + run_config: RunConfig, + ) -> RunConfig: + # merge runtime run_config with the original run_config + run_config = self.run_config.merge(run_config) + + start_time = datetime.now() + if run_config.output_path: + run_config.output_path = self._prepare_output_path( + run_config.output_path, start_time, self.TEMPLATE_OUTPUT_FILENAME + ) + if run_config.output_evaled_prompt_path: + run_config.output_evaled_prompt_path = self._prepare_output_path( + run_config.output_evaled_prompt_path, start_time, + self.TEMPLATE_OUTPUT_EVAL_FILENAME + ) + + if run_config.credential_path: + if not run_config.credential_type: + # guess the credential type from the file extension + p = Path(run_config.credential_path) + if p.suffix: + run_config.credential_type = p.suffix[1:] + if run_config.credential_type == "yml": + run_config.credential_type = "yaml" + else: + run_config.credential_type = 'env' + run_config.credential_type = run_config.credential_type.lower() + return run_config + + @abstractmethod + def _run_with_client( + self: PromptType, + client: OpenAIClient, + run_config: RunConfig, + new_request: dict, + stream: bool, + ) -> PromptType: + ... + + def run( + self: PromptType, + client: OpenAIClient = None, + run_config: RunConfig = DEFAULT_CONFIG, + **kwargs) -> PromptType: + run_config, new_request, stream = self._prepare_run(run_config, kwargs) + if client: + new_prompt = self._run_with_client(client, run_config, new_request, stream) + else: + with OpenAIClient(ClientMode.SYNC) as client: + new_prompt = self._run_with_client(client, run_config, new_request, stream) + self._post_check_output(stream, run_config, new_prompt) + return new_prompt + + @abstractmethod + async def _arun_with_client( + self: PromptType, + client: OpenAIClient, + run_config: RunConfig, + new_request: dict, + stream: bool, + ) -> PromptType: + ... 
+ + async def arun( + self: PromptType, + client: OpenAIClient = None, + run_config: RunConfig = DEFAULT_CONFIG, + **kwargs) -> PromptType: + run_config, new_request, stream = self._prepare_run(run_config, kwargs) + if client: + new_prompt = await self._arun_with_client(client, run_config, new_request, stream) + else: + async with OpenAIClient(ClientMode.ASYNC) as client: + new_prompt = await self._arun_with_client(client, run_config, new_request, stream) + self._post_check_output(stream, run_config, new_prompt) + return new_prompt + + def _prepare_output_path( + self, output_path: PathType, start_time: datetime, template_filename: str + ) -> str: + output_path = str(output_path).strip() + p = Path(output_path) + if p.is_dir() or output_path.endswith(('/')): + # output_path wants to be a directory, append the default filename + output_path = Path(output_path, template_filename) + # format output_path with the current time + output_path = start_time.strftime(str(output_path)) + # create the parent directory if it does not exist + Path(output_path).parent.mkdir(parents=True, exist_ok=True) + return output_path + + def _prepare_run(self: PromptType, run_config: RunConfig, kwargs: dict): + # update the request with the keyword arguments + new_request = copy.deepcopy(self.request) + new_request.update(kwargs) + # get the stream flag + stream = new_request.get("stream", False) + + # evaluate the run_config + run_config = self.eval_run_config(run_config) + + # verbose output + if run_config.verbose: + print("---", file=sys.stderr) + print("NEW RUN") + print(f"Start time: {datetime.now()}", file=sys.stderr) + run_config.pretty_print() + print("---", file=sys.stderr) + + # load the credential file + if run_config.credential_path: + if run_config.credential_type == "env": + load_dotenv(run_config.credential_path, override=True) + elif run_config.credential_type in ("json", "yaml"): + with open(run_config.credential_path, 'r', encoding='utf-8') as fin: + if run_config.credential_type == "json": + credential_dict = json.load(fin) + else: + credential_dict = yaml.safe_load(fin) + new_request.update(credential_dict) + else: + raise ValueError(f"unsupported credential type: {run_config.credential_type}") + + # output the evaluated prompt to a file or a file descriptor + if run_config.output_evaled_prompt_path \ + or run_config.output_evaled_prompt_fd: + evaled_data = self._eval_data(run_config) + serialized_data = self._serialize_data(evaled_data) + text = self._dumps( + self.request, run_config, serialized_data, + Path(run_config.output_evaled_prompt_path).parent.resolve() \ + if run_config.output_evaled_prompt_path else None + ) + if run_config.output_evaled_prompt_path: + with open(run_config.output_evaled_prompt_path, 'w', encoding='utf-8') as fout: + fout.write(text) + elif run_config.output_evaled_prompt_fd: + run_config.output_evaled_prompt_fd.write(text) + return run_config, new_request, stream + + def _post_check_output(self: PromptType, stream: bool, run_config: RunConfig, new_prompt: PromptType): + if not stream: + # if stream is True, the response is already streamed to + # a file or a file descriptor + if run_config.output_path: + new_prompt.dump_to(run_config.output_path) + elif run_config.output_fd: + new_prompt.dump(run_config.output_fd) + return new_prompt + + def _merge_non_data(self: PromptType, other: PromptType, inplace=False) -> Union[None, tuple[dict, RunConfig]]: + if inplace: + merge(self.request, other.request, strategy=Strategy.ADDITIVE) + 
self.run_config.merge(other.run_config, inplace=True) + else: + merged_request = merge({}, self.request, other.request, strategy=Strategy.ADDITIVE) + merged_run_config = self.run_config.merge(other.run_config) + return merged_request, merged_run_config + + def _filter_request( + self, request: dict, + run_config: RunConfig, + ) -> dict: + if run_config.record_request == RecordRequestMode.WHITELIST: + if run_config.record_whitelist: + request = {key: value for key, value in request.items() if key in run_config.record_whitelist} + else: + request = {} + elif run_config.record_request == RecordRequestMode.NONE: + request = {} + elif run_config.record_request == RecordRequestMode.ALL: + pass + else: + # default: blacklist + # will modify the original request + real_blacklist = run_config.record_blacklist or DEFAULT_BLACKLIST + for key in real_blacklist: + request.pop(key, None) + return request + + def _parse_var_map(self, run_config: RunConfig): + var_map = {} + if run_config.var_map_path: + var_map = merge( + var_map, + load_var_map(run_config.var_map_path), + strategy=Strategy.REPLACE + ) + if run_config.var_map: + var_map = merge( + var_map, + run_config.var_map, + strategy=Strategy.REPLACE + ) + return var_map + + +class ChatPrompt(HandyPrompt): + + def __init__(self, chat: list, request: dict, meta: Union[dict, RunConfig], base_path: Optional[PathType] = None): + super().__init__(chat, request, meta, base_path) + + @property + def chat(self) -> list: + return self.data + + @chat.setter + def chat(self, value: list): + self.data = value + + @property + def result_str(self) -> str: + if len(self.chat) == 0: + return "" + return self.chat[-1]['content'] + + def _serialize_data(self, data) -> str: + return converter.chat2raw(data) + + def _eval_data(self, run_config: RunConfig) -> list: + var_map = self._parse_var_map(run_config) + if var_map: + return converter.chat_replace_variables( + self.chat, var_map, inplace=False) + else: + return self.chat + + def _stream_chat_proc(self, response, fd: Optional[io.IOBase] = None) -> tuple[str, str]: + # stream response to fd + role = "" + content = "" + for r, text in stream_chat_with_role(response): + if r != role: + role = r + if fd: + fd.write(f"${role}$\n") + elif fd: + fd.write(text) + content += text + return role, content + + def _run_with_client( + self, client: OpenAIClient, + run_config: RunConfig, + new_request: dict, + stream: bool, + ) -> ChatPrompt: + response = client.chat( + messages=self._eval_data(run_config), + **new_request + ).call() + new_request = self._filter_request(new_request, run_config) + base_path = Path(run_config.output_path).parent.resolve() if run_config.output_path else None + if stream: + if run_config.output_path: + # stream response to a file + with open(run_config.output_path, 'w', encoding='utf-8') as fout: + # dump frontmatter + fout.write(self._dumps_frontmatter(new_request, run_config, base_path)) + role, content = self._stream_chat_proc(response, fout) + elif run_config.output_fd: + # dump frontmatter, no base_path + run_config.output_fd.write(self._dumps_frontmatter(new_request, run_config)) + # stream response to a file descriptor + role, content = self._stream_chat_proc(response, run_config.output_fd) + else: + role, content = self._stream_chat_proc(response) + else: + role = response['choices'][0]['message']['role'] + content = response['choices'][0]['message']['content'] + return ChatPrompt( + [{"role": role, "content": content}], + new_request, run_config, base_path + ) + + async def 
_astream_chat_proc(self, response, fd: Optional[io.IOBase] = None) -> tuple[str, str]: + # stream response to fd + role = "" + content = "" + async for r, text in astream_chat_with_role(response): + if r != role: + role = r + if fd: + fd.write(f"${role}$\n") + elif fd: + fd.write(text) + content += text + return role, content + + async def _arun_with_client( + self, client: OpenAIClient, + run_config: RunConfig, + new_request: dict, + stream: bool, + ) -> ChatPrompt: + response = await client.chat( + messages=self._eval_data(run_config), + **new_request + ).acall() + new_request = self._filter_request(new_request, run_config) + base_path = Path(run_config.output_path).parent.resolve() if run_config.output_path else None + if stream: + if run_config.output_path: + # stream response to a file + with open(run_config.output_path, 'w', encoding='utf-8') as fout: + fout.write(self._dumps_frontmatter(new_request, run_config, base_path)) + role, content = await self._astream_chat_proc(response, fout) + elif run_config.output_fd: + # stream response to a file descriptor + run_config.output_fd.write(self._dumps_frontmatter(new_request, run_config)) + role, content = await self._astream_chat_proc(response, run_config.output_fd) + else: + role, content = await self._astream_chat_proc(response) + else: + role = response['choices'][0]['message']['role'] + content = response['choices'][0]['message']['content'] + return ChatPrompt( + [{"role": role, "content": content}], + new_request, run_config, base_path + ) + + def __add__(self, other: Union[str, list, ChatPrompt]): + # support concatenation with string, list or another ChatPrompt + if isinstance(other, str): + return ChatPrompt( + self.chat + [{"role": "user", "content": other}], + copy.deepcopy(self.request), + replace(self.run_config), + self.base_path + ) + elif isinstance(other, list): + return ChatPrompt( + self.chat + [{"role": msg['role'], "content": msg['content']} for msg in other], + copy.deepcopy(self.request), + replace(self.run_config), + self.base_path + ) + elif isinstance(other, ChatPrompt): + # merge two ChatPrompt objects + merged_request, merged_run_config = self._merge_non_data(other) + return ChatPrompt( + self.chat + other.chat, merged_request, merged_run_config, + self.base_path + ) + else: + raise TypeError(f"unsupported operand type(s) for +: 'ChatPrompt' and '{type(other)}'") + + def __iadd__(self, other: Union[str, list, ChatPrompt]): + # support concatenation with string, list or another ChatPrompt + if isinstance(other, str): + self.chat.append({"role": "user", "content": other}) + elif isinstance(other, list): + self.chat += [{"role": msg['role'], "content": msg['content']} for msg in other] + elif isinstance(other, ChatPrompt): + # merge two ChatPrompt objects + self.chat += other.chat + self._merge_non_data(other, inplace=True) + else: + raise TypeError(f"unsupported operand type(s) for +: 'ChatPrompt' and '{type(other)}'") + return self + + +class CompletionsPrompt(HandyPrompt): + + def __init__(self, prompt: str, request: dict, meta: Union[dict, RunConfig], base_path: PathType = None): + super().__init__(prompt, request, meta, base_path) + + @property + def prompt(self) -> str: + return self.data + + @prompt.setter + def prompt(self, value: str): + self.data = value + + def _eval_data(self, run_config: RunConfig) -> str: + var_map = self._parse_var_map(run_config) + if var_map: + new_prompt = self.prompt + for key, value in var_map.items(): + new_prompt = new_prompt.replace(key, value) + return new_prompt + else: + 
return self.prompt + + def _stream_completions_proc(self, response, fd: Optional[io.IOBase] = None) -> str: + # stream response to fd + content = "" + for text in stream_completions(response): + if fd: + fd.write(text) + content += text + return content + + def _run_with_client( + self, client: OpenAIClient, + run_config: RunConfig, + new_request: dict, + stream: bool, + ) -> CompletionsPrompt: + response = client.completions( + prompt=self._eval_data(run_config), + **new_request + ).call() + new_request = self._filter_request(new_request, run_config) + base_path = Path(run_config.output_path).parent.resolve() if run_config.output_path else None + if stream: + if run_config.output_path: + # stream response to a file + with open(run_config.output_path, 'w', encoding='utf-8') as fout: + fout.write(self._dumps_frontmatter(new_request, run_config, base_path)) + content = self._stream_completions_proc(response, fout) + elif run_config.output_fd: + # stream response to a file descriptor + run_config.output_fd.write(self._dumps_frontmatter(new_request, run_config)) + content = self._stream_completions_proc(response, run_config.output_fd) + else: + content = self._stream_completions_proc(response) + else: + content = response['choices'][0]['text'] + return CompletionsPrompt(content, new_request, run_config, base_path) + + async def _astream_completions_proc(self, response, fd: Optional[io.IOBase] = None) -> str: + # stream response to fd + content = "" + async for text in astream_completions(response): + if fd: + fd.write(text) + content += text + return content + + async def _arun_with_client( + self, client: OpenAIClient, + run_config: RunConfig, + new_request: dict, + stream: bool, + ) -> CompletionsPrompt: + response = await client.completions( + prompt=self._eval_data(run_config), + **new_request + ).acall() + new_request = self._filter_request(new_request, run_config) + base_path = Path(run_config.output_path).parent.resolve() if run_config.output_path else None + if stream: + if run_config.output_path: + # stream response to a file + with open(run_config.output_path, 'w', encoding='utf-8') as fout: + fout.write(self._dumps_frontmatter(new_request, run_config, base_path)) + content = await self._astream_completions_proc(response, fout) + elif run_config.output_fd: + # stream response to a file descriptor + run_config.output_fd.write(self._dumps_frontmatter(new_request, run_config)) + content = await self._astream_completions_proc(response, run_config.output_fd) + else: + content = await self._astream_completions_proc(response) + else: + content = response['choices'][0]['text'] + return CompletionsPrompt(content, new_request, run_config, base_path) + + def __add__(self, other: Union[str, CompletionsPrompt]): + # support concatenation with string or another CompletionsPrompt + if isinstance(other, str): + return CompletionsPrompt( + self.prompt + other, + copy.deepcopy(self.request), + replace(self.run_config), + self.base_path + ) + elif isinstance(other, CompletionsPrompt): + # merge two CompletionsPrompt objects + merged_request, merged_run_config = self._merge_non_data(other) + return CompletionsPrompt( + self.prompt + other.prompt, merged_request, merged_run_config, + self.base_path + ) + else: + raise TypeError(f"unsupported operand type(s) for +: 'CompletionsPrompt' and '{type(other)}'") + + def __iadd__(self, other: Union[str, CompletionsPrompt]): + # support concatenation with string or another CompletionsPrompt + if isinstance(other, str): + self.prompt += other + elif 
isinstance(other, CompletionsPrompt): + # merge two CompletionsPrompt objects + self.prompt += other.prompt + self._merge_non_data(other, inplace=True) + else: + raise TypeError(f"unsupported operand type(s) for +: 'CompletionsPrompt' and '{type(other)}'") + return self + diff --git a/src/handyllm/prompt_converter.py b/src/handyllm/prompt_converter.py index e847fd3..9f271ac 100644 --- a/src/handyllm/prompt_converter.py +++ b/src/handyllm/prompt_converter.py @@ -12,6 +12,12 @@ def split_pattern(self): # build a regex pattern to split the prompt by role keys return r'^\$(' + '|'.join(self.role_keys) + r')\$$' + def detect(self, raw_prompt: str): + # detect the role keys in the prompt + if re.search(self.split_pattern, raw_prompt, flags=re.MULTILINE): + return True + return False + def read_substitute_content(self, path: str): # 从文本文件读取所有prompt中需要替换的内容 with open(path, 'r', encoding='utf-8') as fin: diff --git a/tests/assets/magic.hprompt b/tests/assets/magic.hprompt new file mode 100644 index 0000000..b96271e --- /dev/null +++ b/tests/assets/magic.hprompt @@ -0,0 +1,32 @@ +--- +# you can run this script by `handyllm hprompt magic.hprompt` +meta: + # paths are relative to the parent dir of this file + credential_path: ../credential.yml + var_map_path: substitute.txt + output_path: tmp_out/%Y-%m-%d/result.%H-%M-%S.hprompt + output_evaled_prompt_path: tmp_out/%Y-%m-%d/evaled.%H-%M-%S.hprompt +model: gpt-3.5-turbo +temperature: 0.2 +stream: true +--- + +$system$ +You are a helpful assistant. + +$user$ +Please help me merge the following two JSON documents into one. + +$assistant$ +Sure, please give me the two JSON documents. + +$user$ +{ + "item1": "It is really a good day." +} +{ + "item2": "Indeed." +} +%output_format% +%variable1% +%variable2% \ No newline at end of file diff --git a/tests/credential.yml b/tests/credential.yml new file mode 100644 index 0000000..9e160b3 --- /dev/null +++ b/tests/credential.yml @@ -0,0 +1,2 @@ +api_key: xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx +organization: org-xxxxxxxxxxxxxxxxxxxxxxxx diff --git a/tests/test_hprompt.py b/tests/test_hprompt.py new file mode 100644 index 0000000..29a45fd --- /dev/null +++ b/tests/test_hprompt.py @@ -0,0 +1,28 @@ +from pathlib import Path + +from handyllm import hprompt + + +cur_dir = Path(__file__).parent + +# load hprompt +prompt: hprompt.ChatPrompt = hprompt.load_from(cur_dir / './assets/magic.hprompt') +print(prompt.data) +print(prompt) +print(repr(prompt)) + +# run hprompt +result_prompt = prompt.run() +print(result_prompt.result_str) + +# dump result hprompt +result_prompt.dump_to(cur_dir / './assets/tmp_out.hprompt') + +# chain result hprompt +prompt += result_prompt +# chain another hprompt +prompt += hprompt.load_from(cur_dir / './assets/magic.hprompt') +# run again +result2 = prompt.run() + +
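As a closing usage sketch, the `RunConfig` introduced in `hprompt.py` can also drive a run programmatically, mirroring what `cli.py` does; every field, function, and property below is defined in this diff:

```python
from handyllm import hprompt

run_config = hprompt.RunConfig(
    var_map={"%variable1%": "Note: be concise."},  # fills %variable1% in the prompt
    output_path="tmp_out/result.%Y%m%d-%H%M%S.hprompt",  # strftime-formatted by _prepare_output_path
    verbose=True,  # pretty-prints the effective config to stderr before running
)
prompt = hprompt.load_from("tests/assets/magic.hprompt")
result = prompt.run(run_config=run_config)
print(result.result_str)
```

The rough CLI equivalent, via the `handyllm` entry point registered in pyproject.toml: `handyllm hprompt tests/assets/magic.hprompt -o tmp_out/ -v --var-map "%variable1%=Note: be concise."`.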