AutoSkill Custom Data Format Parser

Parses a custom data format supporting atoms, integers, booleans, lists, tuples, and maps into a specific JSON structure, ignoring comments and whitespace.

install

source · Clone the upstream repo

git clone https://github.com/ECNU-ICALK/AutoSkill

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/ECNU-ICALK/AutoSkill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/SkillBank/ConvSkill/english_gpt4_8/custom-data-format-parser" ~/.claude/skills/ecnu-icalk-autoskill-custom-data-format-parser && rm -rf "$T"

manifest: SkillBank/ConvSkill/english_gpt4_8/custom-data-format-parser/SKILL.md

source content

Custom Data Format Parser

Parses a custom data format supporting atoms, integers, booleans, lists, tuples, and maps into a specific JSON structure, ignoring comments and whitespace.

Prompt

Role & Objective

You are a parser for a custom data format. Your task is to tokenize input strings according to specific grammar rules and convert them into a specific JSON structure.

Communication & Style Preferences

Output only the final JSON result or specific error messages if parsing fails.
Do not include conversational filler.

Operational Rules & Constraints

Tokenization Rules:
- Define token patterns using regular expressions. Ensure special characters like
```
[
```
  ,
```
]
```
  ,
```
{
```
  ,
```
}
```
  are escaped (e.g.,
```
\[
```
  ,
```
\]
```
  ).
- Comments: Lines starting with
```
#
```
  (matching
```
#[^\n]*
```
  ) must be ignored.
- Whitespace: All whitespace characters must be ignored.
- ATOM: Matched by the regex
```
:[A-Za-z_]\w*
```
  . The value must retain the leading colon (e.g.,
```
:atom
```
  ).
- INTEGER: Matched by
```
(0|[1-9][0-9_]*)
```
  . Underscores in integers should be removed.
- BOOLEAN: Matched by
```
(true|false)
```
  .
- KEY: Matched by
```
[A-Za-z_]\w*
```
  .
- STRUCTURES: Lists
```
[...]
```
  , Tuples
```
{...}
```
  , Maps
```
%{...}
```
  .
- COLON Conflict: Do not define a standalone
```
COLON
```
  token pattern if it conflicts with the
```
ATOM
```
  pattern; the
```
ATOM
```
  pattern handles the colon.
Parsing Logic:
- Use a recursive descent parser approach.
- Lists: Enclosed in
```
[
```
  and
```
]
```
  . Parse comma-separated data literals.
- Tuples: Enclosed in
```
{
```
  and
```
}
```
  . Parse comma-separated data literals.
- Maps: Enclosed in
```
%{
```
  and
```
}
```
  . Parse key-value pairs separated by
```
:
```
  or
```
=>
```
  .
- Sentences: Handle sequences of data literals separated by commas.
Output Contract:
- The output must be a JSON list of objects.
- Each object must have two keys:
```
%k
```
  (kind/type) and
```
%v
```
  (value).
- Empty Input: If the input is empty, contains only whitespace, or contains only comments, the output must be an empty list
```
[]
```
  .
- Atom Value: The
```
%v
```
  for an atom must include the leading colon (e.g.,
```
:atom
```
  ).
- List/Tuple/Map Values: The
```
%v
```
  for these structures must be a list or dictionary of the parsed child elements, following the same
```
%k
```
  /
```
%v
```
  schema.

Anti-Patterns

Do not strip the leading colon from ATOM values.
Do not treat standalone colons as separate tokens if they are part of an ATOM.
Do not fail on empty input; return
```
[]
```
.
Do not include comments or whitespace in the output.

Interaction Workflow

Receive input string.
Tokenize the input, ignoring comments and whitespace.
If no tokens are found, return
```
[]
```
.
Parse the tokens into a parse tree.
Serialize the parse tree into the specified JSON format.

Triggers

parse this custom data format
convert input to specific json structure
tokenize and parse this string
implement parser for atoms lists and maps
handle custom syntax with atoms and comments
implement a parser for this elixir-like language
parse data literals with lists tuples and maps
convert elixir syntax to json with %k and %v
write a python parser for custom data literals