tsfile-cli is a single, pipe-friendly C++ command-line tool for inspecting and
importing Apache TsFile (.tsfile) files from the shell — the TsFile analogue of
parquet-cli / pqrs. Read commands print data to stdout and diagnostics to
stderr, so they compose with awk, jq, sort, and friends; the write command
imports CSV/TSV into a new .tsfile. It is built on the public storage::TsFileReader
and storage::TsFileTableWriter APIs and does not modify the storage engine.
The CLI is part of the C++ module and is built by default (CMake option BUILD_TOOLS=ON).
The CMake target is tsfile_cli; the produced executable is named tsfile-cli.
Prerequisites: a C++11 compiler (GCC / Clang / MSVC) and CMake ≥ 3.11. The third-party
dependencies (ANTLR4, Snappy, LZ4, LZOKAY, Zlib, GoogleTest) are bundled under
cpp/third_party/ and built automatically — no separate install step needed.
Choose any one of the following.
1. Build script (recommended). From cpp/:
bash build.sh -t=Debug # -> cpp/build/Debug/bin/tsfile-cli
bash build.sh # Release (default) -> cpp/build/Release/bin/tsfile-cli
bash build.sh install # Release build, then run make install2. Maven (builds the whole C++ module). From the repository root:
./mvnw clean package -P with-cpp # -> cpp/target/build/bin/tsfile-cli3. Plain CMake. From cpp/:
mkdir -p build/Debug && cd build/Debug
cmake ../.. -DCMAKE_BUILD_TYPE=Debug
make -j tsfile_cli # -> build/Debug/bin/tsfile-cliCMake 4.x note. The bundled ANTLR4 runtime sets old CMake policies that CMake 4 rejects (
Policy CMP00xx may not be set to OLD behavior). The reader and CLI do not use ANTLR4, so disable it —--disable-antlr4for the build script, or-DENABLE_ANTLR4=OFFfor plain CMake:bash build.sh -t=Debug --disable-antlr4
Verify the binary:
./build/Debug/bin/tsfile-cli --version # -> tsfile-cli (Apache TsFile C++) <version>
./build/Debug/bin/tsfile-cli --helpThe executable links the tsfile shared library built alongside it. To run it from
anywhere, either run it in place by its full path, or explicitly install it with
bash build.sh install, cmake --install ., or make install. The install step places
the binary under <prefix>/bin and libtsfile under <prefix>/lib. The build script
does not install by default.
tsfile-cli <command> [options] <file.tsfile>
tsfile-cli --help | --version | help
Exit codes: 0 success, 1 usage/argument error, 2 file open/corrupt,
3 query/runtime error.
| Command | Description |
|---|---|
ls |
List devices (tree model) or tables (table model), one name per line |
schema |
Per-series target, measurement, datatype, encoding, compression |
meta |
File summary: model, device/table/series counts, time range, file size |
stats |
Per-series count, start_time, end_time, min, max, first, last, sum |
count |
Per-series row counts plus a total row (from statistics, no page scan) |
head |
First N rows (default 10; use -n) |
cat |
All matching rows, streamed (table format buffers to align columns) |
sample |
Reproducible reservoir sample (default 10; -n, --seed) |
The metadata commands (ls / schema / meta / stats / count) answer most questions
without decoding data pages.
Shared options:
| Option | Meaning |
|---|---|
-f, --format csv|tsv|json|table |
Output format; defaults to table on a TTY, tsv when piped |
-d, --device <id> / -t, --table <name> |
Scope to one device / table (mutually exclusive) |
-m, --measurements a,b,c |
Column projection (schema, stats, count, head, cat, sample) |
-n, --limit N / --offset N |
Max rows / rows to skip (head, cat; --offset not valid for sample) |
--start <ms> / --end <ms> |
Inclusive epoch-millisecond time range (head, cat, sample) |
--seed N |
Reproducible sampling seed (sample only) |
--tag-filter C OP V / --tag-between C L U / --tag-not-between C L U |
Table TAG predicate for head, cat, sample; OP is eq, neq, lt, lteq, gt, gteq, regexp, or not-regexp |
--no-header |
Omit the header row |
--model tree|table |
Force the model (otherwise auto-detected) |
json output is NDJSON (one object per line; numbers/booleans bare, other values quoted,
nulls as null; non-finite floats — NaN/Inf — become null). CSV output follows RFC 4180.
Timestamps are raw epoch milliseconds. The table format buffers all rows in memory to
align columns, so prefer csv/tsv/json when dumping large files.
BIN=cpp/build/Debug/bin/tsfile-cli
$BIN ls -f tsv data.tsfile # list tables / devices
$BIN meta data.tsfile # quick file overview
$BIN count -t table1 -f tsv data.tsfile # row counts, no page scan
$BIN cat -t table1 --tag-filter device eq dev_1 -m temp -f tsv data.tsfile
$BIN cat -m temp,humidity --start 1700000000000 -f csv data.tsfile | head
$BIN sample -m temp -n 20 --seed 42 -f json data.tsfile | jq .tsfile-cli write imports CSV/TSV rows into a new table-model .tsfile (the output is
overwritten). The first input column is the timestamp (epoch milliseconds); the remaining
columns are declared explicitly with --columns — there is no type inference.
Timestamps must be strictly increasing per device, where a device is identified by its
tag column values (rows that share the same tags form one device's timeline). Rows for
different tag combinations may freely interleave and reuse timestamps. Out-of-order input is
rejected with the offending line number, and a failed import leaves no output file behind.
--output must differ from the input file.
tsfile-cli write --table <name> --columns <spec> -o <out.tsfile> \
[-f csv|tsv] [--no-header] [--header-match] [-v] [<input> | -]
--columns is a comma-separated list of name:TYPE:category, where category (case-insensitive)
is tag or field and TYPE (case-insensitive) is one of BOOLEAN, INT32, INT64, FLOAT, DOUBLE, STRING, TEXT, TIMESTAMP, DATE, BLOB — for example --columns "id1:STRING:tag,s1:INT64:field".
DATE cells are written as YYYY-MM-DD; TIMESTAMP cells as epoch milliseconds. Each column is
stored with the engine's default encoding and compression for its type.
| Option | Meaning |
|---|---|
--table <name> |
Output table name (lower-cased) |
--columns <spec> |
Ordered data columns (excludes the leading timestamp column) |
-o, --output <path> |
Output .tsfile (required; overwritten) |
<input> / - |
Input file, or - / omitted for stdin |
-f csv|tsv |
Input delimiter (default csv; json / table are rejected) |
--no-header |
Input has no header row (default: first line is a header and is skipped) |
--header-match |
Validate header names against --columns |
-v, --verbose |
Print wrote N rows to <out> to stderr (otherwise silent on success) |
An empty cell is written as null. The command is silent on success (Unix-style); pass -v
for a one-line summary.
# round-trip through a pipe
printf 'time,id1,s1\n0,dev,0\n1,dev,10\n' \
| tsfile-cli write --table t1 --columns "id1:STRING:tag,s1:INT64:field" -o out.tsfile -
tsfile-cli count -f tsv out.tsfile # -> t1.dev s1 2For tree-model writes, JSON input, or programmatic use, use the C++ SDK directly — see
cpp/examples/cpp_examples/demo_write.cpp (TsFileTableWriter / TsFileWriter + Tablet).
cpp/tools/skills/tsfile-cli/SKILL.md is a machine-readable reference that teaches AI
coding assistants (e.g. Claude Code) how to drive tsfile-cli correctly. Such assistants
auto-discover skills from a .claude/skills/ directory at session start, so "installing"
the skill just means placing it there — either project-level or user-level:
# project-level (this repository only)
mkdir -p .claude/skills/tsfile-cli
cp cpp/tools/skills/tsfile-cli/SKILL.md .claude/skills/tsfile-cli/SKILL.md
# or user-level (available in all your projects)
mkdir -p ~/.claude/skills/tsfile-cli
cp cpp/tools/skills/tsfile-cli/SKILL.md ~/.claude/skills/tsfile-cli/SKILL.mdThe installed
SKILL.mdmust begin with its YAML front-matter (--- … ---) for the assistant to detect it. The in-repo copy carries an Apache license header comment above the front-matter; if discovery fails, delete that leading<!-- … -->block from the installed copy so---is the first line.
Start a new assistant session afterward. The skill then activates automatically when you
ask to inspect or import a .tsfile; you can also invoke it explicitly (e.g. "use the
tsfile-cli skill").
cpp/tools/
├── tools_main.cc # main(): forwards argv to run_cli
├── cli/ # argument parsing, top-level dispatch, exit codes
├── format/ # csv/tsv/json/table output + CSV/TSV input parsing
├── commands/ # one file per command + shared row-query / statistics helpers
└── skills/tsfile-cli/ # model-facing skill reference (for AI assistants)