Power-bi-agentic-development pbip
Expert guidance for the Power BI Project (PBIP) file format; project structure, cross-cutting operations (renames, forking), and PBIX extraction/conversion. Automatically invoke when the user mentions PBIP, PBIX, .pbip/.pbism/.platform files, or asks about "PBIP project structure", "PBIP vs PBIX", "thin report vs thick report", "rename a table", "cascade rename", "fork a PBIP project", "convert pbix to pbip", "extract pbix", "what files are in a PBIP", "PBIP encoding", "definition.pbir", or discusses project-level file structure and post-rename verification.
git clone https://github.com/data-goblin/power-bi-agentic-development
T=$(mktemp -d) && git clone --depth=1 https://github.com/data-goblin/power-bi-agentic-development "$T" && mkdir -p ~/.claude/skills && cp -r "$T/plugins/pbip/skills/pbip" ~/.claude/skills/data-goblin-power-bi-agentic-development-pbip && rm -rf "$T"
plugins/pbip/skills/pbip/SKILL.mdPBIP Project Format
PBIP (Power BI Project) is the developer-mode file format for Power BI. It decomposes a
.pbix binary into human-readable text files organized in folders, enabling source control, external editing, and multi-author collaboration.
General, critical guidance
- This skill covers project structure, not file editing. To modify TMDL files (semantic model), load the
skill. To modify PBIR JSON files (report), load thetmdl
skill -- or preferably use thepbir-format
CLI with thepbir
skill if available. Install withpbir-cli
oruv tool install pbir-cli
; check withpip install pbir-cli
. If the required skill is not loaded, ask the user to install the appropriate plugin before proceeding.pbir --version - PBIX is a black box; PBIP is transparent. PBIX is a single binary that cannot be diffed or edited externally. PBIP splits the same content into text files. Convert between them with File > Save As in PBI Desktop.
- Thick vs thin reports: A thick report bundles
+.Report/
in the same project (.SemanticModel/
usesdefinition.pbir
). A thin report hasbyPath
only, connecting to a remote model via.Report/
. Thin reports are preferred for managed/shared BI.byConnection - A project can contain multiple items. Multiple
and.Report/
folders can coexist. The.SemanticModel/
file is optional -- open.pbip
directly.definition.pbir - UTF-8 without BOM. All files must be saved as UTF-8 without BOM. A BOM prefix causes parse errors in some tools.
- Git line endings: PBI Desktop writes CRLF. Configure
orcore.autocrlf
in* text=auto
to normalize..gitattributes - 260-char Windows path limit. Use short root paths. Deep nesting of page/visual GUIDs can exceed this limit.
- PBI Desktop does not detect external changes. Close and reopen PBI Desktop after editing files externally.
- Rename cascades are cross-cutting. Renaming a table, measure, or column requires updating references in TMDL files, visual JSONs, report extensions, culture files, DAX queries, and diagram layouts. Missing even one location causes broken visuals or DAX errors.
- SparklineData metadata selectors embed Entity references in compact strings that do not follow the standard
JSON structure. Easy to miss.SourceRef.Entity - DAX query files exist in TWO locations:
and<Name>.SemanticModel/DAXQueries/
. Always check both during renames.<Name>.Report/DAXQueries/
Working with PBIX Files
A
.pbix file is a ZIP archive following the OPC (Open Packaging Convention) standard. It can be extracted with standard zip tools to inspect its contents or manually assemble a PBIP from the extracted files.
PBIX Internal Structure
Thick PBIX -- contains an embedded semantic model (
DataModel binary). The report and model are bundled together:
ThickReport.pbix (ZIP archive) +-- [Content_Types].xml # OPC manifest (UTF-8 with BOM) +-- Version # Power BI version string (UTF-16LE) +-- Settings # Query settings JSON (UTF-16LE) +-- Metadata # Creation timestamp JSON (UTF-16LE) +-- SecurityBindings # Binary (empty for new reports) +-- DataModel # <-- THIS MAKES IT THICK: binary ABF blob (opaque, not programmatically readable) +-- Report/ | +-- definition/ # PBIR report definition (modern PBIX) | | +-- report.json | | +-- pages/ | | +-- ... | +-- Layout # Legacy monolithic JSON (legacy PBIX, UTF-16LE) | +-- StaticResources/ # Themes, images
Thin PBIX -- no embedded model. Uses a
Connections file to reference a remote semantic model:
ThinReport.pbix (ZIP archive) +-- [Content_Types].xml # OPC manifest (UTF-8 with BOM) +-- Version # Power BI version string (UTF-16LE) +-- Settings # Query settings JSON (UTF-16LE) +-- Metadata # Creation timestamp JSON (UTF-16LE) +-- SecurityBindings # Binary (empty for new reports) +-- Connections # <-- Remote model reference (UTF-8 JSON, contains connection string) +-- Report/ | +-- definition/ # PBIR report definition (modern PBIX) | | +-- report.json | | +-- pages/ | | +-- ... | +-- Layout # Legacy monolithic JSON (legacy PBIX, UTF-16LE) | +-- StaticResources/ # Themes, images
A PBIX is thick if
DataModel exists in the ZIP; thin if it has Connections instead. The Report/ folder structure is the same in both cases. A PBIX will have either Report/definition/ (modern PBIR format) or Report/Layout (legacy format), not both.
Thick vs Thin PBIX
A thick PBIX contains a
DataModel entry -- a binary ABF (Analysis Services Backup) blob with the semantic model data and metadata baked in. A thin PBIX has no DataModel and instead has a Connections file (UTF-8 JSON) pointing to a remote semantic model. The DataModel binary cannot be deserialized programmatically -- thick PBIX semantic models are opaque.
Legacy vs Modern PBIX
Legacy PBIX files (pre-PBIR) store the report as a single
Report/Layout file encoded in UTF-16LE -- a monolithic JSON blob with nested JSON strings (e.g. config, filters, query are JSON-encoded strings inside the outer JSON). Modern PBIX files store the report in Report/definition/ using the PBIR JSON format with separate files per page and visual. Detect legacy format by checking for the Report/Layout entry in the ZIP.
Encoding
PBIX internal files use mixed encodings:
| File | Encoding |
|---|---|
, , | UTF-16LE |
| UTF-8 |
contents | UTF-8 |
(legacy) | UTF-16LE |
| UTF-8 with BOM |
, | Binary |
Mismatched encoding when reading or writing these files causes parse failures.
Extracting a PBIX
import zipfile from pathlib import Path pbix_path = Path("MyReport.pbix") output_dir = Path("MyReport_extracted") with zipfile.ZipFile(pbix_path, "r") as z: # Safety: validate no entries escape the target directory (Zip Slip protection) resolved_output = output_dir.resolve() for member in z.infolist(): member_path = (output_dir / member.filename).resolve() if not member_path.is_relative_to(resolved_output): raise ValueError(f"Zip entry escapes target: {member.filename}") z.extractall(output_dir) # Detect PBIX type is_thick = (output_dir / "DataModel").exists() is_legacy = (output_dir / "Report" / "Layout").exists() is_modern = (output_dir / "Report" / "definition" / "report.json").exists()
# Quick extraction via CLI unzip MyReport.pbix -d MyReport_extracted/ # Check contents without extracting unzip -l MyReport.pbix
Assembling a PBIP from an Extracted Thin PBIX
For thin PBIX files (no
DataModel), a PBIP can be assembled from the extracted contents:
- Extract the PBIX ZIP
- Create the PBIP folder structure:
MyReport/ +-- MyReport.pbip +-- MyReport.Report/ | +-- definition.pbir | +-- definition/ # Copy from extracted Report/definition/ | +-- StaticResources/ # Copy from extracted Report/StaticResources/ | +-- .platform - Create
:MyReport.pbip{ "version": "1.0", "artifacts": [ { "report": { "path": "MyReport.Report" } } ], "settings": { "enableAutoRecovery": true } } - Create
withdefinition.pbir
derived from the extractedbyConnection
file. TheConnections
file contains a JSON array with connection string details:Connections{ "$schema": "https://developer.microsoft.com/json-schemas/fabric/item/report/definitionProperties/2.0.0/schema.json", "version": "4.0", "datasetReference": { "byConnection": { "connectionString": "Data Source=powerbi://api.powerbi.com/v1.0/myorg/WorkspaceName;Initial Catalog=ModelName" } } } - Create
with a new.platform
GUID andlogicalId"type": "Report"
This only works for thin PBIX with modern PBIR format. Thick PBIX files require the semantic model to be handled separately (e.g. exported via XMLA/TOM, or deployed from a
model.bim or TMDL source). Legacy PBIX report content (Report/Layout) is not compatible with the PBIR definition/ structure.
PBIX vs PBIP
| Aspect | PBIX | PBIP |
|---|---|---|
| Format | Single binary file | Folder of text files |
| Source control | Not diff-friendly | Git-ready, human-readable diffs |
| Collaboration | Single author at a time | Multiple authors, merge-friendly |
| External editing | Not supported | VS Code, Tabular Editor, scripts |
| Deployment | File > Publish | Git integration, Fabric APIs, fabric-cicd |
| Data | Contains cached data | is gitignored; metadata only in Git |
| Convert | File > Save As > PBIP | File > Save As > PBIX |
Project Structure
The
.Report/ folder is required in a PBIP. The .SemanticModel/ folder is optional — a thin report has only .Report/ and points at a remote semantic model via definition.pbir byConnection.
The
.pbi/ subfolders in both items (localSettings.json, editorSettings.json, cache.abf, etc.) are all optional — they are per-user/per-machine runtime state generated by Power BI Desktop. A freshly authored PBIP from an external tool may not have any of them, and the project still opens fine in Desktop.
<ProjectName>/ +-- <Name>.pbip # Entry point (references .Report folder) — optional +-- .gitignore # Recommended; excludes .pbi/localSettings.json and cache.abf +-- <Name>.SemanticModel/ # OPTIONAL — absent for thin reports | +-- .pbi/ # All contents OPTIONAL, per-user runtime state | | +-- localSettings.json # User-specific (gitignored) | | +-- editorSettings.json # Editor settings (committed) | | +-- cache.abf # Data cache (gitignored) | | +-- unappliedChanges.json # Pending Power Query changes | | +-- daxQueries.json # DAX query view tab settings | | +-- tmdlscripts.json # TMDL view script tab settings | +-- definition.pbism # SM entry point (required if .SemanticModel exists) | +-- definition/ # TMDL format (preferred) — see tmdl skill | +-- model.bim # TMSL format (legacy alt to definition/, mutually exclusive) | +-- diagramLayout.json # SM diagram (no external edit) | +-- DAXQueries/ # .dax files from DAX query view | +-- TMDLScripts/ # .tmdl files from TMDL view | +-- Copilot/ # Copilot tooling metadata | +-- .platform # Fabric identity (displayName, logicalId) +-- <Name>.Report/ # REQUIRED | +-- .pbi/ # OPTIONAL runtime state | | +-- localSettings.json # User-specific (gitignored) | +-- definition.pbir # Report entry point (required for PBIR format) | +-- definition/ # PBIR format — see pbir-format skill | | +-- report.json | | +-- version.json | | +-- pages/ | | | +-- pages.json | | | +-- <page-slug>/ # See "Page folder naming" below | | | | +-- page.json | | | | +-- visuals/... | +-- report.json # PBIR-Legacy format (legacy alt to definition/) | +-- mobileState.json # Mobile layout (no external edit) | +-- semanticModelDiagramLayout.json # Diagram positions (table renames) | +-- CustomVisuals/ # Private custom visual metadata | +-- StaticResources/ | | +-- SharedResources/ # Base themes, shared resources | | | +-- BaseThemes/<name>.json # Resolution path: <report>/StaticResources/SharedResources/<item.path> | | +-- RegisteredResources/ # Custom themes, images, .pbiviz files | +-- DAXQueries/ # .dax files from report DAX query view | +-- .platform # Fabric identity
Page folder naming
Power BI Desktop uses opaque 20-character hex slugs for new page, visual, bookmark, and filter folders by default (e.g.
847663d71e27e0840063). Per Microsoft docs, these can be renamed to friendly names but the replacement must satisfy:
- Regex:
— word characters (letters, digits, underscore) or hyphen only.^[\w-]+$ - No spaces, no dots, no other punctuation. Names outside this set are silently ignored by Power BI Desktop and the page/visual vanishes from the loaded report. This is the hardest bug class to diagnose because there is no error dialog.
- Folder name and
field must match exactly (case-sensitive). The folder may be bare (name
) or suffixed (<slug>/
) — both forms are valid on disk. pbir-cli uses the<slug>.Page/
suffix in its CLI path syntax; current Desktop saves omit the suffix..Page
entries must reference the slug, not the display name.pages.json.pageOrder
must be one of the entries inactivePageName
.pageOrder- Restart Desktop after external rename. PBI Desktop does not detect file changes while open.
If you rename a page from a hex slug to a friendly name, you must also update every reference to the old slug in visual JSONs, filter configs, bookmarks, sparkline metadata, and DAX queries — see
references/rename-cascade.md.
SharedResources path resolution
Items listed in
resourcePackages[] are resolved relative to:
<Report>/StaticResources/<package_type>/<item.path>
For a
SharedResources package with an item { "path": "BaseThemes/Fluent2-CY26SU03.json" }, Power BI Desktop looks for:
<Report>/StaticResources/SharedResources/BaseThemes/Fluent2-CY26SU03.json
Missing resource files are a common blocking error. If
report.json declares themeCollection.baseTheme.type = "SharedResources" and points at a resource that doesn't exist on disk, the report will not open. validate_pbip.py checks this explicitly.
What to Read for Common Tasks
| Task | Read |
|---|---|
| Inspect or extract a PBIX file | Working with PBIX Files section above -- internal structure, thick vs thin detection, encoding, extraction, assembling a PBIP from extracted contents |
| Understand entry point file structure | -- , , , JSON structure, version properties, byPath vs byConnection |
| Rename a table, measure, or column | -- before/after examples for every cascade location. See also skill's for visual JSON patterns |
| Fork / duplicate a PBIP project | -- update path, byPath, logicalId and displayName |
| Work with Copilot tooling files | -- AI instructions, verified answers, schema, example prompts |
| Edit TMDL model files | skill -- syntax, authoring, column properties, naming conventions |
| Edit PBIR report files | skill -- visual.json, theme, filters, report extensions, page layout |
| Verify no broken references after rename | Grep commands below |
Forking a PBIP Project
- Copy the project folder -- duplicate the entire root folder with a new name.
- Rename artifact folders -- rename
and.Report/
subfolders to match the new project name..SemanticModel/ - Rename and update
-- rename the.pbip
file and update.pbip
to point to the renamedartifacts[].report.path
folder..Report - Update
-- if the report uses.pbir
, update the path to point to the renamedbyPath
folder..SemanticModel - Update
files -- set.platform
to the new project name in eachdisplayName
file. Regenerate.platform
(new GUID) if deploying as a separate Fabric item.logicalId
Verification
Two tools for validation, used together:
-
— project-level validator for cross-cutting concerns:scripts/validate_pbip.py
root file,.pbip
identity, semantic model format (TMDL vs TMSL),.platform
resolution, theme resource resolution on disk, orphan page folders, and the silent-ignore page name regex rule (page names outsidedatasetReference
are silently ignored by Power BI Desktop). Delegates deep^[\w-]+$
schema validation to.Report
if it is on PATH.pbir validatepython3 scripts/validate_pbip.py <path-to-.pbip-or-project> # validate python3 scripts/validate_pbip.py <path> --fix # scaffold .gitignore python3 scripts/validate_pbip.py <path> --json # machine-readable python3 scripts/validate_pbip.py <path> --no-pbir-cli # skip delegationExit codes:
clean,0
warnings only,1
errors,2
usage error.3 -
(from thepbir validate <Report.Report>
skill) — canonical JSON schema + PBIR structure validator for thepbir-cli
folder. Covers JSON syntax, schema compliance, required fields, and optional.Report
/--qa
/--fields
checks.--strict -
agent — use for interactive, LLM-driven checking of orphaned references after renames, when you need reasoning over the whole project rather than a deterministic report.pbip-validator
Known gotcha with
: if the project's pbir validate
.pbi/localSettings.json uses a schema version newer than the one bundled in pbir-cli, pbir validate returns SCHEMA_UNSUPPORTED. Pass --allow-download-schemas to let it fetch the missing schema on demand, or ignore .pbi/ files (they are per-user runtime state and not part of the committed definition).
After any rename or fork operation, verify no old references remain.
# Search for old name across all project files grep -r "Old Name" "Project.Report/" "Project.SemanticModel/" --include="*.json" --include="*.tmdl" --include="*.dax" # Search with word boundaries to avoid partial matches grep -rP "\bOld Name\b" "Project.Report/" "Project.SemanticModel/" # Look for old name in single-quoted DAX references grep -r "'Old Name'" --include="*.tmdl" --include="*.dax"
Common missed locations:
- SparklineData metadata -- compact string format outside standard JSON structure
- Conditional formatting expressions --
refs nested inEntityConditional.Cases - Filter config -- page-level and visual-level filters in
sectionsfilterConfig - Sort definitions --
blocks in visual JSONsortDefinition - DAX queries in Report folder -- the second DAX query location
- Culture file linguisticMetadata --
andConceptualEntity
inside embedded JSONConceptualProperty
Related Skills
Within this plugin:
-- TMDL syntax, authoring, and editing rules for directtmdl
file editing.tmdl
-- PBIR JSON format, visual.json, theme, filters, report extensionspbir-format
Other plugins:
plugin -- tooling and workflows for semantic model development (naming conventions, model quality). Use for working with the actual model content, not just its file format.semantic-models
plugin -- connecting to Power BI Desktop's local Analysis Services instance via TOM/ADOMD.NETpbi-desktop
plugin -- Tabular Editor CLI, C# scripting, BPA rules, documentation searchtabular-editor
References
Project structure:
-- Entry point file structures (references/pbip-file-types.md
,.pbip
,.pbir
,.pbism
),.platform
subfolder,.pbi/
,DAXQueries/
,TMDLScripts/
,model.bim
, version properties, JSON examples.gitignore
-- Copilot/ folder structure (AI instructions, verified answers, schema, example prompts)references/copilot-folder.md
Rename operations:
-- Detailed before/after examples for each rename cascade location (TMDL + report files)references/rename-cascade.md
Fetching Docs: To retrieve current PBIP reference docs, use
microsoft_docs_search + microsoft_docs_fetch (MCP) if available, otherwise mslearn search + mslearn fetch (CLI). Search based on the user's request and run multiple searches as needed to ensure sufficient context before proceeding.
External references: