~/guides/-blog-xml-vs-json-

guides · Formatting

XML vs JSON: When to Use Each Format in Modern APIs

A practical comparison of XML and JSON for APIs, configuration, and data exchange — with concrete guidance on when each format actually wins.

last updated · June 13, 2026by @vultio

XML did not die — and this comparison still matters

Every few years, a wave of blog posts declares XML dead. JSON won. Move on. And yet, in 2026, XML is quietly powering an enormous share of the systems that the internet actually runs on. SOAP web services used in banking and healthcare still exchange XML envelopes. RSS and Atom feeds are XML. SVG — every icon in your design system, every chart in your dashboard — is XML. Android resources, Maven build files, Gradle configurations, SAML authentication tokens, Office Open XML (the format behind every .docx and .xlsx file), and OpenDocument files are all XML. Enterprise integrations that have been running reliably for fifteen years are not getting rewritten because JSON is fashionable.

So the practical question for a developer in 2026 is not "which format won?" It is "when should I reach for each one?" JSON is the clear default for new REST APIs and browser-facing data. But there are specific domains and specific technical requirements where XML is genuinely the better choice — and confusing the two leads to real problems. This article gives you a concrete framework for making that call.

The fundamental difference: markup language vs. serialization format

The comparison is often framed as a syntax contest — angle brackets vs. curly braces — but the real difference is conceptual. XML is a markup language. It was designed to describe documents where the relationship between text and structure matters. An XML element can contain text, child elements, and more text interleaved — what the spec calls mixed content. Attributes carry metadata about elements. Namespaces let you combine vocabularies from multiple standards in a single document without name collisions. Comments are part of the format. The entire model reflects the original problem XML was built to solve: structured documents like HTML, legal contracts, and configuration files where ordering is semantically meaningful.

JSON is a data serialization format. It was designed to represent objects, arrays, strings, numbers, booleans, and null — the primitive types of JavaScript — in a form that is compact, easy to parse, and directly mappable to the data structures of most programming languages. JSON has no concept of attributes, namespaces, mixed content, or comments. It does not try to be a document format. It tries to be the simplest possible wire representation for structured data moving between systems. That simplicity is both its greatest strength and its ceiling.

The same data in both formats

The verbosity difference becomes concrete when you look at the same data structure expressed in both formats. Here is a product record with a nested array of size variants:

JSON

{
  "product": {
    "id": "SKU-8821",
    "name": "Trail Running Shoe",
    "price": 129.99,
    "inStock": true,
    "variants": [
      { "size": "US 9",  "color": "Slate", "qty": 12 },
      { "size": "US 10", "color": "Slate", "qty": 4  },
      { "size": "US 11", "color": "Slate", "qty": 0  }
    ]
  }
}

XML

<?xml version="1.0" encoding="UTF-8"?>
<product id="SKU-8821">
  <name>Trail Running Shoe</name>
  <price>129.99</price>
  <inStock>true</inStock>
  <variants>
    <variant size="US 9"  color="Slate" qty="12" />
    <variant size="US 10" color="Slate" qty="4"  />
    <variant size="US 11" color="Slate" qty="0"  />
  </variants>
</product>

The XML version is about 30% more characters for equivalent data. The difference grows with deeply nested structures because every element requires both an opening and closing tag. The JSON version also maps directly to a JavaScript object with no transformation — JSON.parse() returns something your code can use immediately. The XML version requires a parser that understands the DOM or an XML-to-object library. For simple data exchange, JSON wins on almost every practical metric.

Where JSON wins

JSON is the right default for the majority of new API work. The advantages are concrete, not just stylistic.

Native browser support

JSON.parse() and JSON.stringify() are built into every browser and every JavaScript runtime. There is no dependency to install, no parser to configure, no schema to compile. The round-trip from network response to usable object is a single function call. XML requires DOMParser or a third-party library, and the resulting DOM tree requires additional traversal code to extract values.

Smaller payloads over the wire

For data-centric APIs — lists of records, search results, configuration objects — JSON payloads are consistently smaller than their XML equivalents. At scale, across millions of requests per day, this difference in payload size translates directly into bandwidth costs and latency.

Simpler tooling and ecosystem

Every REST framework generates and parses JSON out of the box. OpenAPI specifications describe JSON APIs. Most logging systems, monitoring dashboards, and observability tools understand JSON natively. The developer tooling around JSON is vastly larger than what exists for XML in modern stacks.

Readability for developer-to-developer communication

When a team shares an API response in a pull request comment, a Slack message, or a bug report, JSON is easier to read and reason about at a glance. The cognitive overhead of matching opening and closing tags in XML is real, especially for engineers who do not work with XML regularly.

Where XML wins

XML's advantages are real but domain-specific. In each of these areas, the format provides capabilities that JSON simply does not have.

Document-centric data where ordering matters

If your data is fundamentally a document — a legal contract, a technical manual, a structured report with prose, tables, and annotations interleaved — XML is the right model. Mixed content (text and child elements in the same parent) is a first-class concept in XML. JSON has no equivalent.

Namespaces for combining standards

XML namespaces allow you to merge elements from different vocabularies in the same document without name collisions. SOAP envelopes, for example, combine the envelope namespace, the encoding namespace, and the application-specific namespace in one document. JSON has no standard mechanism for this.

XSLT for transformations

XSLT is a complete transformation language built specifically for XML. You can take an XML document and produce HTML, a different XML structure, plain text, or any other output format using a stylesheet. There is no equivalent in the JSON ecosystem — transformations require general-purpose code.

XPath for querying

XPath provides a standardized, expressive query language for selecting nodes within an XML document. Libraries like lxml in Python or Saxon in Java implement XPath efficiently. JSON has JSONPath, which is less standardized and less widely supported.

Strong schema validation with XSD

XML Schema Definition (XSD) provides precise, strongly typed schema validation including complex type inheritance, occurrence constraints, and pattern-based string validation. XSD validation is mature, well-tooled, and commonly required in enterprise integrations. JSON Schema is improving but remains less expressive in certain areas.

Digital signatures and security standards

SOAP and WS-Security provide XML-native digital signatures that cover specific parts of a message. This is a requirement in many regulated industries — healthcare, financial services, government — where the message must be signed and verifiable. These standards have no JSON equivalent in widespread enterprise use.

Domains where XML is dominant

Some technical domains have standardized on XML and are unlikely to change. If you are working in any of these areas, XML is not a choice — it is a requirement.

Domain	Format / Standard
Enterprise web services	SOAP / WSDL
Android app resources	Android XML manifests and layout files
Build systems	Maven pom.xml, legacy Gradle
Syndication feeds	RSS 2.0, Atom
Vector graphics	SVG
Office documents	Office Open XML (.docx, .xlsx, .pptx)
Open documents	OpenDocument Format (.odt, .ods)
Federated authentication	SAML 2.0
Publishing	DocBook, DITA, TEI

For all of these, the question of "should I use XML?" is already answered. The question becomes how to work with XML effectively — which is where a good formatter and validator saves significant time. The XML Formatter handles indentation, syntax highlighting, and validation so you can inspect and debug XML payloads without fighting the raw markup.

The attribute vs. element debate in XML

One of the most common design mistakes in XML is misusing attributes and child elements. Both can represent data, which leads to inconsistent schemas where some values are attributes and others are child elements for no principled reason. The conventional rule is:

Use attributes for metadata about the element

Attributes describe properties of the element itself — its identifier, its version, its type. In the product example above, id="SKU-8821" as an attribute is appropriate because it identifies the element, not the data it contains.

Use child elements for data that belongs to the document

The product name and price are content — they belong as child elements, not attributes. The practical reason is that attributes cannot contain child elements, cannot have their own attributes, and cannot appear more than once per element. If a field might ever become complex, use a child element.

<!-- Avoid: burying data in attributes -->
<product id="SKU-8821" name="Trail Running Shoe" price="129.99" />

<!-- Prefer: attributes for identity, elements for content -->
<product id="SKU-8821">
  <name>Trail Running Shoe</name>
  <price currency="USD">129.99</price>
</product>

Performance: parsing speed and payload size

JSON parsing is measurably faster than XML parsing in browser environments. The V8 engine (Chrome, Node.js) and SpiderMonkey (Firefox) have highly optimized native JSON parsers. XML parsing requires constructing a DOM tree, which involves more allocations, more traversal, and more memory overhead for equivalent data. For browser-heavy workloads — dashboards, single-page apps, mobile web — this difference is meaningful.

For server-to-server communication, the difference is less significant. Mature XML libraries like libxml2 (used in Python, PHP, Ruby, and many others) and Saxon parse XML very efficiently, and at the scale of a single inter-service call the parse time is rarely the bottleneck. Payload size is still a factor — XML overhead is real over millions of requests — but the raw parse speed argument is weaker in a pure server context than it is in the browser.

The hybrid approach: content negotiation

Some APIs serve both formats and let the client declare which it prefers using the HTTP Acceptheader. This is called content negotiation and it is a legitimate pattern for APIs that need to serve both modern JavaScript clients and legacy enterprise consumers.

# Request JSON
GET /api/products/SKU-8821
Accept: application/json

# Request XML
GET /api/products/SKU-8821
Accept: application/xml

Content negotiation adds server complexity — you need to serialize the same data model into both formats — but it is valuable when your API needs to bridge ecosystems. Spring Boot, ASP.NET Core, and most enterprise Java frameworks support content negotiation out of the box with minimal configuration. If you are building a new API that only has JavaScript and mobile clients, skip it and default to JSON. If you are building an API that enterprise systems will integrate with via SOAP-like tooling, consider offering XML.

Decision guide: which format to choose

The choice is rarely ambiguous once you map your situation to the criteria below.

Use JSON when building a new REST API

If you are designing an API from scratch and your primary consumers are JavaScript applications, mobile apps, or other modern services, use JSON. It is the default that your framework, your tooling, your documentation tools, and your clients all expect. Choosing XML here creates unnecessary friction with no compensating benefit.

Use JSON when working with JavaScript on either side

If the producer or consumer is JavaScript — Node.js on the server, a React app in the browser — JSON is the native format. The parse and serialize operations are zero-dependency and optimized. Any XML you introduce requires an additional library and an additional transformation step.

Use XML when integrating with enterprise systems

If you are integrating with an existing SOAP service, a banking API, a healthcare system, or any enterprise integration platform, the format will be XML regardless of your preference. Learn to work with it effectively rather than fighting it.

Use XML when the domain already uses it

Android resources, Maven configuration, RSS feeds, SVG, Office documents — if the domain has standardized on XML, use XML. Attempting to convert these to JSON typically means abandoning the entire toolchain (validators, transformers, editors) that the community has built around the format.

Use XML when you need document semantics

If your data is fundamentally a document — with mixed content, ordered elements, or transformation requirements — use XML. A technical specification, a contract, a structured report are all better modeled as XML documents than as JSON objects.

Use XML when schema validation is a hard requirement

If the data must be validated against a strict, typed contract before processing — as is common in regulated industries — XSD provides more mature and more expressive validation than JSON Schema. The tooling around XSD validation in Java, .NET, and Python is more battle-tested for high-stakes validation scenarios.

When you are working with either format and need to inspect, validate, or clean up the structure, the XML Formatter and JSON Formatter handle pretty-printing, syntax validation, and error highlighting for both. Neither format is going anywhere — knowing when each one is the right tool is the skill worth developing.