Multiple UV Sets: no
Texture Transform: no
ClipSets (multiple animations): no
Transparency: partial
Multiple UV Sets: yes
Texture Transform: broken / inverted
ClipSets (multiple animations): no
Transparency: specific USD version
Multiple UV Sets: yes
Texture Transform: yes
Transparency: not fully supported
Coordinate Systems are at the same time very easy to explain and very hard to get right in practical terms.
In contrast to common belief that there are only two coordinte systems, "left-handed" and "right-handed", there are way more –
The up axis can be positive or negative (in practice most often positive Y or positive Z);
the right axis can be positive or negative (in practice positive X or negative X);
the forward axis can point into or out of the screen (often +/-Y or +/-Z).
Right-handed means when you place an imaginary screw perpendicular to both X and Y and then rotate X axis to Y axis, the screw goes in the direction of Z axis.
Often, coordinate system conversions are "avoided" by rotating objects around 90 or 180 degrees or introducing rotated/flipped root objects; this is incorrect and most often leads to roundtrip errors. A better way is converting coordinates into the right system, which may also involve converting UV data, animation data, textures, cameras and lights.
This defines a world coordinate system to place geometry in. However, there are more coordinate systems to think about:
- texture coordinates may start at the bottom left or the top left
- cameras may point along positive or negative Y or Z ("forward" or "backward")
- lights are often, but not always, defined the same way as cameras (which axis is "forward")
- some applications have an opinion on the correct axis for bones (e.g. which axis is "along the bone")
To make matters worse, there are varying definitions on what "forward" means in any given application, completely independent from its coordinate system convention. This has most often to do with the placement of a "default viewer camera" in that application – it can point along forward (so that by default you look at the back of an object) or it can point along backward (so that by default you look at the forward-facing side of an object).
This makes conversion between applications especially challenging if those applications have implicit, non-user-definable camera positions, such as pure 3D viewers like model-viewer, or AR viewers like QuickLook (Apple iOS AR) or SceneViewer (Google AR). If those default camera views point in different directions than a conversion has to introduce a 180° rotation, even when that rotation means the actual data isn't strictly converted anymore.
USD: Y-up X-right Z-forward or Z-up glTF: Y-up X-left Z-forward FBX: by default Y-Up axis system, right handed
- very configurable which has led to some applications (e.g. Blender) allowing to flip axis arbitrarily on import Unity: Y-up X-right Z-forward
Software / Standard | Default System | Handedness |
---|---|---|
3ds Max | Z-up | right-handed |
Blender | Z-up | right-handed |
Bullet/Ammo.js | Y-up | right-handed |
Cinema4D | Y-up | left-handed |
DirectX | Y-up | left-handed |
Houdini | Y-up | right-handed |
Maya | Y-up | right-handed, configurable to Z-up right-handed |
OpenGL/WebGL/glTF | Y-up | right-handed |
Three.js | Y-up | right-handed |
Unity | Y-up | left-handed |
Unreal Engine | Z-up | left-handed |
Verge3D | Y-up | right-handed |
WebGPU | Y-up | left-handed |
USD | Y-up or Z-up | right-handed |
Starts top left or bottom left
UV is a per-vertex attribute.
It is usually used for texturing - defining which parts of a texture end up on which part of the model.
There's a distinction between non-overlapping UVs (for example for lightmaps or when painting on a model) and overlapping UVs (multiple parts of the model get the same section of the texture applied, for example with trim sheets).
Vertices can store zero, one, or more sets of UV coordinates. Commonly one or two are supported, with some formats supporting 8 or more. A typical case is storing UVs for regular texturing in the first channel, and having a second set of UVs in the second channel for lightmaps or other cases.
UV ranges outside 0..1 are sometimes used:
- to distinguish between materials (also called UDIMs) - e.g. range 0..1 is material A, range 1..2 is material B
- for tiling textures - the texture repeats over the surface of the model
- for texture arrays - 0..1 is index 0, 1..2 is index 1, can be used both in X and Y
Wrap Modes specify what happens with a texture when accessing data outside its normalized 0..1 range. Typically, formats support at least Clamp and Repeat. Sometimes, there's also Mirror. Often wrap modes can be specified separately for U and V directions.
Wrap modes are especially important to get right for textures with mimaps where opposite edges look different. A good example is a grass texture — typically it will be transparent at the top and opaque at the bottom, and repeating in X. So it would have wrap modes U: Repeat and V: Clamp.
Default orientation
Light values / real-world or not
Falloff
Inner and outer radius
Punctual light sources: Spot, Directional, Point
Image-Based Lighting
Ambient Lights
Area, Tube, Spherical
IES profiles
Realtime (usually shadow maps)
Baked (precalculated)
Hard vs. Soft shadows
Very soft shadows
Shadow cascades
Default orientation
Default look direction - may be different between viewers, not just file formats
Some viewers by default look at the forward-facing side of a model, some viewers look along the forward direction.
For example, while coordinate systems match between glTF and USD, the default view direction is flipped.
Software | Default camera | Identity camera | Model forward |
---|---|---|---|
Blender | Default camera looks along +Y | Identity camera looks along -Z | Model forward is -Y |
Unity | Default camera looks along +Z | Identity camera looks along +Z | Model forward is +Z |
three.js | Default camera looks along -Z | Identity camera looks along -Z | Model forward is +Z |
Many models require soft parts and deformations. To achieve this, a technique commonly used is "rigging" (creating bone objects with a hierarchical relationship, also called skeleton) and "skinning" (defining how bones affect the model).
Bones usually have a bind pose (the orientation they have in relation to vertices) and vertices store bone weights (which bones influence them and how much). Typically, for a deformable model (such as a human hand) each vertex is affected by multiple bones, while for a rigid model (such as a robot) each vertex may only be affected by a single bone, while some wires may be affected by multiple.
Some softwares are opinionated about which axis a bone must be aligned on (which axis is forward, which axis is up). This often leads to difficulties when exchanging already rigged data between softwares, especially when constraints (kinematic chains) are used.
Most file formats allow storing multiple skinned meshes in one file. Support for im- and exporting such data varies and can cause problems.
Usually 1, 2, 4, 8 or more bones can influence a single vertex (has a performance impact). Sometimes this is configurable (for example in Unity).
Since skinned meshes are usually driven by bones, their own transformation doesn't / shouldn't matter. Despite this, transforming the skinned mesh itself can lead to culling errors or lighting problems.
Some formats enforce bones to all be hierarchical children of the skinned mesh, others allow arbitrary hierarchies.
In some cases, deformations are too complex to represent them with bones. One example of such complex deformations are animated faces.
To represent them, additional data can be stored per vertex that defines additional vertex positions. These are called blend shapes, shape keys, morph targets or similar.
Blend shapes can usually store additional copies of existing per-vertex data, such as positions, normals, or tangents. Some formats also allow storing color data or UV data as blend shapes. Usually, data is stored additively — that means only the difference in data to the base mesh is stored. Often, only a few vertices differ between shapes (for example, a small wrinkle in a face), so only saving the difference reduces storage. This is called sparse data in some formats.
Some formats (e.g. glTF) define that viewers must recalculate normal and tangent data when missing, which means often only vertices need to be stored.
Blend shapes can and often are combined with bone animations. Typical cases include humanoid meshes with rigged heads (neck/skull/yaw/eyes) and shapes for facial expression.
As blend shapes are interpolated linearly between each other, some applications support in-between shapes, so that a series of shapes can be driven by a single value. This can always be baked down to individual blend shapes, but is nicer to author/animate. Alternative name: progressive morphing (3ds Max)
In-between shapes are supported in: FBX, USD, Unity, Maya, 3ds Max
They are not supported in: glTF, Blender, Unreal
- FACS
- Apple
- iClone
Viewers support a varying amount of blend shapes. Some only support 4 or 8, which is usually not sufficient for complex facial animation. Many viewers support a large number (hundreds) of blend shapes.
Blend shapes are efficient to calculate on the GPU but rather expensive on the CPU, so some operations are slow. This includes
culling / bounding box calculations, or finding intersections with meshes with blend shapes.
Usually, blend shapes are defined and used with normalized weights 0..1, but technically weights can be any number (both positive and negative).
Viewers that support only a limited amount of blend shapes have varying strategies for which targets get used. Some just take the first N shapes, others sort and take the first N shapes applying to each vertex.
- FBX: AnimStacks + AnimLayer + AnimCurveNode (AnimLayer rarely used / usually only one)
- AnimLayer can blend between multiple animations (BlendMode/AccumulationMode)
- animations can target pretty much anything
- Blender: NLA strips + actions + F-Curves
- glTF: animations + channels
- USD: valueClips/clipSets + timeSamples on attributes (without clip sets, only one animation is supported)
- Unity: AnimationClip + AnimationCurve (Animators can blend between multiple AnimationClips)
- Unreal: AnimSequence + AnimCurves
- three.js: AnimationClip + KeyframeTrack (Mixers can blend between multiple AnimationClips)
- Unity: per keyframe – constant, linear, freeform (configured via in/out tangents)
- glTF: per channel – constant, linear, cubic
- three.js: per AnimationTrack – Discrete (constant), Linear, Smooth (Cubic), can create custom interpolators
- Blender: per keyframe – constant, linear, bézier (configured via in/out tangents)
- USD: per attribute – held (constant) or linear
- FBX: per keyframe – constant, linear, cubic, freeform, various auto modes, in/out velocity per keyframe
Arranging animations on a timeline
Blending/Mixing of animations
Animation layers
- Unity: PlayableDirector
- glTF: no
- FBX: no
- USD: clips on a stage
State management is often used in interactive applications. For example, a UI element might have states such as "normal", "hovered", "clicked" and so on, that each have their own animation data to apply to the UI element graphics. Another example are characters, where a controlled character might have states such as "idle", "walk", "run", "walk left" and so on, which can be mixed depending on user input or machine input.
- Unity: Animator. States can have additional code and logic executed on enter/exit/stay.
- Blend Trees provide additional blending capabilities beyond transitioning from one state to another.
- Each state has a choice of "Write Defaults" or not. Basically, this collects all animated objects of the entire Animator at start and sets their "at rest" values when a state does not have an opinion for some objects animated by other states.
- three.js: AnimationMixer / PropertyMixer. States and transitions are handled manually, states can be blended together.
- Each property is reset after a binding stops. Effectively, this has the same result as "Write defaults".
- Blender: no
- USD: no
- FBX: FBXAudioLayer
- USD: UsdAudio (supported in QuickLook, not supported in usdview)
- glTF: KHR_audio (not ratified)
- Blender: audio clips / tracks
- Unity: AudioSource / AudioClip
- three.js: PositionalAudio / Audio
Object space vs. tangent space
Tangent space explicit (saved in the file) or implicit (caluclated)
MiKKtSpace vs. other tagent space definitions
XYZ format vs. DXT5nm format (1X1Y) (Unity) vs. XXXY format (toktx)
FBX: partially, by naming convention ("LOD_X") USD: no glTF: yes, via MSFT_lod extension
FBX:
- bones are regular nodes and part of the hierarchy
- bones can
USD: arbitrary number of skeletons per file
- Skeleton / SkelRoot / SkelAnim
- bones form a virtual hierarchy (they are not nodes in the scene hierarchy) and need to be animated separately
- thus they must be virtual childs of their SkelRoot
- nodes can't be parented to bones
- virtual hierarchy is defined by string arrays (e.g.
["Bone1/Bone2", "Bone3", "Bone4/Bone5"]
)
glTF: arbitrary number of skeletons per file
- bones are regular nodes and part of the hierarchy
- bones can be used by multiple skeletons
- bones don't need to be parented to their skeleton or each other
USD: UsdLux
lights with a primary axis emit along -Z no specification for what the power unit actually is Lights are treated as having a size (default diameter 1 unit), with an extra attribute treatAsPoint
- UsdLuxCylinderLight
- UsdLuxDiskLight
- UsdLuxDistantLight
- UsdLuxDomeLight
- UsdLuxRectLight
- UsdLuxSphereLight
- MeshLightAPI/VolumeLightAPI for turning prims into lights (emissive meshes)
- Decay: physical (inverse square)
- control over shadow behaviour with UsdLuxShadowAPI
- light color can be specified
glTF: KHR_lights_punctual
an untransformed light points down the -Z axis The light's transform is affected by the node's world scale, but all properties of the light (such as range and intensity) are unaffected Lights are treated as punctual (zero size).
- directional (lux (lm/m2))
- point (candela (lm/sr))
- distance = 0 is not allowed, undefined should result in infinite distance
- spot (candela (lm/sr))
- inner/outer cone angle
- Decay: physical (inverse square) and additional falloff with a "range" parameter
- Shadows: not defined in the format, up to the application
- light color can be specified
FBX: FBXLight
- Point
- Directional
- Spot
- Area (Rectangle, Sphere)
- Volume
- DecayType: None, Linear, Quadratic, Cubic and additional near/far attenuation
- Shadows: can be defined
- light color can be specified
Blender:
- Point
- Distance = 0 turns the light off
- Sun (Directional)
- Spot
- Area
Unity:
- Point
- Distance = 0 turns the light off
- Directional
- Spot
- Area (baked only in URP + BiRP, realtime in HDRP)
- Lights are effectively used as unitless in URP + BiRP, and with physical units in HDRP (e.g. Directional Light as sun: intensity 100.000)
- Unity imports FBX lights incorrectly in URP – they end up in the wrong color space (need a gamma > linear step).
- Unity imports FBX lights incorrectly in BiRP too, and also emissive colors are wrong (so they match but are both wrong)
Links:
- KhronosGroup/glTF-Blender-IO#564
- KhronosGroup/glTF#2214 (comment)
- Spreadsheet by Ben Houston for light values across engines
- Switching Godot to physical light units
- FBX Docs regarding Light Intensity
- FBX lights in FBX2glTF and this
- FBX lights in three.js
- glTF light export in Blender
- PBR material model in core
- material extensions allow for complex and well-defined shading
- no node-based shaders (KHR_procedurals in the works)
- unlit material support via extension
- UsdPreviewSurface
- fixed definition of surface materials
- only "opacity" which functions as transmission
- can choose between "opacity" and "masked, not combine them
- relationship between opacity and emission not clearly defined
- UsdShade
- node-based shading networks
- UsdShade can contain MaterialX data
- only "Phong" and "Lambert" surface models, no support for "Unlit" materials
- Materials are often exporter-specific (e.g. Maya, 3ds Max write materials very differently with app-specific data)
- Material importers can often import the FBX material descriptions from multiple exporters (e.g. Maya-specific, 3dsmax-specific)
- Unity has postprocessors for MaterialDescriptions to read more data than Unity supports, and modify generated materials from it
Blender adds an empty node called "Armature" on export.
This is problematic when a FBX file has been created e.g. in Maya (exports without such a root node).
Blender has options what to do with the root (Null, Root, LimbNode) but neither of those skips the object.
Unreal removes "Armature" root nodes by default to counter Blender behaviour.
Unity does not remove "Armature" root nodes and doesn't have an option for it.
Blender Docs: https://docs.blender.org/manual/en/3.3/addons/import_export/scene_fbx.html
FBX bones seems to be -X aligned, Blender’s are Y aligned
-
import skinned + animated glTF file into Blender, make changes, export again
- need to be careful with bone orientations, Blender likes flipping into its own format but doesn't allow flipping back
- working in Blender is a bit annoying as bones look "wrong"
-
import skinned + animated FBX file into Blender, make changes, export again
- hierarchy will change if original FBX wasn't created in Blender
- impossible to use in external tools since that extra hierarchy node breaks existing animation clips in other softwares (e.g. Unity)
- Unreal seems to have fixed this on an import level (by skipping explicitly the "Armature" node if it exists)
- people have went as far as making modifications to the default exporter in custom ones to allow skipping this (not tested)
-
bake lightmaps in Blender that take Filmic into account and export to other softwares (bake the look transform in)
- image editor has an option to bake LUT into textures