The Fx Special Effects Image Operator • The Anatomy of an Fx Expression
Use the Fx special effects image operator to apply a mathematical expression to an image or image channels. Use Fx to:
- create canvases, gradients, mathematical colormaps
- move color values between images and channels
- translate, flip, mirror, rotate, scale, shear and generally distort images
- merge or composite multiple images together
- convolve or merge neighboring pixels together
- generate image metrics or 'fingerprints'
The expression can be simple:
convert -size 64x64 canvas:black -channel blue -fx "1/2" fx_navy.png
Here, we convert a black to a navy blue image:
Or the expression can be complex:
convert rose.jpg \ -fx "(1.0/(1.0+exp(10.0*(0.5-u)))-0.006693)*1.0092503" \ rose-sigmoidal.png'
This expression results in a high contrast version of the image:
The expression can include variable assignments. Assignments, in most cases, reduce the complexity of an expression and permit some operations that might not be possible any other way. For example, lets create a radial gradient:
convert -size 70x70 canvas: \ -fx "Xi=i-w/2; Yj=j-h/2; 1.2*(0.5-hypot(Xi,Yj)/70.0)+0.5" radial-gradient.png'
The command above returns this image:
This FX expression adds random noise to an image:
convert photo.jpg -fx \'iso=32; rone=rand(); rtwo=rand(); \ myn=sqrt(-2*ln(rone))*cos(2*Pi*rtwo); myntwo=sqrt(-2*ln(rtwo))* \ cos(2*Pi*rone); pnoise=sqrt(p)*myn*sqrt(iso)* \ channel(4.28,3.86,6.68,0)/255; max(0,p+pnoise)\' noisy.png
See Using FX, The Special Effects Image Operator for more examples.
The next section discusses the Fx expression language.
The Anatomy of an Fx Expression
The Fx Expression Language
The formal Fx expression language is defined here:
- numbers:
- integer, floating point, scientific notation (+/- required, e.g. 3.81469e-06), International System number postfixes (.e.g KB, Mib, GB, etc.)
- constants:
- E (Euler's number), Epsilon, QuantumRange, QuantumScale, Opaque, Phi (golden ratio), Pi, Transparent
- Fx operators (in order of precedence):
- ^ (power), unary -, *, /, % (modulo), +, -, <<, >>, <, <=, >, >=, ==, !=, & (bitwise AND), | (bitwise OR), && (logical AND), || (logical OR), ~ (logical NOT), ?: (ternary conditional)
- math functions:
- abs(), acos(), acosh(), airy(), alt(), asin(), asinh(), atan(), atanh(), atan2(), ceil(), clamp(), cos(), cosh(), debug(), drc(), exp(), floor(), gauss(), gcd(), hypot(), int(), isnan(), j0(), j1(), jinc(), ln(), log(), logtwo(), max(), min(), mod(), not(), pow(), rand(), round(), sign(), sin(), sinc(), sinh(), sqrt(), squish(), tan(), tanh(), trunc()
- channel functions:
- channel(r,g,b,a), channel(c,m,y,k,a)
- color names:
- red, cyan, black, etc.
- color functions:
- srgb(), srgba(), rgb(), rgba(), cmyk(), cmyka(), hsl(), hsla(), etc.
- color hex values:
- #ccc, #cbfed0, #b9e1cc00, etc.
- symbols:
u
=> first image in listv
=> second image in lists
=> current image in list (for %[fx:] otherwise = u)t
=> index of current image (s) in listn
=> number of images in listi
=> column offsetj
=> row offsetp
=> pixel to use (absolute or relative to current pixel)w
=> width of this imageh
=> height of this imagez
=> channel depthr
=> red value (from RGBA), of a specific or current pixelg
=> green ''b
=> blue ''a
=> alpha ''o
=> opacity ''c
=> cyan value of CMYK color of pixely
=> yellow ''m
=> magenta ''k
=> black ''intensity
=> pixel intensityhue
=> pixel huesaturation
=> pixel saturationlightness
=> pixel lightnessluma
=> pixel lumapage.width
=> page widthpage.height
=> page heightpage.x
=> page x offsetpage.y
=> page y offsetresolution.x
=> x resolutionresolution.y
=> y resolutiondepth
=> image depthminima
=> image minimamaxima
=> image maximamean
=> image meanstandard_deviation
=> image standard deviationkurtosis
=> image kurtosisskewness
=> image skewness (add a channel specifier to compute a statistic for that channel, e.g. depth.r)
The Fx Expression
An Fx expression may include any combination of the following:
- x
^
y - exponentiation (xy)
-
(
...)
- grouping
- x
*
y - multiplication (the asterisk
*
is optional, for example,2u
or2(x+y)
are acceptable) - x
/
y - division
- x
%
y - modulo
- x
+
y - addition
- x
-
y - subtraction
- x
<<
y - left shift
- x
>>
y - right shift
- x
<
y - boolean relation, return value 1.0 if x < y, otherwise 0.0
- x
<=
y - boolean relation, return value 1.0 if x <= y, otherwise 0.0
- x
>
y - boolean relation, return value 1.0 if x > y, otherwise 0.0
- x
>=
y - boolean relation, return value 1.0 if x >= y, otherwise 0.0
- x
==
y - boolean relation, return value 1.0 if x == y, otherwise 0.0
- x
!=
y - boolean relation, return value 1.0 if x != y, otherwise 0.0
- x
&
y - binary AND
- x
|
y - binary OR
- x
&&
y - logical AND connective, return value 1.0 if x > 0 and y > 0, otherwise 0.0
- x
||
y - logical OR connective (inclusive), return value 1.0 if x > 0 or y > 0 (or both), otherwise 0.0
-
~
x - logical NOT operator, return value 1.0 if not x > 0, otherwise 0.0
-
+
x - unary plus, return 1.0*value
-
-
x - unary minus, return -1.0*value
- x
?
y - z: ternary conditional expression, return value y if x != 0, otherwise z; only one ternary conditional permitted per statement
- x
=
y - assignment; assignment variables are restricted to letter combinations only (e.g. Xi not X1)
- x
;
y - statement separator
-
phi
- constant (1.618034...)
-
pi
- constant (3.141659...)
-
e
- constant (2.71828...)
-
QuantumRange
- constant maximum pixel value (255 for Q8, 65535 for Q16)
-
QuantumScale
- constant 1.0/
QuantumRange
-
intensity
- pixel intensity; equivalent to 0.299*red+0.587*green+0.114*blue
-
hue
- pixel hue
-
saturation
- pixel saturation
-
lightness
- pixel lightness; equivalent to 0.5*max(red,green,blue) + 0.5*min(red,green,blue)
-
luminance
- pixel luminance; equivalent to 0.212656*red + 0.715158*green + 0.072186*blue
-
red, green, blue
, etc. - color names
-
#ccc, #cbfed0, #b9e1cc00
, etc. - color hex values
-
rgb(), rgba(), cmyk(), cmyka(), hsl(), hsla()
- color functions
-
s, t, u, v, n, i, j, w, h, z, r, g, b, a, o, c, y, m, k
- symbols
-
abs(
x)
- absolute value function
-
acos(
x)
- arc cosine function
-
acosh(
x)
- inverse hyperbolic cosine function
-
airy(
x)
- Airy function (max=1, min=0); airy(x)=[jinc(x)]2=[2*j1(pi*x)/(pi*x)]2
-
alt(
x)
- sign alternation function (return 1.0 if int(x) is even, -1.0 if int(x) is odd)
-
asin(
x)
- arc sine function
-
asinh(
x)
- inverse hyperbolic sine function
-
atan(
x)
- arc tangent function
-
atanh(
x)
- inverse hyperbolic tangent function
-
atan2(
x,y)
- arc tangent function of two variables
-
ceil(
x)
- smallest integral value not less than argument
-
channel(
r,g,b,a)
- select numeric argument based on current channel
-
channel(
c,m,y,k,a)
- select numeric argument based on current channel
-
clamp(
x)
- clamp value
-
cos(
x)
- cosine function
-
cosh(
x)
- hyperbolic cosine function
-
debug(
x)
- print x (useful for debugging your expression)
-
drc(
x,y)
- dynamic range compression (knee curve); drc(x,y)=(x)/(y*(x-1)+1); -1<y<1
-
exp(
x)
- natural exponential function (ex)
-
floor(
x)
- largest integral value not greater than argument
-
gauss(
x)
- gaussian function; gauss(x)=exp(-x*x/2)/sqrt(2*pi)
-
gcd(
x,y)
- greatest common denominator
-
hypot(
x,y)
- the square root of x2+y2
-
int(
x)
- greatest integer function (return greatest integer less than or equal to x)
-
isnan(
x)
- return 1.0 if x is NAN, 0.0 otherwise
-
j0(
x)
- Bessel functions of x of the first kind of order 0
-
j1(
x)
- Bessel functions of x of the first kind of order 1
-
jinc(
x)
- jinc function (max=1, min=-0.1323); jinc(x)=2*j1(pi*x)/(pi**x)
-
ln(
x)
- natural logarithm function
-
log(
x)
- logarithm base 10
-
logtwo(
x)
- logarithm base 2
-
ln(
x)
- natural logarithm
-
max(
x, y)
- maximum of x and y
-
min(
x, y)
- minimum of x and y
-
mod(
x, y)
- floating-point remainder function
-
not(
x)
- return 1.0 if x is zero, 0.0 otherwise
-
pow(
x,y)
- power function (xy)
-
rand()
- value uniformly distributed over the interval [0.0, 1.0) with a 2 to the 128th-1 period
-
round()
- round to integral value, regardless of rounding direction
-
sign(
x)
- return -1.0 if x is less than 0.0 otherwise 1.0
-
sin(
x)
- sine function
-
sinc(
x)
- sinc function (max=1, min=-0.21); sinc(x)=sin(pi*x)/(pi*x)
-
squish(
x)
- squish function; squish(x)=1.0/(1.0+exp(-x))
-
sinh(
x)
- hyperbolic sine function
-
sqrt(
x)
- square root function
-
tan(
x)
- tangent function
-
tanh(
x)
- hyperbolic tangent function
-
trunc(
x)
- round to integer, towards zero
-
while(
condition,expression)
- iterate while the condition is not equal to 0
The expression semantics include these rules:
- symbols are case insensitive
- only one ternary conditional (e.g. x ? y : z) per statement
- statements are assignments or the final expression to return
- an assignment starts a statement, it is not an operator
- assignments to built-ins do not throw an exception and have no effect; e.g.
r=3.0; r
returns the pixel red color value, not 3.0 - Unary operators have a lower priority than binary operators, that is, the unary minus (negation) has lower precedence than exponentiation, so -3^2 is interpreted as -(3^2) = -9. Use parentheses to clarify your intent (e.g. (-3)^2 = 9).
- Similarly, care must be exercised when using the slash ('/') symbol. The string of characters 1/2x is interpreted as (1/2)x. The contrary interpretation should be written explicitly as 1/(2x). Again, the use of parentheses helps clarify the meaning and should be used whenever there is any chance of misinterpretation.
Source Images
The symbols u
and v
refer to the first and second images, respectively, in the current image sequence. Refer to a particular image in a sequence by appending its index to any image reference (usually u
), with a zero index for the beginning of the sequence. A negative index counts from the end. For example, u[0]
is the first image in the sequence, u[2]
is the third, u[-1]
is the last image, and u[t]
is the current image. The current image can also be referenced by s
. If the sequence number exceeds the length of the sequence, the count is wrapped. Thus in a 3-image sequence, u[-1]
, u[2]
, and u[5]
all refer to the same (third) image.
As an example, we form an image by averaging the first image and third images (the second (index 1) image is ignored and just junked):
convert image1.jpg image2.jpg image3.jpg -fx "(u+u[2])/2.0" image.jpg
By default, the image to which p
, r
, g
, b
, a
, etc., are applied is the current image s
in the image list. This is equivalent to u
except when used in an escape sequence %[fx:...]
.
It is important to note the special role played by the first image. This is the only image in the image sequence that is modified, other images are used only for their data. As an illustrative example, consider the following, and note that the setting -channel red instructs -fx to modify only the red channel; nothing in the green or blue channels will change. It is instructive to ponder why the result is not symmetric.
convert -channel red logo: -flop logo: -resize "20%" -fx "(u+v)/2" image.jpg
Accessing Pixels
All color values are normalized to the range of 0.0 to 1.0. The alpha channel ranges from 0.0 (fully transparent) to 1.0 (fully opaque).
The pixels are processed one at a time, but a different pixel of an image can be specified using a pixel index represented by p
. For example,
p[-1].g green value of pixel to the immediate left of the current pixel p[-1,-1].r red value of the pixel diagonally left and up from current pixel
To specify an absolute position, use braces, rather than brackets.
p{0,0}.r red value of the pixel in the upper left corner of the image p{12,34}.b blue pixel value at column number 12, row 34 of the image
Integer values of the position retrieve the color of the pixel referenced, while non-integer position values return a blended color according to the current -interpolate setting.
A position outside the boundary of the image retrieves a value dictated by the -virtual-pixel option setting.
Apply an Expression to Select Image Channels
Use the -channel setting to specify the output channel of the result. If no output channel is given, the result is set over all channels except the opacity channel. For example, to replace the red channel of alpha.png
with the average of the green channels from the images alpha.png
and beta.png
, use:
convert alpha.png beta.png -channel red -fx "(u.g+v.g)/2" gamma.png
Results
The -fx operator evaluates the given expression for each channel (set by -channel) of each pixel in the first image (u
) in the sequence. The computed values are temporarily stored in a copy (clone) of that first image until all the pixels have been processed, after which this single new image replaces the list of images in the current image sequence. As such, in the previous example the updated version of alpha.png
replaces both of the original images, alpha.png
and beta.png
, before being saved as gamma.png
.
The current image s
is set to the first image in the sequence (u
), and t
to its index, 0. The symbols i
and j
reference the current pixel being processed.
For use with -format, the value-escape %[fx:]
is evaluated just once for each image in the current image sequence. As each image in the sequence is being evaluated, s
and t
successively refer to the current image and its index, while i
and j
are set to zero, and the current channel set to red (-channel is ignored). An example:
convert canvas:'rgb(25%,50%,75%)' rose: -colorspace rgb \ -format 'Red channel of NW corner of image #%[fx:t] is %[fx:s]' info: Red channel of NW corner of image #0 is 0.453758 Red channel of NW corner of image #1 is 0.184588
Here we use the image indexes to rotate each image differently, and use -set
with the image index to set a different pause delay on the first image in the animation:
convert rose: -duplicate 29 -virtual-pixel Gray -distort SRT '%[fx:360.0*t/n]' \ -set delay '%[fx:t == 0 ? 240 : 10]' -loop 0 rose.gif"
The color-escape %[pixel:]
is evaluated once per image and per color channel in that image (-channel is ignored), The values generated are then converted into a color string (a named color or hex color value). The symbols i
and j
are set to zero, and s
and t
refer to each successively current image and index.