What is the difference between 'macro' and 'command'?
Clash Royale CLAN TAG#URR8PPP
Some people use the word 'macro' and 'command' interchangeably to refer to the instructions given to LaTeX. Is there a real difference between the two terms? If there is, can we see some examples on what the differences are?
macros
|
show 1 more comment
Some people use the word 'macro' and 'command' interchangeably to refer to the instructions given to LaTeX. Is there a real difference between the two terms? If there is, can we see some examples on what the differences are?
macros
2
We writemacro
orcommand
orcontrol sequence
because we don't want to use the same name all the time. We're not lazy ;-)
– Christian Hupfer
Jan 4 at 8:15
1
I personally do use these three words slightly differently, and I'm now wondering if this is correct. For me a "macro" is a command defined in terms of other commands (it can be expanded, so I exclude primitives and lengths), a "command" is an instruction that does something (which is admittedly vague, but this would include macros, but not lengths) and a "control" sequence" is a single token that consists of a backslash followed by a sequence of characters (irrespective of whether it is defined at all, but I'm not including active characters). Most commands are all three of these things.
– Circumscribe
Jan 4 at 8:39
@Circumscribe There are expandable primitives that are sometimes expanding to a value, e.g.eTeXrevision
. Is this a macro as per your definition?
– TeXnician
Jan 4 at 8:41
@TeXnician: Good point, I realised that only just before you posted your comment [for those reading along: the first version of my comment said that a macro is something that expands to something else]. I would say it is not (nor are e.g.the
,ifx
andcsname
), but I don't know how (or if) these terms are officially defined and I'm actually curious about what users think.
– Circumscribe
Jan 4 at 8:50
Clarification: for my definition a "macro" would be anything for which theshow<macro>
output starts with "macro:" (and which is thus ultimately defined using a version ofdef
?) and a command would be anything that is either a macro or a primitive (something for whichshow<primitive>
outputs<primitive>
), but is not a register (or a character). That last part is probably terrible as a definition though. I also tend to say "command" rather than "macro" when the instruction performs an assignment rather than producing output, but that's probably just me.
– Circumscribe
Jan 4 at 10:01
|
show 1 more comment
Some people use the word 'macro' and 'command' interchangeably to refer to the instructions given to LaTeX. Is there a real difference between the two terms? If there is, can we see some examples on what the differences are?
macros
Some people use the word 'macro' and 'command' interchangeably to refer to the instructions given to LaTeX. Is there a real difference between the two terms? If there is, can we see some examples on what the differences are?
macros
macros
edited Jan 4 at 5:12
Werner
440k669671662
440k669671662
asked Jan 4 at 4:45
Al-Motasem AldaoudeyehAl-Motasem Aldaoudeyeh
1,505312
1,505312
2
We writemacro
orcommand
orcontrol sequence
because we don't want to use the same name all the time. We're not lazy ;-)
– Christian Hupfer
Jan 4 at 8:15
1
I personally do use these three words slightly differently, and I'm now wondering if this is correct. For me a "macro" is a command defined in terms of other commands (it can be expanded, so I exclude primitives and lengths), a "command" is an instruction that does something (which is admittedly vague, but this would include macros, but not lengths) and a "control" sequence" is a single token that consists of a backslash followed by a sequence of characters (irrespective of whether it is defined at all, but I'm not including active characters). Most commands are all three of these things.
– Circumscribe
Jan 4 at 8:39
@Circumscribe There are expandable primitives that are sometimes expanding to a value, e.g.eTeXrevision
. Is this a macro as per your definition?
– TeXnician
Jan 4 at 8:41
@TeXnician: Good point, I realised that only just before you posted your comment [for those reading along: the first version of my comment said that a macro is something that expands to something else]. I would say it is not (nor are e.g.the
,ifx
andcsname
), but I don't know how (or if) these terms are officially defined and I'm actually curious about what users think.
– Circumscribe
Jan 4 at 8:50
Clarification: for my definition a "macro" would be anything for which theshow<macro>
output starts with "macro:" (and which is thus ultimately defined using a version ofdef
?) and a command would be anything that is either a macro or a primitive (something for whichshow<primitive>
outputs<primitive>
), but is not a register (or a character). That last part is probably terrible as a definition though. I also tend to say "command" rather than "macro" when the instruction performs an assignment rather than producing output, but that's probably just me.
– Circumscribe
Jan 4 at 10:01
|
show 1 more comment
2
We writemacro
orcommand
orcontrol sequence
because we don't want to use the same name all the time. We're not lazy ;-)
– Christian Hupfer
Jan 4 at 8:15
1
I personally do use these three words slightly differently, and I'm now wondering if this is correct. For me a "macro" is a command defined in terms of other commands (it can be expanded, so I exclude primitives and lengths), a "command" is an instruction that does something (which is admittedly vague, but this would include macros, but not lengths) and a "control" sequence" is a single token that consists of a backslash followed by a sequence of characters (irrespective of whether it is defined at all, but I'm not including active characters). Most commands are all three of these things.
– Circumscribe
Jan 4 at 8:39
@Circumscribe There are expandable primitives that are sometimes expanding to a value, e.g.eTeXrevision
. Is this a macro as per your definition?
– TeXnician
Jan 4 at 8:41
@TeXnician: Good point, I realised that only just before you posted your comment [for those reading along: the first version of my comment said that a macro is something that expands to something else]. I would say it is not (nor are e.g.the
,ifx
andcsname
), but I don't know how (or if) these terms are officially defined and I'm actually curious about what users think.
– Circumscribe
Jan 4 at 8:50
Clarification: for my definition a "macro" would be anything for which theshow<macro>
output starts with "macro:" (and which is thus ultimately defined using a version ofdef
?) and a command would be anything that is either a macro or a primitive (something for whichshow<primitive>
outputs<primitive>
), but is not a register (or a character). That last part is probably terrible as a definition though. I also tend to say "command" rather than "macro" when the instruction performs an assignment rather than producing output, but that's probably just me.
– Circumscribe
Jan 4 at 10:01
2
2
We write
macro
or command
or control sequence
because we don't want to use the same name all the time. We're not lazy ;-)– Christian Hupfer
Jan 4 at 8:15
We write
macro
or command
or control sequence
because we don't want to use the same name all the time. We're not lazy ;-)– Christian Hupfer
Jan 4 at 8:15
1
1
I personally do use these three words slightly differently, and I'm now wondering if this is correct. For me a "macro" is a command defined in terms of other commands (it can be expanded, so I exclude primitives and lengths), a "command" is an instruction that does something (which is admittedly vague, but this would include macros, but not lengths) and a "control" sequence" is a single token that consists of a backslash followed by a sequence of characters (irrespective of whether it is defined at all, but I'm not including active characters). Most commands are all three of these things.
– Circumscribe
Jan 4 at 8:39
I personally do use these three words slightly differently, and I'm now wondering if this is correct. For me a "macro" is a command defined in terms of other commands (it can be expanded, so I exclude primitives and lengths), a "command" is an instruction that does something (which is admittedly vague, but this would include macros, but not lengths) and a "control" sequence" is a single token that consists of a backslash followed by a sequence of characters (irrespective of whether it is defined at all, but I'm not including active characters). Most commands are all three of these things.
– Circumscribe
Jan 4 at 8:39
@Circumscribe There are expandable primitives that are sometimes expanding to a value, e.g.
eTeXrevision
. Is this a macro as per your definition?– TeXnician
Jan 4 at 8:41
@Circumscribe There are expandable primitives that are sometimes expanding to a value, e.g.
eTeXrevision
. Is this a macro as per your definition?– TeXnician
Jan 4 at 8:41
@TeXnician: Good point, I realised that only just before you posted your comment [for those reading along: the first version of my comment said that a macro is something that expands to something else]. I would say it is not (nor are e.g.
the
, ifx
and csname
), but I don't know how (or if) these terms are officially defined and I'm actually curious about what users think.– Circumscribe
Jan 4 at 8:50
@TeXnician: Good point, I realised that only just before you posted your comment [for those reading along: the first version of my comment said that a macro is something that expands to something else]. I would say it is not (nor are e.g.
the
, ifx
and csname
), but I don't know how (or if) these terms are officially defined and I'm actually curious about what users think.– Circumscribe
Jan 4 at 8:50
Clarification: for my definition a "macro" would be anything for which the
show<macro>
output starts with "macro:" (and which is thus ultimately defined using a version of def
?) and a command would be anything that is either a macro or a primitive (something for which show<primitive>
outputs <primitive>
), but is not a register (or a character). That last part is probably terrible as a definition though. I also tend to say "command" rather than "macro" when the instruction performs an assignment rather than producing output, but that's probably just me.– Circumscribe
Jan 4 at 10:01
Clarification: for my definition a "macro" would be anything for which the
show<macro>
output starts with "macro:" (and which is thus ultimately defined using a version of def
?) and a command would be anything that is either a macro or a primitive (something for which show<primitive>
outputs <primitive>
), but is not a register (or a character). That last part is probably terrible as a definition though. I also tend to say "command" rather than "macro" when the instruction performs an assignment rather than producing output, but that's probably just me.– Circumscribe
Jan 4 at 10:01
|
show 1 more comment
3 Answers
3
active
oldest
votes
macro programming is a programming style (as distinct from compiled programs) where the operation works by each token being replaced inline by its replacement text. Popular macro programming languages other than TeX include the C language pre-processor and the m4 system.
What macro means in LaTeX?
TeX's main mode of user customisation is via its macro processing language, but of course it requires some built in commands (primitives) that you may use to build up macros.
So commands such as def
, halign
, count
are not macros.
Tex also has some primitives that are classed as expandable and so act pretty much the same as pre-defined macros although technically they are not classed as macros in TeX, a list of these expandable primitives is here:
Is there a list of expandable TeX primitives? LaTeX? e-TeX? others?
A control sequence is a string of characters such as foobar
that references a command (or token, or is undefined) as opposed to a character such as x
that just references itself, it may reference tex primitives or macros, depending on the context. Note that being a macro is a property of the meaning of the token, but being a control sequence is a property of its syntax. So by default ~
is a macro but it is an active character token not a control sequence.
In TeX macros are always defined by def
and its variants such as edef
. LaTeX definition commands such as newcommand
are themselves macros that ultimately result in an application of def
to define a macro.
add a comment |
I seldom use the term "command" when it is about (La)TeX.
In the TeXbook the term "command" does also appear in contexts where not single tokens but the coming into being of sequences of control word tokens plays a rôle.
E.g., chapter 15 of the TeXbook includes a double dangerous bend paragraph where you can read:
[...] Therefore plain TeX provides an allocation function for registers; Appendix B includes the command
newinsertfootins
which definesfootins
as the number for footnote insertions. [...]
So, e.g., newinsertfootins
is a "command".
(By the way: The mentioned appendix B does actually not exhibit tokens. It exhibits TeX code, i.e., TeX-input which the programmer of the plain TeX format did produce. That TeX-input itself is not tokens. TeX will create tokens when processing that code.)
In the TeXbook you also find the phrase "primitive command". It is used in contexts where very short and easily to grasp sequences of TeX-input play a rôle. (Sequences that, when being read and tokenized, yield primitives...)
I suppose, a "command" according to the diction of the TeXbook is a sequence of characters which forms some portion of TeX-input. A portion of TeX-input which, when TeX reads and tokenizes that sequence of characters, yields a set of whatsoever tokens and/or (e.g., in case the command in question contains "invalid characters"/characters of category code 15) some error-messages. In case of yielding a set of tokens, that set of tokens is intended to form a syntactical entity which TeX's token-processing-apparatus can attempt to carry out/to process in whatsoever way. Such an attempt in turn may also yield error-messages and/or erroneous behavior.
Thus the term "command" in the diction of the TeXbook itself seems to actually not focus on things as results of TeX's "digestive processes". One of the earlier results of TeX's digestive processes are tokens. Tokens are produced in TeX's mouth. Thus the term "command" itself seems to actually not refer to tokens. It seems to refer to subsets of the things which TeX gets to "eat"—it refers to portions of the TeX-input, i.e., to the things which in interactive mode the user/programmer can type, or which TeX can read from .tex-input-files or read
-handles for tokenization and further processing.
In the LaTeX 2e-kernel you have things like newcommand
and renewcommand
.
These things during LaTeX's digestive processes cause LaTeX to perform assignments. But the point of view seems not to be the view of the LaTeX-program which "eats" LaTeX-input and while "digesting" produces "tokens" etc. The point of view seems to be the view of the programmer. The programmer does not produce tokens. She/he produces LaTeX-input. That LaTeX-input is not tokens yet. It is a sequence of input-characters.
After writing newcommand...
, the programmer in her/his code can take the thing introduced via newcommand
for a small portion of LaTeX-input which when tokenized by LaTeX yields a token which itself is considered a syntactical entity which, when used as intended, can be carried out via LaTeX's token-processing-apparatus without yielding erroneous behavior.
I could not find the phrase "command token" in texbook.tex.
This can also be taken for an indication that "command" and "token" in the diction of the TeXbook do in the upshot not refer to the same stage of work.
It seems, in the TeXbook the term "command" is used when the focus is on that stage where the programmer produces/delivers/types whatsoever .tex-input.
It seems, in the TeXbook the terms "token" and "macro" are used when the focus is on that stage where the TeX-program does process the .tex-input.
The term "macro" seems to refer to the concept of TeX's tokens while tokens and thus macro-tokens do not come into being while writing/producing/delivering TeX-input but while having the TeX-program read and tokenize TeX-input and from the resulting tokens expand the expandable ones.
As far as I know, macro-tokens are ⟨control sequences⟩ that both are expandable and do not have the meaning of one of those about 300 low-level atomic operations that are called "primitives".
What kinds of tokens can form such ⟨control sequences⟩?
⟨control sequences⟩ come in two flavors:
⟨control sequence tokens⟩
These divide into ⟨control symbol tokens⟩ and ⟨control word tokens⟩
⟨active character tokens⟩
With this remark we reach the question of how to divide up ⟨control sequences⟩.
As just seen, one way of dividing up ⟨control sequences⟩ is by looking at the question what kinds of tokens can form ⟨control sequences⟩:
⟨control sequence tokens⟩ are those lovely "thingies" that in TeX input need to be preceded by a character whose category code is 0 (escape).
Usually the backslash-character is the only character whose category code is 0.
⟨control sequence tokens⟩ come in two flavors:
⟨control symbol tokens⟩ have names which consist of a single character which does not have category code 11 (letter).
E.g., usually .
and \
are ⟨control symbol tokens⟩ as usually the characters .
and do not have category code 11 (letter).
a
usually is not a ⟨control symbol token⟩ as usually the character a
does have the category code 11 (letter).
⟨control word tokens⟩ have names which consist either of several characters which do not need to have the same category code, or of a single character which does have the category code 11 (letter).
Any ⟨control sequence⟩ (i.e., both ⟨control sequence tokens⟩ and ⟨active character tokens⟩, see below) can come into being
- when (La)TeX reads and tokenizes input.
- when (La)TeX inserts it during expansion; e.g., as part of the ⟨replacement text⟩ of a macro; e.g., as part of the
the
-expansion of a token-register.
A ⟨control sequence token⟩ can come into being also
- as the result of expanding some
csname..endcsname
-expression.
Any character whose current category code is not 11 (letter), even a character of category code 15 (invalid), can form the name of a ⟨control symbol token⟩ that came into being as a result of reading and tokenizing TeX-input.
(I say "current" because as soon as the character that forms the name of the ⟨control sequence token⟩ in question is switched to 11 (letter), the ⟨control sequence token⟩ in question will not be treated as a ⟨control symbol token⟩ (any more) but as a ⟨control word token⟩.)
Otherwise you could, e.g., not use ⟨control symbol tokens⟩ for changing the category codes of characters that currently have category code 15 (invalid).
But you can do, e.g.,:
catcode`A=15 % Now uppercase-a is invalid.
% Typing an "A" both outside a comment and not behind a character
% of category code 5 (end of line) which is not
% intended to form the name of a control-symbol-token would yield
% an error-message now.
% But you can do:
catcode`A=11 % Now uppercase-a is a letter again.
After elaborating on the names of ⟨control symbol tokens⟩, let's elaborate on the names of ⟨control word tokens⟩:
On the one hand, TeX's reading- and tokenizing-apparatus will only take sequences of characters of category code 11 (letter) as names of ⟨control word tokens⟩.
Thus names of ⟨control word tokens⟩ created via having TeX read and tokenize them directly from TeX-input consist only of characters that had category code 11 (letter) at the time of reading and tokenizeng them.
On the other hand names of ⟨control word tokens⟩ created via evaluating csname..endcsname
-expressions can consist either of more than one character of whatsoever category code differing from 5, 9, 14 and 15 at the time of reading and tokenizing the csname..endcsname
-expression , or of a single character of category code 11 (letter).
A character of category code 10 (space)—usually the space character (ASCII 32) is the only character with that category code — that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control symbol token⟩ will not be ignored but will be tokenized as an explicit space token.
The only exception to that rule is with such characters that follow the control-space, , i.e., that ⟨control symbol token⟩ whose name is formed by the space character. These will be ignored, too.
A character of category code 10 (space) that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control word token⟩ will be ignored/will not yield any token.
When (La)TeX does unexpanded-write a ⟨control word token⟩ (be it writing to file, be it writing to the screen), it automatically inserts a space character at the end of the character sequence that represents the ⟨control word token⟩ in question.
When (La)TeX does unexpanded-write a ⟨control symbol token⟩ it will not automatically insert such a trailing space character.
When applying string
for transforming the following token into a sequence of ⟨character tokens⟩ (either character tokens of category code 12 (other) and/or character tokens of category code 10 (space) and character code 32—32 encodes the space-character in ASCII—so called explicit space tokens—(La)TeX will not attach an additional trailing space token.
You can change the way in which TeX does unexpanded-write a ⟨control sequence token⟩ whose name consists of a single character by before writing changing the category code of that character either to 11 (letter)—in this case it will be written as a ⟨control word token⟩ with trailing space— or to a value differing from 11—in this case it will be written as a ⟨control symbol token⟩ without trailing space.
Thus you can have fun with TeX's automatic insertion of spaces when doing delayed writes with unexpanded ⟨control sequence tokens⟩ whose names consist of a single character—the code
documentclassarticle
begindocument
newwritemywrite
immediateopenoutmywrite test.txtrelax
catcode`A=12
writemywritebnoexpandAb
immediatewritemywritebnoexpandAb
catcode`A=11
Hello
enddocument
besides the .dvi- or .pdf-file yields a file test.txt with the following content:
bAb
bA b
As both write
-commands were issued while A
was of category code 12 (other), you might have expected each A
to be unexpanded-written as ⟨control symbol token⟩ without a trailing space, and thus you might have expected the output
bAb
bAb
. But the first write
was not immediate
and thus it got carried out not immediately but at the time of shipping out the page. At that time, the second write
, which was immediate
, was already done and A
had category code 11 (letter) again and thus the A
in the first write
was written within the second line as a ⟨control word token⟩, with LaTeX attaching a trailing space character.
The nameless ⟨control sequence token⟩—methods for forming it are: 1) expanding csnameendcsname
; 2) placing a backslash (a character of category code 0 (escape)) at the end of a line while the parameter endlinechar
has a non-positive value or a value outside the range of code-points of the TeX-engine's input-encoding—deserves special attention:
Applying string
to it always yields the catcode 12(other) character token sequence csnameendcsname
. There is no space token between csname
and endcsname
. There also is no trailing space token.
Unexpanded-writing it yields the character sequence csnameendcsname␣
—there is a trailing space character.
Besides the ⟨control sequence tokens⟩ there is another kind of ⟨control sequences⟩: ⟨active character tokens⟩.
An ⟨active character token⟩ is a ⟨character token⟩ whose category code is 13 (active).
An ⟨active character token⟩ can be used like a ⟨control sequence token⟩.
E.g., after catcode`b=13
you can do def bThe active character string b is a macro now.
Another way of dividing up ⟨control sequences⟩ is by looking at their "decomposability":
The TeXBook says:
About 300 of TeX's control sequences are called primitive; these are
the low-level atomic operations that are not decomposable into simpler
functions. All other control sequences are defined, ultimately, in
terms of the primitive ones. For example, input is a primitive
operation, but ’ and " are not; the latter are defined in terms of
an accent primitive.
(It should not be overlooked that when defining ⟨control sequences⟩ that are macros, ⟨non-active character tokens⟩ can play a rôle also.)
Yet an another way of dividing up ⟨control sequences⟩ is by looking at the actions triggered by them and at the point in time when these actions are triggered:
There is the analogy between TeX's input-processing and digestive processes:
- TeX input is divided up into so-called tokens (⟨control sequence tokens⟩ and ⟨character tokens⟩) in TeX's mouth.
- Expansion of expandable ⟨control sequences⟩ takes place in the gullet.
Unexpandable ⟨control sequences⟩ will be handled in the stomach.
So you distinguish expandable ⟨control sequences⟩ and unexpandable ⟨control sequences⟩.
Expandable ⟨control sequences⟩ (and their arguments) will in the gullet be replaced by other constellations of tokens. Replacement of expandable ⟨control sequences⟩ takes place in the gullet until there are no more expandable ⟨control sequences⟩ left.
Unexpandable ⟨control sequences⟩ (which came into being either during tokenization in TeX's mouth or via replacement of expandable tokens during expansion) pass the gullet and reach the stomach where boxes will be built, assignment-primitives will be carried out, etc.
There are primitive ⟨control sequences⟩ that are expandable:
E.g., the string
-primitive is an expandable primitive. Together with the following token it vanishes in TeX's gullet and as replacement you get a set of ⟨character tokens⟩ of category code 12 (other) or 10 (space) that represents the name of the ⟨control sequence token⟩ in question/is the catcode-12-pendant of the ⟨character token⟩ in question. (When an explicit space token trails string
, string
will deliver an explicit space token.)
E.g., the csname
-primitive is an expandable primitive. It triggers the processing of a csname..endcsname
-expression in TeX's gullet and it vanishes in TeX's gullet and as replacement you get the corresponding ⟨control symbol token⟩ or ⟨control word token⟩. In case that token is undefined, it will get the meaning of the relax
-primitive within the current scope.
E.g., the romannumeral
-primitive is an expandable primitive. It and a following ⟨number⟩ vanish in TeX's gullet and in case the ⟨number⟩ is positive as replacement you get a sequence of ⟨character tokens⟩ of category code 12 (other) that represent the corresponding ⟨number⟩ in lowercase-roman notation.
There are primitive ⟨control sequences⟩ that are not expandable:
E.g., the relax
-primitive is not expandable. It does pass the gullet and reach the stomach.
E.g., the assignment-primitives def
, edef
, gdef
, xdef
, countdef
etc are not expandable. They do not trigger their replacement in TeX's gullet but will reach the stomach.
There are non-primitive ⟨control sequences⟩ that are expandable:
Macros are non-primitive ⟨control sequences⟩ that are expandable. They and their arguments get replaced with their ⟨replacement text⟩ in TeX's gullet. Macros are defined in TeX's stomach in terms of the primitives def
, edef
, gdef
and xdef
. Be aware that LaTeX's newcommand
also is just a macro which "decomposes" in LaTeX's gullet to some set of tokens containing the def
-primitive.
There are non-primitive ⟨control sequences⟩ that are not expandable:
You can, e.g., do things like letbgroup={
.
bgroup
is a ⟨control sequence⟩. More specific: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is not expandable but it also does not have the meaning of one of the about 300 non-decomposable functions of TeX that are called primitives. bgroup
is an implicit character token. ;-)
Other kinds of non-primitive ⟨control sequences⟩ that are not expandable are, e.g., chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
In order to get used to the terminology, let's unravel some cases:
After carrying out the assignments
catcode`W=13 %
catcode`X=13 %
catcode`Y=13 %
catcode`Z=13 %
newcommand WThe active character stringW is a macro.
newcommandsomethingThis is a macro.
newcommand+This is a macro, too.
let X=a
let Y=relax
let Z=romannumeral
,
W
is a ⟨control sequence⟩: It is an ⟨active character token⟩. It is a macro and thus both expandable and not a primitive.X
is a ⟨control sequence⟩: It is an
⟨active character token⟩. It is not expandable. Thus it cannot be a macro. It also is not an unexpandable primitive. It has the same meaning as the catcode 11(letter) ⟨character token⟩a
. It is an implicit character token.Y
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the unexpandablerelax
-primitive. Thus it is an unexpandable primitive.Z
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the expandableromannumeral
-primitive. Thus it is an expandable primitive.something
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.+
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control symbol token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.
The term "macro" does always denote a non-primitive expandable ⟨control sequence⟩.
Thus when you know that a token is a macro, you can conclude that it cannot be a
⟨non-active character token⟩ because ⟨non-active character tokens⟩ cannot be ⟨control sequences⟩.
You can conclude that it is a ⟨control sequence⟩.
But you cannot conclude whether the ⟨control sequence⟩ is a ⟨control word token⟩ or a ⟨control symbol token⟩ or an
⟨active character token⟩.
Not all expandable ⟨control sequences⟩ are macros.
There are expandable ⟨primitives⟩ as well.
Not all non-expandable ⟨control sequences⟩ are ⟨primitives⟩.
There are non-expandable non-primitives as well:
E.g., implicit character tokens, chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
1
Sorry for asking, but just curiosity: are you writing everything from your mind?
– Sigur
Jan 5 at 0:33
I just observed you were editing a lot.
– Sigur
Jan 5 at 0:51
1
@Sigur I just thought about it, looked up the quotes and spontaneously typed into the input-field of the TeX-LaTeX-StackExchange-Website shown in my internet-browser. If I had really planned this answer systematically, I would have typed within my favorite editor-software, saved the text as a file and revised it a few times before pasting it from the editor-software to the input-field of the TeX-LaTeX-StackExchange-Website. But I didn't take it all too serious. I just wanted to have fun...;-)
– Ulrich Diez
Jan 5 at 1:10
2
Ulrich, Since TeX doesn't end in an orthographic 's', it's possessive form should be "TeX's" not just "TeX'".
– Alan Munn
Jan 5 at 17:04
2
@AlanMunn These comments might prevent others from making the same mistakes... ;-)
– Ulrich Diez
Jan 5 at 17:41
|
show 3 more comments
They're used interchangeably because they mean the same thing.
A macro is defined as
a single computer instruction that stands for a sequence of operations
The use of command probably stems from the way it's defined in LaTeX via
newcommand
andrenewcommand
and LaTeX3'sxparse
viaNewDocumentCommand
andRenewDocumentCommand
.There's also control sequence used by DEK in his TeX Book. They all point to the same thing.
7
I would not say they mean the same thing, tex primitives are not macros.
– David Carlisle
Jan 4 at 10:12
I'm confident the majority of people using (La)TeX use the terminology interchangeably.
– Werner
Jan 4 at 17:32
2
Certainly some do but that I think leads to confusion (as witnessed by this question)
– David Carlisle
Jan 4 at 17:35
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "85"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2ftex.stackexchange.com%2fquestions%2f468508%2fwhat-is-the-difference-between-macro-and-command%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
macro programming is a programming style (as distinct from compiled programs) where the operation works by each token being replaced inline by its replacement text. Popular macro programming languages other than TeX include the C language pre-processor and the m4 system.
What macro means in LaTeX?
TeX's main mode of user customisation is via its macro processing language, but of course it requires some built in commands (primitives) that you may use to build up macros.
So commands such as def
, halign
, count
are not macros.
Tex also has some primitives that are classed as expandable and so act pretty much the same as pre-defined macros although technically they are not classed as macros in TeX, a list of these expandable primitives is here:
Is there a list of expandable TeX primitives? LaTeX? e-TeX? others?
A control sequence is a string of characters such as foobar
that references a command (or token, or is undefined) as opposed to a character such as x
that just references itself, it may reference tex primitives or macros, depending on the context. Note that being a macro is a property of the meaning of the token, but being a control sequence is a property of its syntax. So by default ~
is a macro but it is an active character token not a control sequence.
In TeX macros are always defined by def
and its variants such as edef
. LaTeX definition commands such as newcommand
are themselves macros that ultimately result in an application of def
to define a macro.
add a comment |
macro programming is a programming style (as distinct from compiled programs) where the operation works by each token being replaced inline by its replacement text. Popular macro programming languages other than TeX include the C language pre-processor and the m4 system.
What macro means in LaTeX?
TeX's main mode of user customisation is via its macro processing language, but of course it requires some built in commands (primitives) that you may use to build up macros.
So commands such as def
, halign
, count
are not macros.
Tex also has some primitives that are classed as expandable and so act pretty much the same as pre-defined macros although technically they are not classed as macros in TeX, a list of these expandable primitives is here:
Is there a list of expandable TeX primitives? LaTeX? e-TeX? others?
A control sequence is a string of characters such as foobar
that references a command (or token, or is undefined) as opposed to a character such as x
that just references itself, it may reference tex primitives or macros, depending on the context. Note that being a macro is a property of the meaning of the token, but being a control sequence is a property of its syntax. So by default ~
is a macro but it is an active character token not a control sequence.
In TeX macros are always defined by def
and its variants such as edef
. LaTeX definition commands such as newcommand
are themselves macros that ultimately result in an application of def
to define a macro.
add a comment |
macro programming is a programming style (as distinct from compiled programs) where the operation works by each token being replaced inline by its replacement text. Popular macro programming languages other than TeX include the C language pre-processor and the m4 system.
What macro means in LaTeX?
TeX's main mode of user customisation is via its macro processing language, but of course it requires some built in commands (primitives) that you may use to build up macros.
So commands such as def
, halign
, count
are not macros.
Tex also has some primitives that are classed as expandable and so act pretty much the same as pre-defined macros although technically they are not classed as macros in TeX, a list of these expandable primitives is here:
Is there a list of expandable TeX primitives? LaTeX? e-TeX? others?
A control sequence is a string of characters such as foobar
that references a command (or token, or is undefined) as opposed to a character such as x
that just references itself, it may reference tex primitives or macros, depending on the context. Note that being a macro is a property of the meaning of the token, but being a control sequence is a property of its syntax. So by default ~
is a macro but it is an active character token not a control sequence.
In TeX macros are always defined by def
and its variants such as edef
. LaTeX definition commands such as newcommand
are themselves macros that ultimately result in an application of def
to define a macro.
macro programming is a programming style (as distinct from compiled programs) where the operation works by each token being replaced inline by its replacement text. Popular macro programming languages other than TeX include the C language pre-processor and the m4 system.
What macro means in LaTeX?
TeX's main mode of user customisation is via its macro processing language, but of course it requires some built in commands (primitives) that you may use to build up macros.
So commands such as def
, halign
, count
are not macros.
Tex also has some primitives that are classed as expandable and so act pretty much the same as pre-defined macros although technically they are not classed as macros in TeX, a list of these expandable primitives is here:
Is there a list of expandable TeX primitives? LaTeX? e-TeX? others?
A control sequence is a string of characters such as foobar
that references a command (or token, or is undefined) as opposed to a character such as x
that just references itself, it may reference tex primitives or macros, depending on the context. Note that being a macro is a property of the meaning of the token, but being a control sequence is a property of its syntax. So by default ~
is a macro but it is an active character token not a control sequence.
In TeX macros are always defined by def
and its variants such as edef
. LaTeX definition commands such as newcommand
are themselves macros that ultimately result in an application of def
to define a macro.
edited Jan 4 at 11:18
answered Jan 4 at 10:27
David CarlisleDavid Carlisle
485k4111211865
485k4111211865
add a comment |
add a comment |
I seldom use the term "command" when it is about (La)TeX.
In the TeXbook the term "command" does also appear in contexts where not single tokens but the coming into being of sequences of control word tokens plays a rôle.
E.g., chapter 15 of the TeXbook includes a double dangerous bend paragraph where you can read:
[...] Therefore plain TeX provides an allocation function for registers; Appendix B includes the command
newinsertfootins
which definesfootins
as the number for footnote insertions. [...]
So, e.g., newinsertfootins
is a "command".
(By the way: The mentioned appendix B does actually not exhibit tokens. It exhibits TeX code, i.e., TeX-input which the programmer of the plain TeX format did produce. That TeX-input itself is not tokens. TeX will create tokens when processing that code.)
In the TeXbook you also find the phrase "primitive command". It is used in contexts where very short and easily to grasp sequences of TeX-input play a rôle. (Sequences that, when being read and tokenized, yield primitives...)
I suppose, a "command" according to the diction of the TeXbook is a sequence of characters which forms some portion of TeX-input. A portion of TeX-input which, when TeX reads and tokenizes that sequence of characters, yields a set of whatsoever tokens and/or (e.g., in case the command in question contains "invalid characters"/characters of category code 15) some error-messages. In case of yielding a set of tokens, that set of tokens is intended to form a syntactical entity which TeX's token-processing-apparatus can attempt to carry out/to process in whatsoever way. Such an attempt in turn may also yield error-messages and/or erroneous behavior.
Thus the term "command" in the diction of the TeXbook itself seems to actually not focus on things as results of TeX's "digestive processes". One of the earlier results of TeX's digestive processes are tokens. Tokens are produced in TeX's mouth. Thus the term "command" itself seems to actually not refer to tokens. It seems to refer to subsets of the things which TeX gets to "eat"—it refers to portions of the TeX-input, i.e., to the things which in interactive mode the user/programmer can type, or which TeX can read from .tex-input-files or read
-handles for tokenization and further processing.
In the LaTeX 2e-kernel you have things like newcommand
and renewcommand
.
These things during LaTeX's digestive processes cause LaTeX to perform assignments. But the point of view seems not to be the view of the LaTeX-program which "eats" LaTeX-input and while "digesting" produces "tokens" etc. The point of view seems to be the view of the programmer. The programmer does not produce tokens. She/he produces LaTeX-input. That LaTeX-input is not tokens yet. It is a sequence of input-characters.
After writing newcommand...
, the programmer in her/his code can take the thing introduced via newcommand
for a small portion of LaTeX-input which when tokenized by LaTeX yields a token which itself is considered a syntactical entity which, when used as intended, can be carried out via LaTeX's token-processing-apparatus without yielding erroneous behavior.
I could not find the phrase "command token" in texbook.tex.
This can also be taken for an indication that "command" and "token" in the diction of the TeXbook do in the upshot not refer to the same stage of work.
It seems, in the TeXbook the term "command" is used when the focus is on that stage where the programmer produces/delivers/types whatsoever .tex-input.
It seems, in the TeXbook the terms "token" and "macro" are used when the focus is on that stage where the TeX-program does process the .tex-input.
The term "macro" seems to refer to the concept of TeX's tokens while tokens and thus macro-tokens do not come into being while writing/producing/delivering TeX-input but while having the TeX-program read and tokenize TeX-input and from the resulting tokens expand the expandable ones.
As far as I know, macro-tokens are ⟨control sequences⟩ that both are expandable and do not have the meaning of one of those about 300 low-level atomic operations that are called "primitives".
What kinds of tokens can form such ⟨control sequences⟩?
⟨control sequences⟩ come in two flavors:
⟨control sequence tokens⟩
These divide into ⟨control symbol tokens⟩ and ⟨control word tokens⟩
⟨active character tokens⟩
With this remark we reach the question of how to divide up ⟨control sequences⟩.
As just seen, one way of dividing up ⟨control sequences⟩ is by looking at the question what kinds of tokens can form ⟨control sequences⟩:
⟨control sequence tokens⟩ are those lovely "thingies" that in TeX input need to be preceded by a character whose category code is 0 (escape).
Usually the backslash-character is the only character whose category code is 0.
⟨control sequence tokens⟩ come in two flavors:
⟨control symbol tokens⟩ have names which consist of a single character which does not have category code 11 (letter).
E.g., usually .
and \
are ⟨control symbol tokens⟩ as usually the characters .
and do not have category code 11 (letter).
a
usually is not a ⟨control symbol token⟩ as usually the character a
does have the category code 11 (letter).
⟨control word tokens⟩ have names which consist either of several characters which do not need to have the same category code, or of a single character which does have the category code 11 (letter).
Any ⟨control sequence⟩ (i.e., both ⟨control sequence tokens⟩ and ⟨active character tokens⟩, see below) can come into being
- when (La)TeX reads and tokenizes input.
- when (La)TeX inserts it during expansion; e.g., as part of the ⟨replacement text⟩ of a macro; e.g., as part of the
the
-expansion of a token-register.
A ⟨control sequence token⟩ can come into being also
- as the result of expanding some
csname..endcsname
-expression.
Any character whose current category code is not 11 (letter), even a character of category code 15 (invalid), can form the name of a ⟨control symbol token⟩ that came into being as a result of reading and tokenizing TeX-input.
(I say "current" because as soon as the character that forms the name of the ⟨control sequence token⟩ in question is switched to 11 (letter), the ⟨control sequence token⟩ in question will not be treated as a ⟨control symbol token⟩ (any more) but as a ⟨control word token⟩.)
Otherwise you could, e.g., not use ⟨control symbol tokens⟩ for changing the category codes of characters that currently have category code 15 (invalid).
But you can do, e.g.,:
catcode`A=15 % Now uppercase-a is invalid.
% Typing an "A" both outside a comment and not behind a character
% of category code 5 (end of line) which is not
% intended to form the name of a control-symbol-token would yield
% an error-message now.
% But you can do:
catcode`A=11 % Now uppercase-a is a letter again.
After elaborating on the names of ⟨control symbol tokens⟩, let's elaborate on the names of ⟨control word tokens⟩:
On the one hand, TeX's reading- and tokenizing-apparatus will only take sequences of characters of category code 11 (letter) as names of ⟨control word tokens⟩.
Thus names of ⟨control word tokens⟩ created via having TeX read and tokenize them directly from TeX-input consist only of characters that had category code 11 (letter) at the time of reading and tokenizeng them.
On the other hand names of ⟨control word tokens⟩ created via evaluating csname..endcsname
-expressions can consist either of more than one character of whatsoever category code differing from 5, 9, 14 and 15 at the time of reading and tokenizing the csname..endcsname
-expression , or of a single character of category code 11 (letter).
A character of category code 10 (space)—usually the space character (ASCII 32) is the only character with that category code — that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control symbol token⟩ will not be ignored but will be tokenized as an explicit space token.
The only exception to that rule is with such characters that follow the control-space, , i.e., that ⟨control symbol token⟩ whose name is formed by the space character. These will be ignored, too.
A character of category code 10 (space) that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control word token⟩ will be ignored/will not yield any token.
When (La)TeX does unexpanded-write a ⟨control word token⟩ (be it writing to file, be it writing to the screen), it automatically inserts a space character at the end of the character sequence that represents the ⟨control word token⟩ in question.
When (La)TeX does unexpanded-write a ⟨control symbol token⟩ it will not automatically insert such a trailing space character.
When applying string
for transforming the following token into a sequence of ⟨character tokens⟩ (either character tokens of category code 12 (other) and/or character tokens of category code 10 (space) and character code 32—32 encodes the space-character in ASCII—so called explicit space tokens—(La)TeX will not attach an additional trailing space token.
You can change the way in which TeX does unexpanded-write a ⟨control sequence token⟩ whose name consists of a single character by before writing changing the category code of that character either to 11 (letter)—in this case it will be written as a ⟨control word token⟩ with trailing space— or to a value differing from 11—in this case it will be written as a ⟨control symbol token⟩ without trailing space.
Thus you can have fun with TeX's automatic insertion of spaces when doing delayed writes with unexpanded ⟨control sequence tokens⟩ whose names consist of a single character—the code
documentclassarticle
begindocument
newwritemywrite
immediateopenoutmywrite test.txtrelax
catcode`A=12
writemywritebnoexpandAb
immediatewritemywritebnoexpandAb
catcode`A=11
Hello
enddocument
besides the .dvi- or .pdf-file yields a file test.txt with the following content:
bAb
bA b
As both write
-commands were issued while A
was of category code 12 (other), you might have expected each A
to be unexpanded-written as ⟨control symbol token⟩ without a trailing space, and thus you might have expected the output
bAb
bAb
. But the first write
was not immediate
and thus it got carried out not immediately but at the time of shipping out the page. At that time, the second write
, which was immediate
, was already done and A
had category code 11 (letter) again and thus the A
in the first write
was written within the second line as a ⟨control word token⟩, with LaTeX attaching a trailing space character.
The nameless ⟨control sequence token⟩—methods for forming it are: 1) expanding csnameendcsname
; 2) placing a backslash (a character of category code 0 (escape)) at the end of a line while the parameter endlinechar
has a non-positive value or a value outside the range of code-points of the TeX-engine's input-encoding—deserves special attention:
Applying string
to it always yields the catcode 12(other) character token sequence csnameendcsname
. There is no space token between csname
and endcsname
. There also is no trailing space token.
Unexpanded-writing it yields the character sequence csnameendcsname␣
—there is a trailing space character.
Besides the ⟨control sequence tokens⟩ there is another kind of ⟨control sequences⟩: ⟨active character tokens⟩.
An ⟨active character token⟩ is a ⟨character token⟩ whose category code is 13 (active).
An ⟨active character token⟩ can be used like a ⟨control sequence token⟩.
E.g., after catcode`b=13
you can do def bThe active character string b is a macro now.
Another way of dividing up ⟨control sequences⟩ is by looking at their "decomposability":
The TeXBook says:
About 300 of TeX's control sequences are called primitive; these are
the low-level atomic operations that are not decomposable into simpler
functions. All other control sequences are defined, ultimately, in
terms of the primitive ones. For example, input is a primitive
operation, but ’ and " are not; the latter are defined in terms of
an accent primitive.
(It should not be overlooked that when defining ⟨control sequences⟩ that are macros, ⟨non-active character tokens⟩ can play a rôle also.)
Yet an another way of dividing up ⟨control sequences⟩ is by looking at the actions triggered by them and at the point in time when these actions are triggered:
There is the analogy between TeX's input-processing and digestive processes:
- TeX input is divided up into so-called tokens (⟨control sequence tokens⟩ and ⟨character tokens⟩) in TeX's mouth.
- Expansion of expandable ⟨control sequences⟩ takes place in the gullet.
Unexpandable ⟨control sequences⟩ will be handled in the stomach.
So you distinguish expandable ⟨control sequences⟩ and unexpandable ⟨control sequences⟩.
Expandable ⟨control sequences⟩ (and their arguments) will in the gullet be replaced by other constellations of tokens. Replacement of expandable ⟨control sequences⟩ takes place in the gullet until there are no more expandable ⟨control sequences⟩ left.
Unexpandable ⟨control sequences⟩ (which came into being either during tokenization in TeX's mouth or via replacement of expandable tokens during expansion) pass the gullet and reach the stomach where boxes will be built, assignment-primitives will be carried out, etc.
There are primitive ⟨control sequences⟩ that are expandable:
E.g., the string
-primitive is an expandable primitive. Together with the following token it vanishes in TeX's gullet and as replacement you get a set of ⟨character tokens⟩ of category code 12 (other) or 10 (space) that represents the name of the ⟨control sequence token⟩ in question/is the catcode-12-pendant of the ⟨character token⟩ in question. (When an explicit space token trails string
, string
will deliver an explicit space token.)
E.g., the csname
-primitive is an expandable primitive. It triggers the processing of a csname..endcsname
-expression in TeX's gullet and it vanishes in TeX's gullet and as replacement you get the corresponding ⟨control symbol token⟩ or ⟨control word token⟩. In case that token is undefined, it will get the meaning of the relax
-primitive within the current scope.
E.g., the romannumeral
-primitive is an expandable primitive. It and a following ⟨number⟩ vanish in TeX's gullet and in case the ⟨number⟩ is positive as replacement you get a sequence of ⟨character tokens⟩ of category code 12 (other) that represent the corresponding ⟨number⟩ in lowercase-roman notation.
There are primitive ⟨control sequences⟩ that are not expandable:
E.g., the relax
-primitive is not expandable. It does pass the gullet and reach the stomach.
E.g., the assignment-primitives def
, edef
, gdef
, xdef
, countdef
etc are not expandable. They do not trigger their replacement in TeX's gullet but will reach the stomach.
There are non-primitive ⟨control sequences⟩ that are expandable:
Macros are non-primitive ⟨control sequences⟩ that are expandable. They and their arguments get replaced with their ⟨replacement text⟩ in TeX's gullet. Macros are defined in TeX's stomach in terms of the primitives def
, edef
, gdef
and xdef
. Be aware that LaTeX's newcommand
also is just a macro which "decomposes" in LaTeX's gullet to some set of tokens containing the def
-primitive.
There are non-primitive ⟨control sequences⟩ that are not expandable:
You can, e.g., do things like letbgroup={
.
bgroup
is a ⟨control sequence⟩. More specific: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is not expandable but it also does not have the meaning of one of the about 300 non-decomposable functions of TeX that are called primitives. bgroup
is an implicit character token. ;-)
Other kinds of non-primitive ⟨control sequences⟩ that are not expandable are, e.g., chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
In order to get used to the terminology, let's unravel some cases:
After carrying out the assignments
catcode`W=13 %
catcode`X=13 %
catcode`Y=13 %
catcode`Z=13 %
newcommand WThe active character stringW is a macro.
newcommandsomethingThis is a macro.
newcommand+This is a macro, too.
let X=a
let Y=relax
let Z=romannumeral
,
W
is a ⟨control sequence⟩: It is an ⟨active character token⟩. It is a macro and thus both expandable and not a primitive.X
is a ⟨control sequence⟩: It is an
⟨active character token⟩. It is not expandable. Thus it cannot be a macro. It also is not an unexpandable primitive. It has the same meaning as the catcode 11(letter) ⟨character token⟩a
. It is an implicit character token.Y
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the unexpandablerelax
-primitive. Thus it is an unexpandable primitive.Z
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the expandableromannumeral
-primitive. Thus it is an expandable primitive.something
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.+
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control symbol token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.
The term "macro" does always denote a non-primitive expandable ⟨control sequence⟩.
Thus when you know that a token is a macro, you can conclude that it cannot be a
⟨non-active character token⟩ because ⟨non-active character tokens⟩ cannot be ⟨control sequences⟩.
You can conclude that it is a ⟨control sequence⟩.
But you cannot conclude whether the ⟨control sequence⟩ is a ⟨control word token⟩ or a ⟨control symbol token⟩ or an
⟨active character token⟩.
Not all expandable ⟨control sequences⟩ are macros.
There are expandable ⟨primitives⟩ as well.
Not all non-expandable ⟨control sequences⟩ are ⟨primitives⟩.
There are non-expandable non-primitives as well:
E.g., implicit character tokens, chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
1
Sorry for asking, but just curiosity: are you writing everything from your mind?
– Sigur
Jan 5 at 0:33
I just observed you were editing a lot.
– Sigur
Jan 5 at 0:51
1
@Sigur I just thought about it, looked up the quotes and spontaneously typed into the input-field of the TeX-LaTeX-StackExchange-Website shown in my internet-browser. If I had really planned this answer systematically, I would have typed within my favorite editor-software, saved the text as a file and revised it a few times before pasting it from the editor-software to the input-field of the TeX-LaTeX-StackExchange-Website. But I didn't take it all too serious. I just wanted to have fun...;-)
– Ulrich Diez
Jan 5 at 1:10
2
Ulrich, Since TeX doesn't end in an orthographic 's', it's possessive form should be "TeX's" not just "TeX'".
– Alan Munn
Jan 5 at 17:04
2
@AlanMunn These comments might prevent others from making the same mistakes... ;-)
– Ulrich Diez
Jan 5 at 17:41
|
show 3 more comments
I seldom use the term "command" when it is about (La)TeX.
In the TeXbook the term "command" does also appear in contexts where not single tokens but the coming into being of sequences of control word tokens plays a rôle.
E.g., chapter 15 of the TeXbook includes a double dangerous bend paragraph where you can read:
[...] Therefore plain TeX provides an allocation function for registers; Appendix B includes the command
newinsertfootins
which definesfootins
as the number for footnote insertions. [...]
So, e.g., newinsertfootins
is a "command".
(By the way: The mentioned appendix B does actually not exhibit tokens. It exhibits TeX code, i.e., TeX-input which the programmer of the plain TeX format did produce. That TeX-input itself is not tokens. TeX will create tokens when processing that code.)
In the TeXbook you also find the phrase "primitive command". It is used in contexts where very short and easily to grasp sequences of TeX-input play a rôle. (Sequences that, when being read and tokenized, yield primitives...)
I suppose, a "command" according to the diction of the TeXbook is a sequence of characters which forms some portion of TeX-input. A portion of TeX-input which, when TeX reads and tokenizes that sequence of characters, yields a set of whatsoever tokens and/or (e.g., in case the command in question contains "invalid characters"/characters of category code 15) some error-messages. In case of yielding a set of tokens, that set of tokens is intended to form a syntactical entity which TeX's token-processing-apparatus can attempt to carry out/to process in whatsoever way. Such an attempt in turn may also yield error-messages and/or erroneous behavior.
Thus the term "command" in the diction of the TeXbook itself seems to actually not focus on things as results of TeX's "digestive processes". One of the earlier results of TeX's digestive processes are tokens. Tokens are produced in TeX's mouth. Thus the term "command" itself seems to actually not refer to tokens. It seems to refer to subsets of the things which TeX gets to "eat"—it refers to portions of the TeX-input, i.e., to the things which in interactive mode the user/programmer can type, or which TeX can read from .tex-input-files or read
-handles for tokenization and further processing.
In the LaTeX 2e-kernel you have things like newcommand
and renewcommand
.
These things during LaTeX's digestive processes cause LaTeX to perform assignments. But the point of view seems not to be the view of the LaTeX-program which "eats" LaTeX-input and while "digesting" produces "tokens" etc. The point of view seems to be the view of the programmer. The programmer does not produce tokens. She/he produces LaTeX-input. That LaTeX-input is not tokens yet. It is a sequence of input-characters.
After writing newcommand...
, the programmer in her/his code can take the thing introduced via newcommand
for a small portion of LaTeX-input which when tokenized by LaTeX yields a token which itself is considered a syntactical entity which, when used as intended, can be carried out via LaTeX's token-processing-apparatus without yielding erroneous behavior.
I could not find the phrase "command token" in texbook.tex.
This can also be taken for an indication that "command" and "token" in the diction of the TeXbook do in the upshot not refer to the same stage of work.
It seems, in the TeXbook the term "command" is used when the focus is on that stage where the programmer produces/delivers/types whatsoever .tex-input.
It seems, in the TeXbook the terms "token" and "macro" are used when the focus is on that stage where the TeX-program does process the .tex-input.
The term "macro" seems to refer to the concept of TeX's tokens while tokens and thus macro-tokens do not come into being while writing/producing/delivering TeX-input but while having the TeX-program read and tokenize TeX-input and from the resulting tokens expand the expandable ones.
As far as I know, macro-tokens are ⟨control sequences⟩ that both are expandable and do not have the meaning of one of those about 300 low-level atomic operations that are called "primitives".
What kinds of tokens can form such ⟨control sequences⟩?
⟨control sequences⟩ come in two flavors:
⟨control sequence tokens⟩
These divide into ⟨control symbol tokens⟩ and ⟨control word tokens⟩
⟨active character tokens⟩
With this remark we reach the question of how to divide up ⟨control sequences⟩.
As just seen, one way of dividing up ⟨control sequences⟩ is by looking at the question what kinds of tokens can form ⟨control sequences⟩:
⟨control sequence tokens⟩ are those lovely "thingies" that in TeX input need to be preceded by a character whose category code is 0 (escape).
Usually the backslash-character is the only character whose category code is 0.
⟨control sequence tokens⟩ come in two flavors:
⟨control symbol tokens⟩ have names which consist of a single character which does not have category code 11 (letter).
E.g., usually .
and \
are ⟨control symbol tokens⟩ as usually the characters .
and do not have category code 11 (letter).
a
usually is not a ⟨control symbol token⟩ as usually the character a
does have the category code 11 (letter).
⟨control word tokens⟩ have names which consist either of several characters which do not need to have the same category code, or of a single character which does have the category code 11 (letter).
Any ⟨control sequence⟩ (i.e., both ⟨control sequence tokens⟩ and ⟨active character tokens⟩, see below) can come into being
- when (La)TeX reads and tokenizes input.
- when (La)TeX inserts it during expansion; e.g., as part of the ⟨replacement text⟩ of a macro; e.g., as part of the
the
-expansion of a token-register.
A ⟨control sequence token⟩ can come into being also
- as the result of expanding some
csname..endcsname
-expression.
Any character whose current category code is not 11 (letter), even a character of category code 15 (invalid), can form the name of a ⟨control symbol token⟩ that came into being as a result of reading and tokenizing TeX-input.
(I say "current" because as soon as the character that forms the name of the ⟨control sequence token⟩ in question is switched to 11 (letter), the ⟨control sequence token⟩ in question will not be treated as a ⟨control symbol token⟩ (any more) but as a ⟨control word token⟩.)
Otherwise you could, e.g., not use ⟨control symbol tokens⟩ for changing the category codes of characters that currently have category code 15 (invalid).
But you can do, e.g.,:
catcode`A=15 % Now uppercase-a is invalid.
% Typing an "A" both outside a comment and not behind a character
% of category code 5 (end of line) which is not
% intended to form the name of a control-symbol-token would yield
% an error-message now.
% But you can do:
catcode`A=11 % Now uppercase-a is a letter again.
After elaborating on the names of ⟨control symbol tokens⟩, let's elaborate on the names of ⟨control word tokens⟩:
On the one hand, TeX's reading- and tokenizing-apparatus will only take sequences of characters of category code 11 (letter) as names of ⟨control word tokens⟩.
Thus names of ⟨control word tokens⟩ created via having TeX read and tokenize them directly from TeX-input consist only of characters that had category code 11 (letter) at the time of reading and tokenizeng them.
On the other hand names of ⟨control word tokens⟩ created via evaluating csname..endcsname
-expressions can consist either of more than one character of whatsoever category code differing from 5, 9, 14 and 15 at the time of reading and tokenizing the csname..endcsname
-expression , or of a single character of category code 11 (letter).
A character of category code 10 (space)—usually the space character (ASCII 32) is the only character with that category code — that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control symbol token⟩ will not be ignored but will be tokenized as an explicit space token.
The only exception to that rule is with such characters that follow the control-space, , i.e., that ⟨control symbol token⟩ whose name is formed by the space character. These will be ignored, too.
A character of category code 10 (space) that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control word token⟩ will be ignored/will not yield any token.
When (La)TeX does unexpanded-write a ⟨control word token⟩ (be it writing to file, be it writing to the screen), it automatically inserts a space character at the end of the character sequence that represents the ⟨control word token⟩ in question.
When (La)TeX does unexpanded-write a ⟨control symbol token⟩ it will not automatically insert such a trailing space character.
When applying string
for transforming the following token into a sequence of ⟨character tokens⟩ (either character tokens of category code 12 (other) and/or character tokens of category code 10 (space) and character code 32—32 encodes the space-character in ASCII—so called explicit space tokens—(La)TeX will not attach an additional trailing space token.
You can change the way in which TeX does unexpanded-write a ⟨control sequence token⟩ whose name consists of a single character by before writing changing the category code of that character either to 11 (letter)—in this case it will be written as a ⟨control word token⟩ with trailing space— or to a value differing from 11—in this case it will be written as a ⟨control symbol token⟩ without trailing space.
Thus you can have fun with TeX's automatic insertion of spaces when doing delayed writes with unexpanded ⟨control sequence tokens⟩ whose names consist of a single character—the code
documentclassarticle
begindocument
newwritemywrite
immediateopenoutmywrite test.txtrelax
catcode`A=12
writemywritebnoexpandAb
immediatewritemywritebnoexpandAb
catcode`A=11
Hello
enddocument
besides the .dvi- or .pdf-file yields a file test.txt with the following content:
bAb
bA b
As both write
-commands were issued while A
was of category code 12 (other), you might have expected each A
to be unexpanded-written as ⟨control symbol token⟩ without a trailing space, and thus you might have expected the output
bAb
bAb
. But the first write
was not immediate
and thus it got carried out not immediately but at the time of shipping out the page. At that time, the second write
, which was immediate
, was already done and A
had category code 11 (letter) again and thus the A
in the first write
was written within the second line as a ⟨control word token⟩, with LaTeX attaching a trailing space character.
The nameless ⟨control sequence token⟩—methods for forming it are: 1) expanding csnameendcsname
; 2) placing a backslash (a character of category code 0 (escape)) at the end of a line while the parameter endlinechar
has a non-positive value or a value outside the range of code-points of the TeX-engine's input-encoding—deserves special attention:
Applying string
to it always yields the catcode 12(other) character token sequence csnameendcsname
. There is no space token between csname
and endcsname
. There also is no trailing space token.
Unexpanded-writing it yields the character sequence csnameendcsname␣
—there is a trailing space character.
Besides the ⟨control sequence tokens⟩ there is another kind of ⟨control sequences⟩: ⟨active character tokens⟩.
An ⟨active character token⟩ is a ⟨character token⟩ whose category code is 13 (active).
An ⟨active character token⟩ can be used like a ⟨control sequence token⟩.
E.g., after catcode`b=13
you can do def bThe active character string b is a macro now.
Another way of dividing up ⟨control sequences⟩ is by looking at their "decomposability":
The TeXBook says:
About 300 of TeX's control sequences are called primitive; these are
the low-level atomic operations that are not decomposable into simpler
functions. All other control sequences are defined, ultimately, in
terms of the primitive ones. For example, input is a primitive
operation, but ’ and " are not; the latter are defined in terms of
an accent primitive.
(It should not be overlooked that when defining ⟨control sequences⟩ that are macros, ⟨non-active character tokens⟩ can play a rôle also.)
Yet an another way of dividing up ⟨control sequences⟩ is by looking at the actions triggered by them and at the point in time when these actions are triggered:
There is the analogy between TeX's input-processing and digestive processes:
- TeX input is divided up into so-called tokens (⟨control sequence tokens⟩ and ⟨character tokens⟩) in TeX's mouth.
- Expansion of expandable ⟨control sequences⟩ takes place in the gullet.
Unexpandable ⟨control sequences⟩ will be handled in the stomach.
So you distinguish expandable ⟨control sequences⟩ and unexpandable ⟨control sequences⟩.
Expandable ⟨control sequences⟩ (and their arguments) will in the gullet be replaced by other constellations of tokens. Replacement of expandable ⟨control sequences⟩ takes place in the gullet until there are no more expandable ⟨control sequences⟩ left.
Unexpandable ⟨control sequences⟩ (which came into being either during tokenization in TeX's mouth or via replacement of expandable tokens during expansion) pass the gullet and reach the stomach where boxes will be built, assignment-primitives will be carried out, etc.
There are primitive ⟨control sequences⟩ that are expandable:
E.g., the string
-primitive is an expandable primitive. Together with the following token it vanishes in TeX's gullet and as replacement you get a set of ⟨character tokens⟩ of category code 12 (other) or 10 (space) that represents the name of the ⟨control sequence token⟩ in question/is the catcode-12-pendant of the ⟨character token⟩ in question. (When an explicit space token trails string
, string
will deliver an explicit space token.)
E.g., the csname
-primitive is an expandable primitive. It triggers the processing of a csname..endcsname
-expression in TeX's gullet and it vanishes in TeX's gullet and as replacement you get the corresponding ⟨control symbol token⟩ or ⟨control word token⟩. In case that token is undefined, it will get the meaning of the relax
-primitive within the current scope.
E.g., the romannumeral
-primitive is an expandable primitive. It and a following ⟨number⟩ vanish in TeX's gullet and in case the ⟨number⟩ is positive as replacement you get a sequence of ⟨character tokens⟩ of category code 12 (other) that represent the corresponding ⟨number⟩ in lowercase-roman notation.
There are primitive ⟨control sequences⟩ that are not expandable:
E.g., the relax
-primitive is not expandable. It does pass the gullet and reach the stomach.
E.g., the assignment-primitives def
, edef
, gdef
, xdef
, countdef
etc are not expandable. They do not trigger their replacement in TeX's gullet but will reach the stomach.
There are non-primitive ⟨control sequences⟩ that are expandable:
Macros are non-primitive ⟨control sequences⟩ that are expandable. They and their arguments get replaced with their ⟨replacement text⟩ in TeX's gullet. Macros are defined in TeX's stomach in terms of the primitives def
, edef
, gdef
and xdef
. Be aware that LaTeX's newcommand
also is just a macro which "decomposes" in LaTeX's gullet to some set of tokens containing the def
-primitive.
There are non-primitive ⟨control sequences⟩ that are not expandable:
You can, e.g., do things like letbgroup={
.
bgroup
is a ⟨control sequence⟩. More specific: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is not expandable but it also does not have the meaning of one of the about 300 non-decomposable functions of TeX that are called primitives. bgroup
is an implicit character token. ;-)
Other kinds of non-primitive ⟨control sequences⟩ that are not expandable are, e.g., chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
In order to get used to the terminology, let's unravel some cases:
After carrying out the assignments
catcode`W=13 %
catcode`X=13 %
catcode`Y=13 %
catcode`Z=13 %
newcommand WThe active character stringW is a macro.
newcommandsomethingThis is a macro.
newcommand+This is a macro, too.
let X=a
let Y=relax
let Z=romannumeral
,
W
is a ⟨control sequence⟩: It is an ⟨active character token⟩. It is a macro and thus both expandable and not a primitive.X
is a ⟨control sequence⟩: It is an
⟨active character token⟩. It is not expandable. Thus it cannot be a macro. It also is not an unexpandable primitive. It has the same meaning as the catcode 11(letter) ⟨character token⟩a
. It is an implicit character token.Y
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the unexpandablerelax
-primitive. Thus it is an unexpandable primitive.Z
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the expandableromannumeral
-primitive. Thus it is an expandable primitive.something
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.+
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control symbol token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.
The term "macro" does always denote a non-primitive expandable ⟨control sequence⟩.
Thus when you know that a token is a macro, you can conclude that it cannot be a
⟨non-active character token⟩ because ⟨non-active character tokens⟩ cannot be ⟨control sequences⟩.
You can conclude that it is a ⟨control sequence⟩.
But you cannot conclude whether the ⟨control sequence⟩ is a ⟨control word token⟩ or a ⟨control symbol token⟩ or an
⟨active character token⟩.
Not all expandable ⟨control sequences⟩ are macros.
There are expandable ⟨primitives⟩ as well.
Not all non-expandable ⟨control sequences⟩ are ⟨primitives⟩.
There are non-expandable non-primitives as well:
E.g., implicit character tokens, chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
1
Sorry for asking, but just curiosity: are you writing everything from your mind?
– Sigur
Jan 5 at 0:33
I just observed you were editing a lot.
– Sigur
Jan 5 at 0:51
1
@Sigur I just thought about it, looked up the quotes and spontaneously typed into the input-field of the TeX-LaTeX-StackExchange-Website shown in my internet-browser. If I had really planned this answer systematically, I would have typed within my favorite editor-software, saved the text as a file and revised it a few times before pasting it from the editor-software to the input-field of the TeX-LaTeX-StackExchange-Website. But I didn't take it all too serious. I just wanted to have fun...;-)
– Ulrich Diez
Jan 5 at 1:10
2
Ulrich, Since TeX doesn't end in an orthographic 's', it's possessive form should be "TeX's" not just "TeX'".
– Alan Munn
Jan 5 at 17:04
2
@AlanMunn These comments might prevent others from making the same mistakes... ;-)
– Ulrich Diez
Jan 5 at 17:41
|
show 3 more comments
I seldom use the term "command" when it is about (La)TeX.
In the TeXbook the term "command" does also appear in contexts where not single tokens but the coming into being of sequences of control word tokens plays a rôle.
E.g., chapter 15 of the TeXbook includes a double dangerous bend paragraph where you can read:
[...] Therefore plain TeX provides an allocation function for registers; Appendix B includes the command
newinsertfootins
which definesfootins
as the number for footnote insertions. [...]
So, e.g., newinsertfootins
is a "command".
(By the way: The mentioned appendix B does actually not exhibit tokens. It exhibits TeX code, i.e., TeX-input which the programmer of the plain TeX format did produce. That TeX-input itself is not tokens. TeX will create tokens when processing that code.)
In the TeXbook you also find the phrase "primitive command". It is used in contexts where very short and easily to grasp sequences of TeX-input play a rôle. (Sequences that, when being read and tokenized, yield primitives...)
I suppose, a "command" according to the diction of the TeXbook is a sequence of characters which forms some portion of TeX-input. A portion of TeX-input which, when TeX reads and tokenizes that sequence of characters, yields a set of whatsoever tokens and/or (e.g., in case the command in question contains "invalid characters"/characters of category code 15) some error-messages. In case of yielding a set of tokens, that set of tokens is intended to form a syntactical entity which TeX's token-processing-apparatus can attempt to carry out/to process in whatsoever way. Such an attempt in turn may also yield error-messages and/or erroneous behavior.
Thus the term "command" in the diction of the TeXbook itself seems to actually not focus on things as results of TeX's "digestive processes". One of the earlier results of TeX's digestive processes are tokens. Tokens are produced in TeX's mouth. Thus the term "command" itself seems to actually not refer to tokens. It seems to refer to subsets of the things which TeX gets to "eat"—it refers to portions of the TeX-input, i.e., to the things which in interactive mode the user/programmer can type, or which TeX can read from .tex-input-files or read
-handles for tokenization and further processing.
In the LaTeX 2e-kernel you have things like newcommand
and renewcommand
.
These things during LaTeX's digestive processes cause LaTeX to perform assignments. But the point of view seems not to be the view of the LaTeX-program which "eats" LaTeX-input and while "digesting" produces "tokens" etc. The point of view seems to be the view of the programmer. The programmer does not produce tokens. She/he produces LaTeX-input. That LaTeX-input is not tokens yet. It is a sequence of input-characters.
After writing newcommand...
, the programmer in her/his code can take the thing introduced via newcommand
for a small portion of LaTeX-input which when tokenized by LaTeX yields a token which itself is considered a syntactical entity which, when used as intended, can be carried out via LaTeX's token-processing-apparatus without yielding erroneous behavior.
I could not find the phrase "command token" in texbook.tex.
This can also be taken for an indication that "command" and "token" in the diction of the TeXbook do in the upshot not refer to the same stage of work.
It seems, in the TeXbook the term "command" is used when the focus is on that stage where the programmer produces/delivers/types whatsoever .tex-input.
It seems, in the TeXbook the terms "token" and "macro" are used when the focus is on that stage where the TeX-program does process the .tex-input.
The term "macro" seems to refer to the concept of TeX's tokens while tokens and thus macro-tokens do not come into being while writing/producing/delivering TeX-input but while having the TeX-program read and tokenize TeX-input and from the resulting tokens expand the expandable ones.
As far as I know, macro-tokens are ⟨control sequences⟩ that both are expandable and do not have the meaning of one of those about 300 low-level atomic operations that are called "primitives".
What kinds of tokens can form such ⟨control sequences⟩?
⟨control sequences⟩ come in two flavors:
⟨control sequence tokens⟩
These divide into ⟨control symbol tokens⟩ and ⟨control word tokens⟩
⟨active character tokens⟩
With this remark we reach the question of how to divide up ⟨control sequences⟩.
As just seen, one way of dividing up ⟨control sequences⟩ is by looking at the question what kinds of tokens can form ⟨control sequences⟩:
⟨control sequence tokens⟩ are those lovely "thingies" that in TeX input need to be preceded by a character whose category code is 0 (escape).
Usually the backslash-character is the only character whose category code is 0.
⟨control sequence tokens⟩ come in two flavors:
⟨control symbol tokens⟩ have names which consist of a single character which does not have category code 11 (letter).
E.g., usually .
and \
are ⟨control symbol tokens⟩ as usually the characters .
and do not have category code 11 (letter).
a
usually is not a ⟨control symbol token⟩ as usually the character a
does have the category code 11 (letter).
⟨control word tokens⟩ have names which consist either of several characters which do not need to have the same category code, or of a single character which does have the category code 11 (letter).
Any ⟨control sequence⟩ (i.e., both ⟨control sequence tokens⟩ and ⟨active character tokens⟩, see below) can come into being
- when (La)TeX reads and tokenizes input.
- when (La)TeX inserts it during expansion; e.g., as part of the ⟨replacement text⟩ of a macro; e.g., as part of the
the
-expansion of a token-register.
A ⟨control sequence token⟩ can come into being also
- as the result of expanding some
csname..endcsname
-expression.
Any character whose current category code is not 11 (letter), even a character of category code 15 (invalid), can form the name of a ⟨control symbol token⟩ that came into being as a result of reading and tokenizing TeX-input.
(I say "current" because as soon as the character that forms the name of the ⟨control sequence token⟩ in question is switched to 11 (letter), the ⟨control sequence token⟩ in question will not be treated as a ⟨control symbol token⟩ (any more) but as a ⟨control word token⟩.)
Otherwise you could, e.g., not use ⟨control symbol tokens⟩ for changing the category codes of characters that currently have category code 15 (invalid).
But you can do, e.g.,:
catcode`A=15 % Now uppercase-a is invalid.
% Typing an "A" both outside a comment and not behind a character
% of category code 5 (end of line) which is not
% intended to form the name of a control-symbol-token would yield
% an error-message now.
% But you can do:
catcode`A=11 % Now uppercase-a is a letter again.
After elaborating on the names of ⟨control symbol tokens⟩, let's elaborate on the names of ⟨control word tokens⟩:
On the one hand, TeX's reading- and tokenizing-apparatus will only take sequences of characters of category code 11 (letter) as names of ⟨control word tokens⟩.
Thus names of ⟨control word tokens⟩ created via having TeX read and tokenize them directly from TeX-input consist only of characters that had category code 11 (letter) at the time of reading and tokenizeng them.
On the other hand names of ⟨control word tokens⟩ created via evaluating csname..endcsname
-expressions can consist either of more than one character of whatsoever category code differing from 5, 9, 14 and 15 at the time of reading and tokenizing the csname..endcsname
-expression , or of a single character of category code 11 (letter).
A character of category code 10 (space)—usually the space character (ASCII 32) is the only character with that category code — that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control symbol token⟩ will not be ignored but will be tokenized as an explicit space token.
The only exception to that rule is with such characters that follow the control-space, , i.e., that ⟨control symbol token⟩ whose name is formed by the space character. These will be ignored, too.
A character of category code 10 (space) that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control word token⟩ will be ignored/will not yield any token.
When (La)TeX does unexpanded-write a ⟨control word token⟩ (be it writing to file, be it writing to the screen), it automatically inserts a space character at the end of the character sequence that represents the ⟨control word token⟩ in question.
When (La)TeX does unexpanded-write a ⟨control symbol token⟩ it will not automatically insert such a trailing space character.
When applying string
for transforming the following token into a sequence of ⟨character tokens⟩ (either character tokens of category code 12 (other) and/or character tokens of category code 10 (space) and character code 32—32 encodes the space-character in ASCII—so called explicit space tokens—(La)TeX will not attach an additional trailing space token.
You can change the way in which TeX does unexpanded-write a ⟨control sequence token⟩ whose name consists of a single character by before writing changing the category code of that character either to 11 (letter)—in this case it will be written as a ⟨control word token⟩ with trailing space— or to a value differing from 11—in this case it will be written as a ⟨control symbol token⟩ without trailing space.
Thus you can have fun with TeX's automatic insertion of spaces when doing delayed writes with unexpanded ⟨control sequence tokens⟩ whose names consist of a single character—the code
documentclassarticle
begindocument
newwritemywrite
immediateopenoutmywrite test.txtrelax
catcode`A=12
writemywritebnoexpandAb
immediatewritemywritebnoexpandAb
catcode`A=11
Hello
enddocument
besides the .dvi- or .pdf-file yields a file test.txt with the following content:
bAb
bA b
As both write
-commands were issued while A
was of category code 12 (other), you might have expected each A
to be unexpanded-written as ⟨control symbol token⟩ without a trailing space, and thus you might have expected the output
bAb
bAb
. But the first write
was not immediate
and thus it got carried out not immediately but at the time of shipping out the page. At that time, the second write
, which was immediate
, was already done and A
had category code 11 (letter) again and thus the A
in the first write
was written within the second line as a ⟨control word token⟩, with LaTeX attaching a trailing space character.
The nameless ⟨control sequence token⟩—methods for forming it are: 1) expanding csnameendcsname
; 2) placing a backslash (a character of category code 0 (escape)) at the end of a line while the parameter endlinechar
has a non-positive value or a value outside the range of code-points of the TeX-engine's input-encoding—deserves special attention:
Applying string
to it always yields the catcode 12(other) character token sequence csnameendcsname
. There is no space token between csname
and endcsname
. There also is no trailing space token.
Unexpanded-writing it yields the character sequence csnameendcsname␣
—there is a trailing space character.
Besides the ⟨control sequence tokens⟩ there is another kind of ⟨control sequences⟩: ⟨active character tokens⟩.
An ⟨active character token⟩ is a ⟨character token⟩ whose category code is 13 (active).
An ⟨active character token⟩ can be used like a ⟨control sequence token⟩.
E.g., after catcode`b=13
you can do def bThe active character string b is a macro now.
Another way of dividing up ⟨control sequences⟩ is by looking at their "decomposability":
The TeXBook says:
About 300 of TeX's control sequences are called primitive; these are
the low-level atomic operations that are not decomposable into simpler
functions. All other control sequences are defined, ultimately, in
terms of the primitive ones. For example, input is a primitive
operation, but ’ and " are not; the latter are defined in terms of
an accent primitive.
(It should not be overlooked that when defining ⟨control sequences⟩ that are macros, ⟨non-active character tokens⟩ can play a rôle also.)
Yet an another way of dividing up ⟨control sequences⟩ is by looking at the actions triggered by them and at the point in time when these actions are triggered:
There is the analogy between TeX's input-processing and digestive processes:
- TeX input is divided up into so-called tokens (⟨control sequence tokens⟩ and ⟨character tokens⟩) in TeX's mouth.
- Expansion of expandable ⟨control sequences⟩ takes place in the gullet.
Unexpandable ⟨control sequences⟩ will be handled in the stomach.
So you distinguish expandable ⟨control sequences⟩ and unexpandable ⟨control sequences⟩.
Expandable ⟨control sequences⟩ (and their arguments) will in the gullet be replaced by other constellations of tokens. Replacement of expandable ⟨control sequences⟩ takes place in the gullet until there are no more expandable ⟨control sequences⟩ left.
Unexpandable ⟨control sequences⟩ (which came into being either during tokenization in TeX's mouth or via replacement of expandable tokens during expansion) pass the gullet and reach the stomach where boxes will be built, assignment-primitives will be carried out, etc.
There are primitive ⟨control sequences⟩ that are expandable:
E.g., the string
-primitive is an expandable primitive. Together with the following token it vanishes in TeX's gullet and as replacement you get a set of ⟨character tokens⟩ of category code 12 (other) or 10 (space) that represents the name of the ⟨control sequence token⟩ in question/is the catcode-12-pendant of the ⟨character token⟩ in question. (When an explicit space token trails string
, string
will deliver an explicit space token.)
E.g., the csname
-primitive is an expandable primitive. It triggers the processing of a csname..endcsname
-expression in TeX's gullet and it vanishes in TeX's gullet and as replacement you get the corresponding ⟨control symbol token⟩ or ⟨control word token⟩. In case that token is undefined, it will get the meaning of the relax
-primitive within the current scope.
E.g., the romannumeral
-primitive is an expandable primitive. It and a following ⟨number⟩ vanish in TeX's gullet and in case the ⟨number⟩ is positive as replacement you get a sequence of ⟨character tokens⟩ of category code 12 (other) that represent the corresponding ⟨number⟩ in lowercase-roman notation.
There are primitive ⟨control sequences⟩ that are not expandable:
E.g., the relax
-primitive is not expandable. It does pass the gullet and reach the stomach.
E.g., the assignment-primitives def
, edef
, gdef
, xdef
, countdef
etc are not expandable. They do not trigger their replacement in TeX's gullet but will reach the stomach.
There are non-primitive ⟨control sequences⟩ that are expandable:
Macros are non-primitive ⟨control sequences⟩ that are expandable. They and their arguments get replaced with their ⟨replacement text⟩ in TeX's gullet. Macros are defined in TeX's stomach in terms of the primitives def
, edef
, gdef
and xdef
. Be aware that LaTeX's newcommand
also is just a macro which "decomposes" in LaTeX's gullet to some set of tokens containing the def
-primitive.
There are non-primitive ⟨control sequences⟩ that are not expandable:
You can, e.g., do things like letbgroup={
.
bgroup
is a ⟨control sequence⟩. More specific: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is not expandable but it also does not have the meaning of one of the about 300 non-decomposable functions of TeX that are called primitives. bgroup
is an implicit character token. ;-)
Other kinds of non-primitive ⟨control sequences⟩ that are not expandable are, e.g., chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
In order to get used to the terminology, let's unravel some cases:
After carrying out the assignments
catcode`W=13 %
catcode`X=13 %
catcode`Y=13 %
catcode`Z=13 %
newcommand WThe active character stringW is a macro.
newcommandsomethingThis is a macro.
newcommand+This is a macro, too.
let X=a
let Y=relax
let Z=romannumeral
,
W
is a ⟨control sequence⟩: It is an ⟨active character token⟩. It is a macro and thus both expandable and not a primitive.X
is a ⟨control sequence⟩: It is an
⟨active character token⟩. It is not expandable. Thus it cannot be a macro. It also is not an unexpandable primitive. It has the same meaning as the catcode 11(letter) ⟨character token⟩a
. It is an implicit character token.Y
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the unexpandablerelax
-primitive. Thus it is an unexpandable primitive.Z
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the expandableromannumeral
-primitive. Thus it is an expandable primitive.something
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.+
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control symbol token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.
The term "macro" does always denote a non-primitive expandable ⟨control sequence⟩.
Thus when you know that a token is a macro, you can conclude that it cannot be a
⟨non-active character token⟩ because ⟨non-active character tokens⟩ cannot be ⟨control sequences⟩.
You can conclude that it is a ⟨control sequence⟩.
But you cannot conclude whether the ⟨control sequence⟩ is a ⟨control word token⟩ or a ⟨control symbol token⟩ or an
⟨active character token⟩.
Not all expandable ⟨control sequences⟩ are macros.
There are expandable ⟨primitives⟩ as well.
Not all non-expandable ⟨control sequences⟩ are ⟨primitives⟩.
There are non-expandable non-primitives as well:
E.g., implicit character tokens, chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
I seldom use the term "command" when it is about (La)TeX.
In the TeXbook the term "command" does also appear in contexts where not single tokens but the coming into being of sequences of control word tokens plays a rôle.
E.g., chapter 15 of the TeXbook includes a double dangerous bend paragraph where you can read:
[...] Therefore plain TeX provides an allocation function for registers; Appendix B includes the command
newinsertfootins
which definesfootins
as the number for footnote insertions. [...]
So, e.g., newinsertfootins
is a "command".
(By the way: The mentioned appendix B does actually not exhibit tokens. It exhibits TeX code, i.e., TeX-input which the programmer of the plain TeX format did produce. That TeX-input itself is not tokens. TeX will create tokens when processing that code.)
In the TeXbook you also find the phrase "primitive command". It is used in contexts where very short and easily to grasp sequences of TeX-input play a rôle. (Sequences that, when being read and tokenized, yield primitives...)
I suppose, a "command" according to the diction of the TeXbook is a sequence of characters which forms some portion of TeX-input. A portion of TeX-input which, when TeX reads and tokenizes that sequence of characters, yields a set of whatsoever tokens and/or (e.g., in case the command in question contains "invalid characters"/characters of category code 15) some error-messages. In case of yielding a set of tokens, that set of tokens is intended to form a syntactical entity which TeX's token-processing-apparatus can attempt to carry out/to process in whatsoever way. Such an attempt in turn may also yield error-messages and/or erroneous behavior.
Thus the term "command" in the diction of the TeXbook itself seems to actually not focus on things as results of TeX's "digestive processes". One of the earlier results of TeX's digestive processes are tokens. Tokens are produced in TeX's mouth. Thus the term "command" itself seems to actually not refer to tokens. It seems to refer to subsets of the things which TeX gets to "eat"—it refers to portions of the TeX-input, i.e., to the things which in interactive mode the user/programmer can type, or which TeX can read from .tex-input-files or read
-handles for tokenization and further processing.
In the LaTeX 2e-kernel you have things like newcommand
and renewcommand
.
These things during LaTeX's digestive processes cause LaTeX to perform assignments. But the point of view seems not to be the view of the LaTeX-program which "eats" LaTeX-input and while "digesting" produces "tokens" etc. The point of view seems to be the view of the programmer. The programmer does not produce tokens. She/he produces LaTeX-input. That LaTeX-input is not tokens yet. It is a sequence of input-characters.
After writing newcommand...
, the programmer in her/his code can take the thing introduced via newcommand
for a small portion of LaTeX-input which when tokenized by LaTeX yields a token which itself is considered a syntactical entity which, when used as intended, can be carried out via LaTeX's token-processing-apparatus without yielding erroneous behavior.
I could not find the phrase "command token" in texbook.tex.
This can also be taken for an indication that "command" and "token" in the diction of the TeXbook do in the upshot not refer to the same stage of work.
It seems, in the TeXbook the term "command" is used when the focus is on that stage where the programmer produces/delivers/types whatsoever .tex-input.
It seems, in the TeXbook the terms "token" and "macro" are used when the focus is on that stage where the TeX-program does process the .tex-input.
The term "macro" seems to refer to the concept of TeX's tokens while tokens and thus macro-tokens do not come into being while writing/producing/delivering TeX-input but while having the TeX-program read and tokenize TeX-input and from the resulting tokens expand the expandable ones.
As far as I know, macro-tokens are ⟨control sequences⟩ that both are expandable and do not have the meaning of one of those about 300 low-level atomic operations that are called "primitives".
What kinds of tokens can form such ⟨control sequences⟩?
⟨control sequences⟩ come in two flavors:
⟨control sequence tokens⟩
These divide into ⟨control symbol tokens⟩ and ⟨control word tokens⟩
⟨active character tokens⟩
With this remark we reach the question of how to divide up ⟨control sequences⟩.
As just seen, one way of dividing up ⟨control sequences⟩ is by looking at the question what kinds of tokens can form ⟨control sequences⟩:
⟨control sequence tokens⟩ are those lovely "thingies" that in TeX input need to be preceded by a character whose category code is 0 (escape).
Usually the backslash-character is the only character whose category code is 0.
⟨control sequence tokens⟩ come in two flavors:
⟨control symbol tokens⟩ have names which consist of a single character which does not have category code 11 (letter).
E.g., usually .
and \
are ⟨control symbol tokens⟩ as usually the characters .
and do not have category code 11 (letter).
a
usually is not a ⟨control symbol token⟩ as usually the character a
does have the category code 11 (letter).
⟨control word tokens⟩ have names which consist either of several characters which do not need to have the same category code, or of a single character which does have the category code 11 (letter).
Any ⟨control sequence⟩ (i.e., both ⟨control sequence tokens⟩ and ⟨active character tokens⟩, see below) can come into being
- when (La)TeX reads and tokenizes input.
- when (La)TeX inserts it during expansion; e.g., as part of the ⟨replacement text⟩ of a macro; e.g., as part of the
the
-expansion of a token-register.
A ⟨control sequence token⟩ can come into being also
- as the result of expanding some
csname..endcsname
-expression.
Any character whose current category code is not 11 (letter), even a character of category code 15 (invalid), can form the name of a ⟨control symbol token⟩ that came into being as a result of reading and tokenizing TeX-input.
(I say "current" because as soon as the character that forms the name of the ⟨control sequence token⟩ in question is switched to 11 (letter), the ⟨control sequence token⟩ in question will not be treated as a ⟨control symbol token⟩ (any more) but as a ⟨control word token⟩.)
Otherwise you could, e.g., not use ⟨control symbol tokens⟩ for changing the category codes of characters that currently have category code 15 (invalid).
But you can do, e.g.,:
catcode`A=15 % Now uppercase-a is invalid.
% Typing an "A" both outside a comment and not behind a character
% of category code 5 (end of line) which is not
% intended to form the name of a control-symbol-token would yield
% an error-message now.
% But you can do:
catcode`A=11 % Now uppercase-a is a letter again.
After elaborating on the names of ⟨control symbol tokens⟩, let's elaborate on the names of ⟨control word tokens⟩:
On the one hand, TeX's reading- and tokenizing-apparatus will only take sequences of characters of category code 11 (letter) as names of ⟨control word tokens⟩.
Thus names of ⟨control word tokens⟩ created via having TeX read and tokenize them directly from TeX-input consist only of characters that had category code 11 (letter) at the time of reading and tokenizeng them.
On the other hand names of ⟨control word tokens⟩ created via evaluating csname..endcsname
-expressions can consist either of more than one character of whatsoever category code differing from 5, 9, 14 and 15 at the time of reading and tokenizing the csname..endcsname
-expression , or of a single character of category code 11 (letter).
A character of category code 10 (space)—usually the space character (ASCII 32) is the only character with that category code — that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control symbol token⟩ will not be ignored but will be tokenized as an explicit space token.
The only exception to that rule is with such characters that follow the control-space, , i.e., that ⟨control symbol token⟩ whose name is formed by the space character. These will be ignored, too.
A character of category code 10 (space) that in the TeX-input occurs behind a character-sequence that got tokenized as ⟨control word token⟩ will be ignored/will not yield any token.
When (La)TeX does unexpanded-write a ⟨control word token⟩ (be it writing to file, be it writing to the screen), it automatically inserts a space character at the end of the character sequence that represents the ⟨control word token⟩ in question.
When (La)TeX does unexpanded-write a ⟨control symbol token⟩ it will not automatically insert such a trailing space character.
When applying string
for transforming the following token into a sequence of ⟨character tokens⟩ (either character tokens of category code 12 (other) and/or character tokens of category code 10 (space) and character code 32—32 encodes the space-character in ASCII—so called explicit space tokens—(La)TeX will not attach an additional trailing space token.
You can change the way in which TeX does unexpanded-write a ⟨control sequence token⟩ whose name consists of a single character by before writing changing the category code of that character either to 11 (letter)—in this case it will be written as a ⟨control word token⟩ with trailing space— or to a value differing from 11—in this case it will be written as a ⟨control symbol token⟩ without trailing space.
Thus you can have fun with TeX's automatic insertion of spaces when doing delayed writes with unexpanded ⟨control sequence tokens⟩ whose names consist of a single character—the code
documentclassarticle
begindocument
newwritemywrite
immediateopenoutmywrite test.txtrelax
catcode`A=12
writemywritebnoexpandAb
immediatewritemywritebnoexpandAb
catcode`A=11
Hello
enddocument
besides the .dvi- or .pdf-file yields a file test.txt with the following content:
bAb
bA b
As both write
-commands were issued while A
was of category code 12 (other), you might have expected each A
to be unexpanded-written as ⟨control symbol token⟩ without a trailing space, and thus you might have expected the output
bAb
bAb
. But the first write
was not immediate
and thus it got carried out not immediately but at the time of shipping out the page. At that time, the second write
, which was immediate
, was already done and A
had category code 11 (letter) again and thus the A
in the first write
was written within the second line as a ⟨control word token⟩, with LaTeX attaching a trailing space character.
The nameless ⟨control sequence token⟩—methods for forming it are: 1) expanding csnameendcsname
; 2) placing a backslash (a character of category code 0 (escape)) at the end of a line while the parameter endlinechar
has a non-positive value or a value outside the range of code-points of the TeX-engine's input-encoding—deserves special attention:
Applying string
to it always yields the catcode 12(other) character token sequence csnameendcsname
. There is no space token between csname
and endcsname
. There also is no trailing space token.
Unexpanded-writing it yields the character sequence csnameendcsname␣
—there is a trailing space character.
Besides the ⟨control sequence tokens⟩ there is another kind of ⟨control sequences⟩: ⟨active character tokens⟩.
An ⟨active character token⟩ is a ⟨character token⟩ whose category code is 13 (active).
An ⟨active character token⟩ can be used like a ⟨control sequence token⟩.
E.g., after catcode`b=13
you can do def bThe active character string b is a macro now.
Another way of dividing up ⟨control sequences⟩ is by looking at their "decomposability":
The TeXBook says:
About 300 of TeX's control sequences are called primitive; these are
the low-level atomic operations that are not decomposable into simpler
functions. All other control sequences are defined, ultimately, in
terms of the primitive ones. For example, input is a primitive
operation, but ’ and " are not; the latter are defined in terms of
an accent primitive.
(It should not be overlooked that when defining ⟨control sequences⟩ that are macros, ⟨non-active character tokens⟩ can play a rôle also.)
Yet an another way of dividing up ⟨control sequences⟩ is by looking at the actions triggered by them and at the point in time when these actions are triggered:
There is the analogy between TeX's input-processing and digestive processes:
- TeX input is divided up into so-called tokens (⟨control sequence tokens⟩ and ⟨character tokens⟩) in TeX's mouth.
- Expansion of expandable ⟨control sequences⟩ takes place in the gullet.
Unexpandable ⟨control sequences⟩ will be handled in the stomach.
So you distinguish expandable ⟨control sequences⟩ and unexpandable ⟨control sequences⟩.
Expandable ⟨control sequences⟩ (and their arguments) will in the gullet be replaced by other constellations of tokens. Replacement of expandable ⟨control sequences⟩ takes place in the gullet until there are no more expandable ⟨control sequences⟩ left.
Unexpandable ⟨control sequences⟩ (which came into being either during tokenization in TeX's mouth or via replacement of expandable tokens during expansion) pass the gullet and reach the stomach where boxes will be built, assignment-primitives will be carried out, etc.
There are primitive ⟨control sequences⟩ that are expandable:
E.g., the string
-primitive is an expandable primitive. Together with the following token it vanishes in TeX's gullet and as replacement you get a set of ⟨character tokens⟩ of category code 12 (other) or 10 (space) that represents the name of the ⟨control sequence token⟩ in question/is the catcode-12-pendant of the ⟨character token⟩ in question. (When an explicit space token trails string
, string
will deliver an explicit space token.)
E.g., the csname
-primitive is an expandable primitive. It triggers the processing of a csname..endcsname
-expression in TeX's gullet and it vanishes in TeX's gullet and as replacement you get the corresponding ⟨control symbol token⟩ or ⟨control word token⟩. In case that token is undefined, it will get the meaning of the relax
-primitive within the current scope.
E.g., the romannumeral
-primitive is an expandable primitive. It and a following ⟨number⟩ vanish in TeX's gullet and in case the ⟨number⟩ is positive as replacement you get a sequence of ⟨character tokens⟩ of category code 12 (other) that represent the corresponding ⟨number⟩ in lowercase-roman notation.
There are primitive ⟨control sequences⟩ that are not expandable:
E.g., the relax
-primitive is not expandable. It does pass the gullet and reach the stomach.
E.g., the assignment-primitives def
, edef
, gdef
, xdef
, countdef
etc are not expandable. They do not trigger their replacement in TeX's gullet but will reach the stomach.
There are non-primitive ⟨control sequences⟩ that are expandable:
Macros are non-primitive ⟨control sequences⟩ that are expandable. They and their arguments get replaced with their ⟨replacement text⟩ in TeX's gullet. Macros are defined in TeX's stomach in terms of the primitives def
, edef
, gdef
and xdef
. Be aware that LaTeX's newcommand
also is just a macro which "decomposes" in LaTeX's gullet to some set of tokens containing the def
-primitive.
There are non-primitive ⟨control sequences⟩ that are not expandable:
You can, e.g., do things like letbgroup={
.
bgroup
is a ⟨control sequence⟩. More specific: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is not expandable but it also does not have the meaning of one of the about 300 non-decomposable functions of TeX that are called primitives. bgroup
is an implicit character token. ;-)
Other kinds of non-primitive ⟨control sequences⟩ that are not expandable are, e.g., chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
In order to get used to the terminology, let's unravel some cases:
After carrying out the assignments
catcode`W=13 %
catcode`X=13 %
catcode`Y=13 %
catcode`Z=13 %
newcommand WThe active character stringW is a macro.
newcommandsomethingThis is a macro.
newcommand+This is a macro, too.
let X=a
let Y=relax
let Z=romannumeral
,
W
is a ⟨control sequence⟩: It is an ⟨active character token⟩. It is a macro and thus both expandable and not a primitive.X
is a ⟨control sequence⟩: It is an
⟨active character token⟩. It is not expandable. Thus it cannot be a macro. It also is not an unexpandable primitive. It has the same meaning as the catcode 11(letter) ⟨character token⟩a
. It is an implicit character token.Y
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the unexpandablerelax
-primitive. Thus it is an unexpandable primitive.Z
is a ⟨control sequence⟩: It is an ⟨active character token⟩. Its meaning equals the meaning of the expandableromannumeral
-primitive. Thus it is an expandable primitive.something
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control word token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.+
is a ⟨control sequence⟩: It is a ⟨control sequence token⟩. It is a ⟨control symbol token⟩. It is expandable. Its meaning does not equal one of the about 300 non-decomposable functions of TeX that are called primitives. It is a macro.
The term "macro" does always denote a non-primitive expandable ⟨control sequence⟩.
Thus when you know that a token is a macro, you can conclude that it cannot be a
⟨non-active character token⟩ because ⟨non-active character tokens⟩ cannot be ⟨control sequences⟩.
You can conclude that it is a ⟨control sequence⟩.
But you cannot conclude whether the ⟨control sequence⟩ is a ⟨control word token⟩ or a ⟨control symbol token⟩ or an
⟨active character token⟩.
Not all expandable ⟨control sequences⟩ are macros.
There are expandable ⟨primitives⟩ as well.
Not all non-expandable ⟨control sequences⟩ are ⟨primitives⟩.
There are non-expandable non-primitives as well:
E.g., implicit character tokens, chardef
-tokens, countdef
-tokens, toksdef
-tokens, ...
edited Jan 6 at 11:38
answered Jan 4 at 23:08
Ulrich DiezUlrich Diez
4,295615
4,295615
1
Sorry for asking, but just curiosity: are you writing everything from your mind?
– Sigur
Jan 5 at 0:33
I just observed you were editing a lot.
– Sigur
Jan 5 at 0:51
1
@Sigur I just thought about it, looked up the quotes and spontaneously typed into the input-field of the TeX-LaTeX-StackExchange-Website shown in my internet-browser. If I had really planned this answer systematically, I would have typed within my favorite editor-software, saved the text as a file and revised it a few times before pasting it from the editor-software to the input-field of the TeX-LaTeX-StackExchange-Website. But I didn't take it all too serious. I just wanted to have fun...;-)
– Ulrich Diez
Jan 5 at 1:10
2
Ulrich, Since TeX doesn't end in an orthographic 's', it's possessive form should be "TeX's" not just "TeX'".
– Alan Munn
Jan 5 at 17:04
2
@AlanMunn These comments might prevent others from making the same mistakes... ;-)
– Ulrich Diez
Jan 5 at 17:41
|
show 3 more comments
1
Sorry for asking, but just curiosity: are you writing everything from your mind?
– Sigur
Jan 5 at 0:33
I just observed you were editing a lot.
– Sigur
Jan 5 at 0:51
1
@Sigur I just thought about it, looked up the quotes and spontaneously typed into the input-field of the TeX-LaTeX-StackExchange-Website shown in my internet-browser. If I had really planned this answer systematically, I would have typed within my favorite editor-software, saved the text as a file and revised it a few times before pasting it from the editor-software to the input-field of the TeX-LaTeX-StackExchange-Website. But I didn't take it all too serious. I just wanted to have fun...;-)
– Ulrich Diez
Jan 5 at 1:10
2
Ulrich, Since TeX doesn't end in an orthographic 's', it's possessive form should be "TeX's" not just "TeX'".
– Alan Munn
Jan 5 at 17:04
2
@AlanMunn These comments might prevent others from making the same mistakes... ;-)
– Ulrich Diez
Jan 5 at 17:41
1
1
Sorry for asking, but just curiosity: are you writing everything from your mind?
– Sigur
Jan 5 at 0:33
Sorry for asking, but just curiosity: are you writing everything from your mind?
– Sigur
Jan 5 at 0:33
I just observed you were editing a lot.
– Sigur
Jan 5 at 0:51
I just observed you were editing a lot.
– Sigur
Jan 5 at 0:51
1
1
@Sigur I just thought about it, looked up the quotes and spontaneously typed into the input-field of the TeX-LaTeX-StackExchange-Website shown in my internet-browser. If I had really planned this answer systematically, I would have typed within my favorite editor-software, saved the text as a file and revised it a few times before pasting it from the editor-software to the input-field of the TeX-LaTeX-StackExchange-Website. But I didn't take it all too serious. I just wanted to have fun...;-)
– Ulrich Diez
Jan 5 at 1:10
@Sigur I just thought about it, looked up the quotes and spontaneously typed into the input-field of the TeX-LaTeX-StackExchange-Website shown in my internet-browser. If I had really planned this answer systematically, I would have typed within my favorite editor-software, saved the text as a file and revised it a few times before pasting it from the editor-software to the input-field of the TeX-LaTeX-StackExchange-Website. But I didn't take it all too serious. I just wanted to have fun...;-)
– Ulrich Diez
Jan 5 at 1:10
2
2
Ulrich, Since TeX doesn't end in an orthographic 's', it's possessive form should be "TeX's" not just "TeX'".
– Alan Munn
Jan 5 at 17:04
Ulrich, Since TeX doesn't end in an orthographic 's', it's possessive form should be "TeX's" not just "TeX'".
– Alan Munn
Jan 5 at 17:04
2
2
@AlanMunn These comments might prevent others from making the same mistakes... ;-)
– Ulrich Diez
Jan 5 at 17:41
@AlanMunn These comments might prevent others from making the same mistakes... ;-)
– Ulrich Diez
Jan 5 at 17:41
|
show 3 more comments
They're used interchangeably because they mean the same thing.
A macro is defined as
a single computer instruction that stands for a sequence of operations
The use of command probably stems from the way it's defined in LaTeX via
newcommand
andrenewcommand
and LaTeX3'sxparse
viaNewDocumentCommand
andRenewDocumentCommand
.There's also control sequence used by DEK in his TeX Book. They all point to the same thing.
7
I would not say they mean the same thing, tex primitives are not macros.
– David Carlisle
Jan 4 at 10:12
I'm confident the majority of people using (La)TeX use the terminology interchangeably.
– Werner
Jan 4 at 17:32
2
Certainly some do but that I think leads to confusion (as witnessed by this question)
– David Carlisle
Jan 4 at 17:35
add a comment |
They're used interchangeably because they mean the same thing.
A macro is defined as
a single computer instruction that stands for a sequence of operations
The use of command probably stems from the way it's defined in LaTeX via
newcommand
andrenewcommand
and LaTeX3'sxparse
viaNewDocumentCommand
andRenewDocumentCommand
.There's also control sequence used by DEK in his TeX Book. They all point to the same thing.
7
I would not say they mean the same thing, tex primitives are not macros.
– David Carlisle
Jan 4 at 10:12
I'm confident the majority of people using (La)TeX use the terminology interchangeably.
– Werner
Jan 4 at 17:32
2
Certainly some do but that I think leads to confusion (as witnessed by this question)
– David Carlisle
Jan 4 at 17:35
add a comment |
They're used interchangeably because they mean the same thing.
A macro is defined as
a single computer instruction that stands for a sequence of operations
The use of command probably stems from the way it's defined in LaTeX via
newcommand
andrenewcommand
and LaTeX3'sxparse
viaNewDocumentCommand
andRenewDocumentCommand
.There's also control sequence used by DEK in his TeX Book. They all point to the same thing.
They're used interchangeably because they mean the same thing.
A macro is defined as
a single computer instruction that stands for a sequence of operations
The use of command probably stems from the way it's defined in LaTeX via
newcommand
andrenewcommand
and LaTeX3'sxparse
viaNewDocumentCommand
andRenewDocumentCommand
.There's also control sequence used by DEK in his TeX Book. They all point to the same thing.
answered Jan 4 at 5:09
WernerWerner
440k669671662
440k669671662
7
I would not say they mean the same thing, tex primitives are not macros.
– David Carlisle
Jan 4 at 10:12
I'm confident the majority of people using (La)TeX use the terminology interchangeably.
– Werner
Jan 4 at 17:32
2
Certainly some do but that I think leads to confusion (as witnessed by this question)
– David Carlisle
Jan 4 at 17:35
add a comment |
7
I would not say they mean the same thing, tex primitives are not macros.
– David Carlisle
Jan 4 at 10:12
I'm confident the majority of people using (La)TeX use the terminology interchangeably.
– Werner
Jan 4 at 17:32
2
Certainly some do but that I think leads to confusion (as witnessed by this question)
– David Carlisle
Jan 4 at 17:35
7
7
I would not say they mean the same thing, tex primitives are not macros.
– David Carlisle
Jan 4 at 10:12
I would not say they mean the same thing, tex primitives are not macros.
– David Carlisle
Jan 4 at 10:12
I'm confident the majority of people using (La)TeX use the terminology interchangeably.
– Werner
Jan 4 at 17:32
I'm confident the majority of people using (La)TeX use the terminology interchangeably.
– Werner
Jan 4 at 17:32
2
2
Certainly some do but that I think leads to confusion (as witnessed by this question)
– David Carlisle
Jan 4 at 17:35
Certainly some do but that I think leads to confusion (as witnessed by this question)
– David Carlisle
Jan 4 at 17:35
add a comment |
Thanks for contributing an answer to TeX - LaTeX Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2ftex.stackexchange.com%2fquestions%2f468508%2fwhat-is-the-difference-between-macro-and-command%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
2
We write
macro
orcommand
orcontrol sequence
because we don't want to use the same name all the time. We're not lazy ;-)– Christian Hupfer
Jan 4 at 8:15
1
I personally do use these three words slightly differently, and I'm now wondering if this is correct. For me a "macro" is a command defined in terms of other commands (it can be expanded, so I exclude primitives and lengths), a "command" is an instruction that does something (which is admittedly vague, but this would include macros, but not lengths) and a "control" sequence" is a single token that consists of a backslash followed by a sequence of characters (irrespective of whether it is defined at all, but I'm not including active characters). Most commands are all three of these things.
– Circumscribe
Jan 4 at 8:39
@Circumscribe There are expandable primitives that are sometimes expanding to a value, e.g.
eTeXrevision
. Is this a macro as per your definition?– TeXnician
Jan 4 at 8:41
@TeXnician: Good point, I realised that only just before you posted your comment [for those reading along: the first version of my comment said that a macro is something that expands to something else]. I would say it is not (nor are e.g.
the
,ifx
andcsname
), but I don't know how (or if) these terms are officially defined and I'm actually curious about what users think.– Circumscribe
Jan 4 at 8:50
Clarification: for my definition a "macro" would be anything for which the
show<macro>
output starts with "macro:" (and which is thus ultimately defined using a version ofdef
?) and a command would be anything that is either a macro or a primitive (something for whichshow<primitive>
outputs<primitive>
), but is not a register (or a character). That last part is probably terrible as a definition though. I also tend to say "command" rather than "macro" when the instruction performs an assignment rather than producing output, but that's probably just me.– Circumscribe
Jan 4 at 10:01