.\" Copyright (c) 1989, 1991, 1993
.\"	The Regents of the University of California.  All rights reserved.
.\"
.\" Redistribution and use in source and binary forms, with or without
.\" modification, are permitted provided that the following conditions
.\" are met:
.\" 1. Redistributions of source code must retain the above copyright
.\"    notice, this list of conditions and the following disclaimer.
.\" 2. Redistributions in binary form must reproduce the above copyright
.\"    notice, this list of conditions and the following disclaimer in the
.\"    documentation and/or other materials provided with the distribution.
.\" 3. Neither the name of the University nor the names of its contributors
.\"    may be used to endorse or promote products derived from this software
.\"    without specific prior written permission.
.\"
.\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
.\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
.\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
.\" ARE DISCLAIMED.  IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
.\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
.\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
.\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
.\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
.\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
.\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
.\" SUCH DAMAGE.
.Nd language definition data base for
contains all language definitions for
The following table names and describes each field.
.Bl -column Namexxx Tpexxx
.It Sy Name Ta Sy Type Ta Sy Description
.It ab Ta str Ta regular expression for the start of an alternate comment
.It ae Ta str Ta regular expression for the end of an alternate comment
.It pb Ta str Ta regular expression for the start of a procedure
.It bb Ta str Ta regular expression for the start of a lexical block
.It be Ta str Ta regular expression for the end of a lexical block
.It cb Ta str Ta regular expression for the start of a comment
.It ce Ta str Ta regular expression for the end of a comment
.It sb Ta str Ta regular expression for the start of a string
.It se Ta str Ta regular expression for the end of a string
.It lb Ta str Ta regular expression for the start of a character constant
.It le Ta str Ta regular expression for the end of a character constant
.It nc Ta str Ta regular expression for a non-comment (see below)
.It tl Ta bool Ta present means procedures are only defined at the top lexical level
.It oc Ta bool Ta present means upper and lower case are equivalent
.It kw Ta str Ta a list of keywords separated by spaces
.El
.Pp
Non-comments are needed to describe a context in which a
sequence that would normally start a comment loses its special
meaning.
A typical example can be found in Perl, where
comments normally start with
.Ql # ,
while
.Ql $#
is an operator on an array.
.Sh REGULAR EXPRESSIONS
uses regular expressions that are very similar to those of
The characters `^', `$', `:' and `\e'
are reserved characters and must be
"quoted" with a preceding
.Ql \e
if they
are to be included as normal characters.
The metasymbols and their meanings are:
.Bl -tag -width indent
.It $
the end of a line
.It \&^
the beginning of a line
.It \ed
a delimiter (space, tab, newline, start of line)
.It \ea
matches any string of symbols (like .* in lex)
.It \ep
matches any alphanumeric name.
In a procedure definition (pb) the string
that matches this symbol is used as the procedure name.
.It ()
grouping
.It \&|
alternation
.It ?
the preceding item is optional
.It \ee
placed before a string, prevents the string from matching an
input string that is preceded by an escape character (\e).
This is typically used for languages (like C) that can include the
string delimiter in a string by escaping it.
.El
.Pp
Unlike other regular expressions in the system, these match words
and not characters.
Hence something like "(tramp|steamer)flies?"
would match "tramp", "steamer", "trampflies", or "steamerflies".
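.Pp
The word-oriented matching described above can be illustrated with a
short Python sketch.
It is not how the matcher is implemented; the helper name
to_python_regex is hypothetical, and only literal text, grouping,
alternation and the optional marker are handled.
Each run of literal characters is wrapped in its own group, so that
the optional marker applies to the whole word rather than to its last
character.

```python
import re


def to_python_regex(pattern):
    """Translate a word-oriented pattern into a Python regular expression.

    Only literal text, '(', ')', '|' and '?' are understood (an
    assumption made to keep the sketch small).
    """
    out = []
    word = ""

    def flush():
        # Emit the pending literal run as one non-capturing group.
        nonlocal word
        if word:
            out.append("(?:" + re.escape(word) + ")")
            word = ""

    for ch in pattern:
        if ch in "()|":
            flush()
            out.append("(?:" if ch == "(" else ch)
        elif ch == "?":
            flush()
            out.append("?")  # applies to the whole preceding word or group
        else:
            word += ch
    flush()
    return "".join(out)


if __name__ == "__main__":
    regex = re.compile(to_python_regex("(tramp|steamer)flies?"))
    for candidate in ("tramp", "steamer", "trampflies", "steamerflies"):
        print(candidate, bool(regex.fullmatch(candidate)))
```

With character-oriented semantics the same pattern would instead make
only the final "s" optional.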
The keyword list is just a list of keywords in the language separated
by spaces.
If the "oc" boolean is specified, indicating that upper
and lower case are equivalent, then all the keywords should be
specified in lower case.
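.Pp
The reason for the lower-case rule can be sketched in Python.
The sketch assumes that, when "oc" is set, only the word read from the
input is folded to lower case before it is compared with the keyword
list; the function names are hypothetical.

```python
def build_keyword_set(kw_value):
    """Split the value of a kw capability into a set of keywords."""
    return set(kw_value.split())


def is_keyword(word, keywords, oc=False):
    """Return True if word is a keyword, folding case first when oc is set."""
    if oc:
        # Assumption: only the input word is folded, so a keyword stored
        # with upper-case letters could never match.
        word = word.lower()
    return word in keywords


if __name__ == "__main__":
    keywords = build_keyword_set("begin end if then")
    print(is_keyword("BEGIN", keywords, oc=True))
    print(is_keyword("BEGIN", keywords, oc=False))
```

Under that assumption a keyword listed as "Begin" would never be
recognised once "oc" is present.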
.Bl -tag -width /usr/share/misc/vgrindefs -compact
.It Pa /usr/share/misc/vgrindefs
File containing language descriptions.
.El
The following entry, which describes the C language, is
typical of a language entry.
.Bd -literal
	:pb=^\ed?*?\ed?\ep\ed?\e(\ea?\e):bb={:be=}:cb=/*:ce=*/:sb=":se=\ee":\e
	:kw=asm auto break case char continue default do double else enum\e
	extern float for fortran goto if int long register return short\e
	sizeof static struct switch typedef union unsigned while #define\e
	#else #endif #if #ifdef #ifndef #include #undef # define else endif\e
	if ifdef ifndef include undef:
.Ed
.Pp
Note that the first field is just the language name (and any variants
of it).
Thus the C language could be specified to
.Pp
Entries may continue onto multiple lines by giving a \e as the last
character of a line.
Capabilities are of two types:
Boolean capabilities, which indicate that the language has
some particular feature,
and string
capabilities, which give a regular expression or
keyword list.
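.Pp
The entry layout described above (names first, continuation lines,
Boolean and string capabilities) can be sketched as a small Python
parser.
This is an illustration only, not the code used to read the file:
the function name parse_entry and the shortened sample entry are
hypothetical, and the sketch assumes that a backslash-quoted character
always stays inside its field.

```python
def parse_entry(text):
    """Split one entry into (names, capabilities)."""
    # A backslash as the last character of a line continues the entry.
    joined = text.replace("\\\n", "")

    # Split on ':' while keeping backslash-quoted characters in place.
    fields, current, i = [], "", 0
    while i < len(joined):
        ch = joined[i]
        if ch == "\\" and i + 1 < len(joined):
            current += joined[i:i + 2]
            i += 2
        elif ch == ":":
            fields.append(current)
            current = ""
            i += 1
        else:
            current += ch
            i += 1
    fields.append(current)

    # The first field holds the language name and its variants.
    names = fields[0].strip().split("|")

    caps = {}
    for field in fields[1:]:
        field = field.strip()
        if not field:
            continue  # whitespace left between continuation lines
        name, sep, value = field.partition("=")
        # "xx=value" is a string capability, a bare "xx" is Boolean.
        caps[name] = value if sep else True
    return names, caps


if __name__ == "__main__":
    sample = "C|c:\\\n\t:bb={:be=}:tl:\\\n\t:kw=if else while:"
    print(parse_entry(sample))
```

Returning Boolean capabilities as True and string capabilities as
their raw text keeps the two kinds distinguishable without
interpreting the regular expressions themselves.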
.Xr troff 1 Pq Pa ports/textproc/groff ,
file format appeared in