libraries/spcre/libpcre/pcre/NON-UNIX-USE
changeset 0 7f656887cf89
equal deleted inserted replaced
-1:000000000000 0:7f656887cf89
       
     1 Compiling PCRE on non-Unix systems
       
     2 ----------------------------------
       
     3 
       
     4 This document contains the following sections:
       
     5 
       
     6   General
       
     7   Generic instructions for the PCRE C library
       
     8   The C++ wrapper functions
       
     9   Building for virtual Pascal
       
    10   Stack size in Windows environments
       
    11   Linking programs in Windows environments
       
    12   Comments about Win32 builds
       
    13   Building PCRE on Windows with CMake
       
    14   Use of relative paths with CMake on Windows
       
    15   Testing with runtest.bat
       
    16   Building under Windows with BCC5.5
       
    17   Building PCRE on OpenVMS
       
    18 
       
    19 
       
    20 GENERAL
       
    21 
       
    22 I (Philip Hazel) have no experience of Windows or VMS sytems and how their
       
    23 libraries work. The items in the PCRE distribution and Makefile that relate to
       
    24 anything other than Unix-like systems are untested by me.
       
    25 
       
    26 There are some other comments and files in the Contrib directory on the ftp
       
    27 site that you may find useful. See
       
    28 
       
    29   ftp://ftp.csx.cam.ac.uk/pub/software/programming/pcre/Contrib
       
    30 
       
    31 If you want to compile PCRE for a non-Unix system (especially for a system that
       
    32 does not support "configure" and "make" files), note that the basic PCRE
       
    33 library consists entirely of code written in Standard C, and so should compile
       
    34 successfully on any system that has a Standard C compiler and library. The C++
       
    35 wrapper functions are a separate issue (see below).
       
    36 
       
    37 The PCRE distribution includes a "configure" file for use by the Configure/Make
       
    38 build system, as found in many Unix-like environments. There is also support
       
    39 support for CMake, which some users prefer, in particular in Windows
       
    40 environments. There are some instructions for CMake under Windows in the
       
    41 section entitled "Building PCRE with CMake" below. CMake can also be used to
       
    42 build PCRE in Unix-like systems.
       
    43 
       
    44 
       
    45 GENERIC INSTRUCTIONS FOR THE PCRE C LIBRARY
       
    46 
       
    47 The following are generic comments about building the PCRE C library "by hand".
       
    48 
       
    49  (1) Copy or rename the file config.h.generic as config.h, and edit the macro
       
    50      settings that it contains to whatever is appropriate for your environment.
       
    51      In particular, if you want to force a specific value for newline, you can
       
    52      define the NEWLINE macro. When you compile any of the PCRE modules, you
       
    53      must specify -DHAVE_CONFIG_H to your compiler so that config.h is included
       
    54      in the sources.
       
    55 
       
    56      An alternative approach is not to edit config.h, but to use -D on the
       
    57      compiler command line to make any changes that you need to the
       
    58      configuration options. In this case -DHAVE_CONFIG_H must not be set.
       
    59 
       
    60      NOTE: There have been occasions when the way in which certain parameters
       
    61      in config.h are used has changed between releases. (In the configure/make
       
    62      world, this is handled automatically.) When upgrading to a new release,
       
    63      you are strongly advised to review config.h.generic before re-using what
       
    64      you had previously.
       
    65 
       
    66  (2) Copy or rename the file pcre.h.generic as pcre.h.
       
    67 
       
    68  (3) EITHER:
       
    69        Copy or rename file pcre_chartables.c.dist as pcre_chartables.c.
       
    70 
       
    71      OR:
       
    72        Compile dftables.c as a stand-alone program (using -DHAVE_CONFIG_H if
       
    73        you have set up config.h), and then run it with the single argument
       
    74        "pcre_chartables.c". This generates a set of standard character tables
       
    75        and writes them to that file. The tables are generated using the default
       
    76        C locale for your system. If you want to use a locale that is specified
       
    77        by LC_xxx environment variables, add the -L option to the dftables
       
    78        command. You must use this method if you are building on a system that
       
    79        uses EBCDIC code.
       
    80 
       
    81      The tables in pcre_chartables.c are defaults. The caller of PCRE can
       
    82      specify alternative tables at run time.
       
    83 
       
    84  (4) Ensure that you have the following header files:
       
    85 
       
    86        pcre_internal.h
       
    87        ucp.h
       
    88 
       
    89  (5) Also ensure that you have the following file, which is #included as source
       
    90      when building a debugging version of PCRE, and is also used by pcretest.
       
    91 
       
    92        pcre_printint.src
       
    93 
       
    94  (6) Compile the following source files, setting -DHAVE_CONFIG_H as a compiler
       
    95      option if you have set up config.h with your configuration, or else use
       
    96      other -D settings to change the configuration as required.
       
    97 
       
    98        pcre_chartables.c
       
    99        pcre_compile.c
       
   100        pcre_config.c
       
   101        pcre_dfa_exec.c
       
   102        pcre_exec.c
       
   103        pcre_fullinfo.c
       
   104        pcre_get.c
       
   105        pcre_globals.c
       
   106        pcre_info.c
       
   107        pcre_maketables.c
       
   108        pcre_newline.c
       
   109        pcre_ord2utf8.c
       
   110        pcre_refcount.c
       
   111        pcre_study.c
       
   112        pcre_tables.c
       
   113        pcre_try_flipped.c
       
   114        pcre_ucd.c
       
   115        pcre_valid_utf8.c
       
   116        pcre_version.c
       
   117        pcre_xclass.c
       
   118 
       
   119      Make sure that you include -I. in the compiler command (or equivalent for
       
   120      an unusual compiler) so that all included PCRE header files are first
       
   121      sought in the current directory. Otherwise you run the risk of picking up
       
   122      a previously-installed file from somewhere else.
       
   123 
       
   124  (7) Now link all the compiled code into an object library in whichever form
       
   125      your system keeps such libraries. This is the basic PCRE C library. If
       
   126      your system has static and shared libraries, you may have to do this once
       
   127      for each type.
       
   128 
       
   129  (8) Similarly, compile pcreposix.c (remembering -DHAVE_CONFIG_H if necessary)
       
   130      and link the result (on its own) as the pcreposix library.
       
   131 
       
   132  (9) Compile the test program pcretest.c (again, don't forget -DHAVE_CONFIG_H).
       
   133      This needs the functions in the pcre and pcreposix libraries when linking.
       
   134      It also needs the pcre_printint.src source file, which it #includes.
       
   135 
       
   136 (10) Run pcretest on the testinput files in the testdata directory, and check
       
   137      that the output matches the corresponding testoutput files. Note that the
       
   138      supplied files are in Unix format, with just LF characters as line
       
   139      terminators. You may need to edit them to change this if your system uses
       
   140      a different convention. If you are using Windows, you probably should use
       
   141      the wintestinput3 file instead of testinput3 (and the corresponding output
       
   142      file). This is a locale test; wintestinput3 sets the locale to "french"
       
   143      rather than "fr_FR", and there some minor output differences.
       
   144 
       
   145 (11) If you want to use the pcregrep command, compile and link pcregrep.c; it
       
   146      uses only the basic PCRE library (it does not need the pcreposix library).
       
   147 
       
   148 
       
   149 THE C++ WRAPPER FUNCTIONS
       
   150 
       
   151 The PCRE distribution also contains some C++ wrapper functions and tests,
       
   152 contributed by Google Inc. On a system that can use "configure" and "make",
       
   153 the functions are automatically built into a library called pcrecpp. It should
       
   154 be straightforward to compile the .cc files manually on other systems. The
       
   155 files called xxx_unittest.cc are test programs for each of the corresponding
       
   156 xxx.cc files.
       
   157 
       
   158 
       
   159 BUILDING FOR VIRTUAL PASCAL
       
   160 
       
   161 A script for building PCRE using Borland's C++ compiler for use with VPASCAL
       
   162 was contributed by Alexander Tokarev. Stefan Weber updated the script and added
       
   163 additional files. The following files in the distribution are for building PCRE
       
   164 for use with VP/Borland: makevp_c.txt, makevp_l.txt, makevp.bat, pcregexp.pas.
       
   165 
       
   166 
       
   167 STACK SIZE IN WINDOWS ENVIRONMENTS
       
   168 
       
   169 The default processor stack size of 1Mb in some Windows environments is too
       
   170 small for matching patterns that need much recursion. In particular, test 2 may
       
   171 fail because of this. Normally, running out of stack causes a crash, but there
       
   172 have been cases where the test program has just died silently. See your linker
       
   173 documentation for how to increase stack size if you experience problems. The
       
   174 Linux default of 8Mb is a reasonable choice for the stack, though even that can
       
   175 be too small for some pattern/subject combinations.
       
   176 
       
   177 PCRE has a compile configuration option to disable the use of stack for
       
   178 recursion so that heap is used instead. However, pattern matching is
       
   179 significantly slower when this is done. There is more about stack usage in the
       
   180 "pcrestack" documentation.
       
   181 
       
   182 
       
   183 LINKING PROGRAMS IN WINDOWS ENVIRONMENTS
       
   184 
       
   185 If you want to statically link a program against a PCRE library in the form of
       
   186 a non-dll .a file, you must define PCRE_STATIC before including pcre.h,
       
   187 otherwise the pcre_malloc() and pcre_free() exported functions will be declared
       
   188 __declspec(dllimport), with unwanted results.
       
   189 
       
   190 
       
   191 CALLING CONVENTIONS IN WINDOWS ENVIRONMENTS
       
   192 
       
   193 It is possible to compile programs to use different calling conventions using
       
   194 MSVC. Search the web for "calling conventions" for more information. To make it
       
   195 easier to change the calling convention for the exported functions in the
       
   196 PCRE library, the macro PCRE_CALL_CONVENTION is present in all the external
       
   197 definitions. It can be set externally when compiling (e.g. in CFLAGS). If it is
       
   198 not set, it defaults to empty; the default calling convention is then used
       
   199 (which is what is wanted most of the time).
       
   200 
       
   201 
       
   202 COMMENTS ABOUT WIN32 BUILDS (see also "BUILDING PCRE WITH CMAKE" below)
       
   203 
       
   204 There are two ways of building PCRE using the "configure, make, make install"
       
   205 paradigm on Windows systems: using MinGW or using Cygwin. These are not at all
       
   206 the same thing; they are completely different from each other. There is also
       
   207 support for building using CMake, which some users find a more straightforward
       
   208 way of building PCRE under Windows. However, the tests are not run
       
   209 automatically when CMake is used.
       
   210 
       
   211 The MinGW home page (http://www.mingw.org/) says this:
       
   212 
       
   213   MinGW: A collection of freely available and freely distributable Windows
       
   214   specific header files and import libraries combined with GNU toolsets that
       
   215   allow one to produce native Windows programs that do not rely on any
       
   216   3rd-party C runtime DLLs.
       
   217 
       
   218 The Cygwin home page (http://www.cygwin.com/) says this:
       
   219 
       
   220   Cygwin is a Linux-like environment for Windows. It consists of two parts:
       
   221 
       
   222   . A DLL (cygwin1.dll) which acts as a Linux API emulation layer providing
       
   223     substantial Linux API functionality
       
   224 
       
   225   . A collection of tools which provide Linux look and feel.
       
   226 
       
   227   The Cygwin DLL currently works with all recent, commercially released x86 32
       
   228   bit and 64 bit versions of Windows, with the exception of Windows CE.
       
   229 
       
   230 On both MinGW and Cygwin, PCRE should build correctly using:
       
   231 
       
   232   ./configure && make && make install
       
   233 
       
   234 This should create two libraries called libpcre and libpcreposix, and, if you
       
   235 have enabled building the C++ wrapper, a third one called libpcrecpp. These are
       
   236 independent libraries: when you like with libpcreposix or libpcrecpp you must
       
   237 also link with libpcre, which contains the basic functions. (Some earlier
       
   238 releases of PCRE included the basic libpcre functions in libpcreposix. This no
       
   239 longer happens.)
       
   240 
       
   241 A user submitted a special-purpose patch that makes it easy to create
       
   242 "pcre.dll" under mingw32 using the "msys" environment. It provides "pcre.dll"
       
   243 as a special target. If you use this target, no other files are built, and in
       
   244 particular, the pcretest and pcregrep programs are not built. An example of how
       
   245 this might be used is:
       
   246 
       
   247   ./configure --enable-utf --disable-cpp CFLAGS="-03 -s"; make pcre.dll
       
   248 
       
   249 Using Cygwin's compiler generates libraries and executables that depend on
       
   250 cygwin1.dll. If a library that is generated this way is distributed,
       
   251 cygwin1.dll has to be distributed as well. Since cygwin1.dll is under the GPL
       
   252 licence, this forces not only PCRE to be under the GPL, but also the entire
       
   253 application. A distributor who wants to keep their own code proprietary must
       
   254 purchase an appropriate Cygwin licence.
       
   255 
       
   256 MinGW has no such restrictions. The MinGW compiler generates a library or
       
   257 executable that can run standalone on Windows without any third party dll or
       
   258 licensing issues.
       
   259 
       
   260 But there is more complication:
       
   261 
       
   262 If a Cygwin user uses the -mno-cygwin Cygwin gcc flag, what that really does is
       
   263 to tell Cygwin's gcc to use the MinGW gcc. Cygwin's gcc is only acting as a
       
   264 front end to MinGW's gcc (if you install Cygwin's gcc, you get both Cygwin's
       
   265 gcc and MinGW's gcc). So, a user can:
       
   266 
       
   267 . Build native binaries by using MinGW or by getting Cygwin and using
       
   268   -mno-cygwin.
       
   269 
       
   270 . Build binaries that depend on cygwin1.dll by using Cygwin with the normal
       
   271   compiler flags.
       
   272 
       
   273 The test files that are supplied with PCRE are in Unix format, with LF
       
   274 characters as line terminators. It may be necessary to change the line
       
   275 terminators in order to get some of the tests to work. We hope to improve
       
   276 things in this area in future.
       
   277 
       
   278 
       
   279 BUILDING PCRE ON WINDOWS WITH CMAKE
       
   280 
       
   281 CMake is an alternative build facility that can be used instead of the
       
   282 traditional Unix "configure". CMake version 2.4.7 supports Borland makefiles,
       
   283 MinGW makefiles, MSYS makefiles, NMake makefiles, UNIX makefiles, Visual Studio
       
   284 6, Visual Studio 7, Visual Studio 8, and Watcom W8. The following instructions
       
   285 were contributed by a PCRE user.
       
   286 
       
   287 1.  Download CMake 2.4.7 or above from http://www.cmake.org/, install and ensure
       
   288     that cmake\bin is on your path.
       
   289 
       
   290 2.  Unzip (retaining folder structure) the PCRE source tree into a source
       
   291     directory such as C:\pcre.
       
   292 
       
   293 3.  Create a new, empty build directory: C:\pcre\build\
       
   294 
       
   295 4.  Run CMakeSetup from the Shell envirornment of your build tool, e.g., Msys
       
   296     for Msys/MinGW or Visual Studio Command Prompt for VC/VC++
       
   297 
       
   298 5.  Enter C:\pcre\pcre-xx and C:\pcre\build for the source and build
       
   299     directories, respectively
       
   300 
       
   301 6.  Hit the "Configure" button.
       
   302 
       
   303 7.  Select the particular IDE / build tool that you are using (Visual Studio,
       
   304     MSYS makefiles, MinGW makefiles, etc.)
       
   305 
       
   306 8.  The GUI will then list several configuration options. This is where you can
       
   307     enable UTF-8 support, etc.
       
   308 
       
   309 9.  Hit "Configure" again. The adjacent "OK" button should now be active.
       
   310 
       
   311 10. Hit "OK".
       
   312 
       
   313 11. The build directory should now contain a usable build system, be it a
       
   314     solution file for Visual Studio, makefiles for MinGW, etc.
       
   315 
       
   316 
       
   317 USE OF RELATIVE PATHS WITH CMAKE ON WINDOWS
       
   318 
       
   319 A PCRE user comments as follows:
       
   320 
       
   321 I thought that others may want to know the current state of
       
   322 CMAKE_USE_RELATIVE_PATHS support on Windows.
       
   323 
       
   324 Here it is:
       
   325 -- AdditionalIncludeDirectories is only partially modified (only the
       
   326 first path - see below)
       
   327 -- Only some of the contained file paths are modified - shown below for
       
   328 pcre.vcproj
       
   329 -- It properly modifies
       
   330 
       
   331 I am sure CMake people can fix that if they want to. Until then one will
       
   332 need to replace existing absolute paths in project files with relative
       
   333 paths manually (e.g. from VS) - relative to project file location. I did
       
   334 just that before being told to try CMAKE_USE_RELATIVE_PATHS. Not a big
       
   335 deal.
       
   336 
       
   337 AdditionalIncludeDirectories="E:\builds\pcre\build;E:\builds\pcre\pcre-7.5;"
       
   338 AdditionalIncludeDirectories=".;E:\builds\pcre\pcre-7.5;"
       
   339 
       
   340 RelativePath="pcre.h">
       
   341 RelativePath="pcre_chartables.c">
       
   342 RelativePath="pcre_chartables.c.rule">
       
   343 
       
   344 
       
   345 TESTING WITH RUNTEST.BAT
       
   346 
       
   347 1. Copy RunTest.bat into the directory where pcretest.exe has been created.
       
   348 
       
   349 2. Edit RunTest.bat and insert a line that indentifies the relative location of
       
   350    the pcre source, e.g.:
       
   351 
       
   352    set srcdir=..\pcre-7.4-RC3
       
   353 
       
   354 3. Run RunTest.bat from a command shell environment. Test outputs will
       
   355    automatically be compared to expected results, and discrepancies will
       
   356    identified in the console output.
       
   357 
       
   358 4. To test pcrecpp, run pcrecpp_unittest.exe, pcre_stringpiece_unittest.exe and
       
   359    pcre_scanner_unittest.exe.
       
   360 
       
   361 
       
   362 BUILDING UNDER WINDOWS WITH BCC5.5
       
   363 
       
   364 Michael Roy sent these comments about building PCRE under Windows with BCC5.5:
       
   365 
       
   366   Some of the core BCC libraries have a version of PCRE from 1998 built in,
       
   367   which can lead to pcre_exec() giving an erroneous PCRE_ERROR_NULL from a
       
   368   version mismatch. I'm including an easy workaround below, if you'd like to
       
   369   include it in the non-unix instructions:
       
   370 
       
   371   When linking a project with BCC5.5, pcre.lib must be included before any of
       
   372   the libraries cw32.lib, cw32i.lib, cw32mt.lib, and cw32mti.lib on the command
       
   373   line.
       
   374 
       
   375 
       
   376 BUILDING PCRE ON OPENVMS
       
   377 
       
   378 Dan Mooney sent the following comments about building PCRE on OpenVMS. They
       
   379 relate to an older version of PCRE that used fewer source files, so the exact
       
   380 commands will need changing. See the current list of source files above.
       
   381 
       
   382 "It was quite easy to compile and link the library. I don't have a formal
       
   383 make file but the attached file [reproduced below] contains the OpenVMS DCL
       
   384 commands I used to build the library. I had to add #define
       
   385 POSIX_MALLOC_THRESHOLD 10 to pcre.h since it was not defined anywhere.
       
   386 
       
   387 The library was built on:
       
   388 O/S: HP OpenVMS v7.3-1
       
   389 Compiler: Compaq C v6.5-001-48BCD
       
   390 Linker: vA13-01
       
   391 
       
   392 The test results did not match 100% due to the issues you mention in your
       
   393 documentation regarding isprint(), iscntrl(), isgraph() and ispunct(). I
       
   394 modified some of the character tables temporarily and was able to get the
       
   395 results to match. Tests using the fr locale did not match since I don't have
       
   396 that locale loaded. The study size was always reported to be 3 less than the
       
   397 value in the standard test output files."
       
   398 
       
   399 =========================
       
   400 $! This DCL procedure builds PCRE on OpenVMS
       
   401 $!
       
   402 $! I followed the instructions in the non-unix-use file in the distribution.
       
   403 $!
       
   404 $ COMPILE == "CC/LIST/NOMEMBER_ALIGNMENT/PREFIX_LIBRARY_ENTRIES=ALL_ENTRIES
       
   405 $ COMPILE DFTABLES.C
       
   406 $ LINK/EXE=DFTABLES.EXE DFTABLES.OBJ
       
   407 $ RUN DFTABLES.EXE/OUTPUT=CHARTABLES.C
       
   408 $ COMPILE MAKETABLES.C
       
   409 $ COMPILE GET.C
       
   410 $ COMPILE STUDY.C
       
   411 $! I had to set POSIX_MALLOC_THRESHOLD to 10 in PCRE.H since the symbol
       
   412 $! did not seem to be defined anywhere.
       
   413 $! I edited pcre.h and added #DEFINE SUPPORT_UTF8 to enable UTF8 support.
       
   414 $ COMPILE PCRE.C
       
   415 $ LIB/CREATE PCRE MAKETABLES.OBJ, GET.OBJ, STUDY.OBJ, PCRE.OBJ
       
   416 $! I had to set POSIX_MALLOC_THRESHOLD to 10 in PCRE.H since the symbol
       
   417 $! did not seem to be defined anywhere.
       
   418 $ COMPILE PCREPOSIX.C
       
   419 $ LIB/CREATE PCREPOSIX PCREPOSIX.OBJ
       
   420 $ COMPILE PCRETEST.C
       
   421 $ LINK/EXE=PCRETEST.EXE PCRETEST.OBJ, PCRE/LIB, PCREPOSIX/LIB
       
   422 $! C programs that want access to command line arguments must be
       
   423 $! defined as a symbol
       
   424 $ PCRETEST :== "$ SYS$ROADSUSERS:[DMOONEY.REGEXP]PCRETEST.EXE"
       
   425 $! Arguments must be enclosed in quotes.
       
   426 $ PCRETEST "-C"
       
   427 $! Test results:
       
   428 $!
       
   429 $!   The test results did not match 100%. The functions isprint(), iscntrl(),
       
   430 $!   isgraph() and ispunct() on OpenVMS must not produce the same results
       
   431 $!   as the system that built the test output files provided with the
       
   432 $!   distribution.
       
   433 $!
       
   434 $!   The study size did not match and was always 3 less on OpenVMS.
       
   435 $!
       
   436 $!   Locale could not be set to fr
       
   437 $!
       
   438 =========================
       
   439 
       
   440 Last Updated: 05 September 2008
       
   441 ****