Welcome to Data Profiler release 4.2. This document contains the most up-to-date information for running this product on Linux systems and may supersede that in the product documentation. It covers the following topics:
The following hardware and software is needed for a minimum and preferred installation:
Note The above are recommended system requirements for satisfactory operation. For maximum performance, make sure that physical memory meets the requirements of the local system and all applications.
There should be no differences in installing Data Profiler on either the Red Hat or SUSE Linux operating systems. The installation creates a Groupid called pvsw, under which dataprofiler.sh runs. The pvsw Groupid permissions are 644. Profiles and the resulting files run by a user are owned by the user who created them.
Review the following checklist before installing Data Profiler:
rpm -e DataProfiler-<version>-<date>
The license file to use for your solution is provided by your Pervasive sales representative. This license .slc file determines which integration products can be used. If you need to update features for your existing installation, contact your sales representative.
cd /<media>/<cdrom>
For media and cdrom, use your CD device name.
mkdir /mytmp
cp DataProfiler-<version>-<date>.tar.gz /mytmp
For version and date, use the values from the .tar.gz file name.
cd /mytmp
gzip -d DataProfiler-<version>-<date>.tar.gz
tar -xvf DataProfiler-<version>-<date>.tar
sh dataProfiler-<version>-<date>.sh
After installing, you must apply the license file. See To Apply the License File.
mkdir /mytmp
cp DataProfiler-<version>-<date>.tar.gz /mytmp
For version and date, use the value from the .tar.gz file name.
cd /mytmp
gzip -d DataProfiler-<version>-<date>.tar.gz
tar -xvf DataProfiler-<version>-<date>.tar
sh dataProfiler-<version>-<date>.sh
After installing, apply the license. See To Apply the License File.
The following are instructions for applying the default license file. If you do not have a license file, see Obtain a license file.
/opt/PervasiveSoftware/DataProfiler<version>/com.pervasive.profile.engine/license
chmod 644 /opt/PervasiveSoftware/DataProfiler<version>/com.pervasive.profile.engine/license /dataprofiler.slc
After installing, you may want to verify the installation. See Postinstallation Steps.
To ensure that Data Profiler is set up correctly, follow these steps after installation:
If the security settings applied during installation are not appropriate for your environment, you can revert to your previous permissions by executing the following commands as root:
chmod -R 755 /opt/PervasiveSoftware/DataProfiler<version>/dataprofiler
chmod -R 777 /opt/PervasiveSoftware/DataProfiler<version>/plugins
chmod -R 777 /opt/PervasiveSoftware/DataProfiler<version>/jars
chmod -R 777 /opt/PervasiveSoftware/DataProfiler<version>/license
The Unix security feature for Data Profiler is now disabled.
To prepare Data Profiler projects to run on the command line, you must export each project to a .jar file. This is done in the Data Profiler user interface on Windows.
After creating a .jar file, you are ready to run a profile on the command line. See Running Profile Specification Files on the Command Line.
After you have finished designing profile specification files in the user interface, you may want to run those .dp.xml files on the command line.
Before you run profiles on the command line, you must have done the following:
Example:
dataprofiler.sh -cp projects/local/clients/sales.jar -s Profiles/topsales.dp.xml
The project classpath (-cp) and the profile name (-s) are required. The project classpath contains the profile to run and all the resources that the project depends upon, such as data set descriptions. The .jar file is a resource in the project classpath, so its location should be specified using a project-relative path with forward slashes. The .jar contains the source file, connection or data set, profile file, schema file, and project file.
INFO, 2008-01-24T20:13:03.697Z, Preparing profiler engine
INFO, 2008-01-24T20:13:03.822Z, Profiler engine is ready to run jobs
INFO, 2008-01-24T20:13:03.837Z, Preparing profiling specification Profiles/topsales.dp.xml
INFO, 2008-01-24T20:13:05.868Z, Executing profiling specification Profiles/topsales.dp.xml
INFO, 2008-01-24T20:13:05.947Z, Finished profiling specification Profiles/topsales.dp.xml
INFO, 2008-01-24T20:13:05.947Z, Specification run time: 00:00:03
Output files are created in a Results folder child directory of the current directory.
Supported command line options are listed in the following table.
|
Command
|
Description
|
|---|---|
|
-cp
|
Required. Project classpath. Load dependent specifications from directories in this classpath (separated by semicolons).
The classpath must include the root directory of your project or of the .jar file to which you exported your project. This resolves resource paths relative to the project root.
|
|
-h
|
Print command line help and exit.
|
|
-l
|
Write log messages to a file. Include path and name of the log file for the profile. If none is given, the current working directory is assumed.
|
|
-m
|
Memory heap size.
Best practice suggestion: If you have performance issues and you have the system resources, begin using the default of 512M. If performance is still an issue, try a greater value. In addition, you can also pass additional JVM tuning properties. For details, see Tuning the Data Profiler Engine for Improved Performance.
|
|
-r
|
Write results to this directory. Include path and name of the results folder that contains profile output files. If none is given, current working directory and the default Results folder are used.
|
|
-mpf
|
Specify the macro property file names. Provide the full path to each file, except in the following cases:
You can specify multiple macro definition files. The last macrodef.xml file listed on the command line takes precedence over the other files.
|
|
-mp
|
Provide a value for a macro in name and value pairs. You can override previous values with this command. This option takes precedence over the -mpf option. This flag is used for individual macro settings, and more than one property can be passed at a time. If you provide more than one value, the last value takes precedence.
|
|
-rt
|
Specify an output report type. Options are csv, html, and pdf
If you do not specify a format, reports are not generated.
|
|
-s
|
Required. Run the profiling specification. Include path and name of the profile specification file. If no path is provided, the current working directory is assumed.
|
|
-version
|
Provides the version of Data Profiler application.
|
/opt/PervasiveSoftware/DataProfiler<version>/com.pervasive.profiler.engine/bin/dataprofiler.sh -cp projects/local/clients/sales.jar -mp MYMACRO=/datafiles/dp42/projects -m 256m -l ./log_files -r /datafiles/dp42/projects/results -rt csv -s Profiles/topsales.dp.xml
This example specifies the path to the .sh file, path to the project .jar file, macro name and value, memory heap size, log file folder name, results folder name, output report type, and path to the profile file.
You can tune the data profiler engine Java Virtual Machine (JVM) to improve performance when running profiles and to address resource exhaustion issues, such as OutOfMemory errors. JVM tuning controls the following factors:
When you run the engine from the command line, the -m flag sets heap memory size, but you can tune the engine further.
The dataprofiler.tuning.sh file included in your product installation provides additional JVM arguments that you can set. The file is located in the following directory:
/opt/PervasiveSoftware/DataProfiler<version>/com.pervasive.profiler.engine/bin
Open the file in a text editor to read the JVM argument notes and to see the options. You may uncomment and tune the code in the file as needed.
See Also
Running Profile Specification Files on the Command Line
This section lists the most noteworthy known issues for Linux as of the release date. If you encounter an issue not found here, contact Pervasive Support.
|
Issue
|
Resolution
|
|---|---|
|
You do not have administrative privileges for the machine. A root user ID and password are required to install the product.
|
Contact your support organization and request that your privileges be modified to Administrative for the affected machine.
|
|
While performing a maintenance installation, a message is returned saying your subscription is expired.
|
Contact your support organization to obtain an updated license. After implementing the new license, attempt to install again.
|
|
You have trouble running Data Profiler and encounter a loading of shared libraries error.
|
Put the Data Profiler installation directory at the beginning of your PATH environment variable. For details, see Postinstallation Steps. In rare cases, customers have reported problems with an existing incompatible version of one of the third-party components that we use, such as ACE, ICU, or Xerces C. Place Data Profiler at the front of the search paths to insure the component version in the Data Profiler install directory is loaded first at run time.
|
|
34877 - A profile that includes an IBM DB2 7.2 connection fails and returns an error.
|
Run any profile that uses an IBM DB2 7.2 connection on a supported Windows platform.
|
|
34931 - The current version 4.2 Linux install package does not allow you to change to the installation location to a custom location.
|
None.
|
|
34976 - The connectors Pervasive PSQL v10 and Pervasive PSQL v9 are not supported on Linux.
|
Run any profile that uses a Pervasive PSQL v10 or v9 connection on a supported Windows platform.
|
Two versions of the Oracle8 connector are included with Data Profiler to support two versions of Oracle8. The djoci8.8.0.5.so connector works with all Oracle 8.0.x client installations. The djoci8.8.1.5.so connector works with all Oracle 8.1.x client installations. You must either copy or make a symbolic link from the vendor shared library to the djoci8.so shared library in the Data Profiler installation directory.
By default, the 8.1.x version of the Oracle8 connector is installed.
To enable Oracle 8.0.x support, type the following command in the Data Profiler installation directory:
cp djoci8.8.0.5.so djoci8.so
To enable Oracle 8.1.x support, type the following command in the Data Profiler installation directory:
cp djoci8.8.1.5.so djoci8.so
Support for both the Data Direct Technologies ODBC driver manager and the IODBC driver manager are included in the installation.
By default, the ODBC 3.x connector named djodbc3.so is built for the Data Direct Technologies ODBC driver manager. The IODBC version of the ODBC 3.x connector can be used in situations where the default ODBC 3.x connector does not work with ODBC drivers from other vendors. The IODBC enabled shared library is named djodbc3.iodbc.so; the Data Direct Technologies shared library is named djodbc3.datad.so.
cp djodbc3.iodbc.so djodbc3.so
cp djodbc3.datad.so djodbc3.so
If you encounter problems not covered in this document, see the file contact.htm with the product documentation under /opt/PervasiveSoftware/DataProfiler<version>/help/pdf.
PERVASIVE SOFTWARE INC. LICENSES THE SOFTWARE AND DOCUMENTATION PRODUCT TO YOU OR YOUR COMPANY SOLELY ON AN "AS IS" BASIS AND SOLELY IN ACCORDANCE WITH THE TERMS AND CONDITIONS OF THE ACCOMPANYING LICENSE AGREEMENT.
PERVASIVE SOFTWARE INC. MAKES NO OTHER WARRANTIES WHATSOEVER, EITHER EXPRESS OR IMPLIED, REGARDING THE SOFTWARE OR THE CONTENT OF THE DOCUMENTATION; PERVASIVE SOFTWARE INC. HEREBY EXPRESSLY STATES AND YOU OR YOUR COMPANY ACKNOWLEDGES THAT PERVASIVE SOFTWARE INC. DOES NOT MAKE ANY WARRANTIES, INCLUDING, FOR EXAMPLE, WITH RESPECT TO MERCHANTABILITY, TITLE, OR FITNESS FOR ANY PARTICULAR PURPOSE OR ARISING FROM COURSE OF DEALING OR USAGE OF TRADE, AMONG OTHERS.
© Copyright 2008 Pervasive Software Inc.
All Rights Reserved.
|
Pervasive Software Inc. http://www.pervasive.com 12365 Riata Trace Pkwy, Bldg B Austin, TX 78727 USA Voice: (512) 231-6000 Fax: (512) 231-6010 Online Pervasive Contacts |
*** END OF README ***