20. Logging¶

RDFox supports two types of logging: API logging, which can be used with any RDFox interface, and HTTP request logging, which is specific to the RDFox endpoint. This section describes the purpose of each type of logging and provides guidance on usage.

20.1. API Logging¶

API logging has been designed in order to simplify the reproduction of issues with importation, querying or reasoning, regardless of which RDFox API is being used when an issue is encountered. To support reproducing a problem, API logging can be switched on, after which all calls to the core RDFox API are logged into a shell script. This script can later be run in the RDFox shell, and this will produce exactly the same sequence of internal calls to RDFox. Hence, replaying the script will hopefully reproduce the original problem, which is the first important step towards analyzing the problem and providing support.

Each shell command written to the script is surrounded by comments containing the time and date of the API call, the duration of the call, and the details of any exception arising as a result of the call. This information is intended to help with analyzing the problem.

When API logging is enabled, each RDFox server session creates a subdirectory of the configured logging directory whose name is derived from the date and time that the server was started. The directory will contain a single script file named script.txt. Additionally, each distinct input imported during the session is recorded as a separate file in the directory so that the shell script can repeat the relevant import operations. An exception to this is inputs imported as files: since these are already available from the local file system and may be large, API logging does not create copies.

API logging is configured via the server parameters whose names begin with api-log (see Section 4.3).

20.1.1. Recommended Usage¶

Although API logging has no impact on the function of the API, it may introduce a non-negligible performance penalty depending on the exact pattern of API calls. Accordingly, API logging is disabled by default and should only be enabled while trying to reproduce an issue.

The recommended workflow to reproduce any problem for offline analysis is to restart the RDFox server with API logging enabled, repeat the interactions which provoked the issue, restart the server again with logging disabled and then to archive the directory created by the session where logging was enabled.

When archiving a log directory for offline reproduction, the following items should be gathered:

the log directory created for the session (including all its files),
the server directory if role persistence or role and data store persistence are enabled, and
any files that were imported during the logging session.

Note that, when moving the above items to another system, manual editing of the log script may be necessary to ensure that the paths to any imported files are correct for the receiving system.

Example: The following shell command imports one local Turtle file and one TriG file hosted at the imaginary example.org::

import /home/user1/data.ttl <https://example.org/hosted-data.trig>

When API logging is enabled, this would lead to a log entry similar to the following:

########### 2021-07-12 11:19:05 +0100 ###########
# START importData on ldsc1
import + \
    "/home/user1/data.ttl" \
    "input_recording_000001+.txt"

The local file path has been used directly in the import command in the log entry (on line 4) whereas the data retrieved from example.org has been recorded into input_recording_000001+.txt and is then referenced using just that file name as a relative path (on line 5). When archiving the directory containing the above API log for reproduction later on, a copy of the file at /home/user1/data.ttl should be included and line 4 of the above snippet adjusted as needed to ensure that it refers to the same file as /home/user1/data.ttl did when the logged import operation occurred.

20.2. HTTP Request Logging¶

HTTP request logging provides standardized logging of HTTP requests serviced by the RDFox endpoint. Both the common and extended log formats are supported. In contrast to API logging, request logging has been implemented to minimize the impact on performance as much as possible, and so the use of HTTP request logging in production is recommended.

HTTP request logging is configured via the endpoint parameters request-logger and, when the extended log format logger is active, elf-logger-fields (see Section 19.2).

20.2.1. Errors and Exceptions¶

The endpoint attempts to ensure that, when a logger is configured, a single log entry is created for each attempted request, regardless of any errors that have occurred. For example, if a client connects and then sends arbitrary data not conforming to the HTTP protocol, a log entry is still made with dashes (-) in place of, for example, the request method or URI.

In the case of an error which prevents RDFox accepting a new connection, the extended log format logger supports the proprietary field identifier x-socket-exception which can be used to specify that the textual representation of such errors should be included in the log. This can be useful for diagnosing problems with the configuration of the endpoint such as when the number of concurrent connections to the endpoint exhausts the allowed limit for open file descriptors.