innduct upstream git repository
INNDUCT
Section: Maintenance Commands (8)
Index
Return to Main Contents
NAME
innduct - quickly and reliably stream Usenet articles to remote site
SYNOPSIS
innduct
[options]
site
[fqdn]
DESCRIPTION
innduct
implements NNTP peer-to-peer news transmission including the streaming
extensions, for sending news articles to a remote site. It is
intended as a replacement for
innfeed
or
nntpsend
and
innxmit.
You need to run one instance of innduct for each peer site. innduct
manages its interaction with
innd,
including flushing the feed as appropriate, etc., so that articles are
transmitted quickly, and manages the retransmission of its own
backlog. innduct includes the locking necessary to avoid multiple
simutaneous invocations.
By default, innduct reads the default feedfile corresponding to
the site
site
(ie
pathoutgoing/site)
and feeds it via NNTP, streaming if possible, to the host
fqdn.
If
fqdn
is not specified, it defaults to
site.
innduct daemonises after argument parsing, and all logging (including
error messages) are sent to syslog (facility
news).
The best way to run innduct is probably to periodically invoke it
for each feed (e.g. from cron), passing the
-q
option to arrange that innduct silently exits if an instance is
already running for that site.
GENERAL OPTIONS
- -f|--feedfile=DIR/|PATH
-
Specifies the
feedfile
to read, and indirectly specifies the paths to
be used for various associated files (see FILES, below).
If specified as
DIR/
it is taken as a directory to use, and the actual feed file used is
path/site.
If
PATH
or
DIR
does not start with a
/,
it is taken to be relative to
pathoutgoing
from inn.conf.
The default is
site.
- -q|--quiet-multiple
-
Makes innduct silently exit (with status 0) if another innduct holds
the lock for the site. Without -q, this causes a fatal error to
be logged and a nonzero exit.
- --no-daemon
-
Do not daemonise. innduct runs in the foreground, but otherwise
operates normally (logging to syslog, etc.).
- --interactive
-
Do not daemonise. innduct runs in the foreground and all messages
(including all debug messages) are written to stderr rather than
syslog. A control command line is also available on stdin/stdout.
- --no-streaming
-
Do not try to use the streaming extensions to NNTP (for use eg if the
peer can't cope when we send MODE STREAM).
- --no-filemon
-
Do not try to use the file change monitoring support to watch for
writes by innd to the feed file; poll it instead. (If file monitoring
is not compiled in, this option just downgrades the log message which
warns about this situation.)
- -C|--inndconf=FILE
-
Read
FILE
instead of the default
inn.conf.
- --port=PORT
-
Connect to port
PORT
at the remote site rather than to the NNTP port (119).
- --chdir=PATHRUN
-
Change directory to
pathrun
at startup. The default is
pathrun
from inn.conf.
- --cli=CLI-DIR/|CLI-PATH|none
-
Listen for control command line connections on
CLI-DIR/site
(if the value ends with a
/)
or
CLI-PATH
(if it doesn't). See CONTROLLING INNDUCT, below.
Note that there is a fairly short limit on the lengths of AF_UNIX
socket pathnames. If specified as
CLI-DIR/,
the directory will be created with mode 700 if necessary.
The default is
innduct/
which means to create that directory in
PATHRUN
and listen on
PATHRUN/innduct/site.
- --help
-
Just print a brief usage message and list of the options to stdout.
See TUNING OPTIONS below for more options.
CONTROLLING INNDUCT
If you tell innd to drop the feed, innduct will (when it notices,
which will normally be the next time it decides to flush) finish up the
articles it has in hand now, and then exit. It is harmless to cause
innd to flush the feed (but innduct won't notice and flushing won't
start a new feedfile; you have to leave that to innduct).
If you want to stop innduct you can send it SIGTERM or SIGINT, or the
stop
control command, in which case it will report statistics so far and
quickly exit. If innduct receives SIGKILL nothing will be broken or
corrupted; you just won't see some of the article stats.
innduct listens on an AF_UNIX socket (by default,
pathrun/innduct/site),
and provides a command-line interface which can be used to trigger
various events and for debugging. When a connection arrives, innduct
writes a prompt, reads commands a line at a time, and writes any
output back to the caller. (Everything uses unix line endings.) The
cli can most easily be accessed with a program like
netcat-openbsd
(eg
nc.openbsd -U
/var/run/news/innduct/site)
or
socat.
The prompt is
site|.
The following control commands are supported:
- h
-
Print a list of all the commands understood. This list includes
undocumented commands which mess with innduct's internal state and
should only be used by a developer in conjuction with the innduct
source code.
- flush
-
Start a new feed file and trigger a flush of the feed. (Or, cause
the
FLUSH-FINISH-PERIOD
to expire early, forcibly completing a previously started flush.)
- stop
-
Log statistics and exit. (Same effect as SIGTERM or SIGINT.)
- logstats
-
Log statistics so far and zero the stats counters. Stats are also
logged periodically, when an input file is completed and just before
tidy termination.
- show
-
Writes summary information about innduct's state to the current CLI
connection.
- dump q|a
-
Writes the same information about innduct's state to a plain text file
feedfile_dump.
This overwrites any previous dump. innduct does not ever delete these
dump files.
dump q
gives a summary including general state and a list of connections;
dump a
also includes information about each article innduct is dealing with.
- next blscan
-
Requests that innduct rescan for new backlog files at the next
PERIOD
poll. Normally innduct assumes that any backlog files dropped in by
the administrator are not urgent, and it may not get around to
noticing them for
BACKLOG-SCAN-PERIOD.
- next conn
-
Resets the connection startup delay counter so that innduct may
consider making a new connection to the peer right away, regardless
of the setting of
RECONNECT-PERIOD.
A connection attempt will still only be made if innduct feels that it
needs one, and innduct may wait up to
PERIOD
before actually starting the attempt.
TUNING OPTIONS
You should not normally need to adjust these. Time intervals may
specified in seconds, or as a number followed by one of the following
units:
s m h d,
sec min hour day,
das hs ks Ms.
- --max-connections=max
-
Restricts the maximum number of simultaneous NNTP connections
per peer to
max.
There is no global limit on the number of connections used by all
innducts, as the instances for different sites are entirely
independent.
The default is
10.
- --max-queue-per-conn=per-conn-max
-
Restricts the maximum number of outstanding articles queued on any
particular connection to
max.
(Non-streaming connections can only handle one article at a time.)
The default is
200.
- --max-queue-per-file=max
-
Restricts the maximum number articles read into core from any one
input file to
max.
The default is
twice
per-conn-max.
- --feedfile-flush-size=bytes
-
Specifies that innduct should flush the feed and start a new feedfile
when the existing feedfile size exceeds
bytes;
the effect is that the innduct will try to avoid the various
batchfiles growing much beyond this size. The default is
100000.
- --period-interval=PERIOD-INTERVAL
-
Specifies wakup interval and period granularity.
innduct wakes up every
PERIOD-INTERVAL
to do various housekeeping checks. Also, many of the timeout and
rescan intervals (those specified in this manual as
PERIOD)
are rounded up to the next multiple of
PERIOD-INTERVAL.
The default is
30s.
- --connection-timeout=TIME
-
How long to allow for a connection setup attempt before giving up.
The default is
200s.
- --stuck-flush-timeout=TIME
-
How long to wait for innd to respond to a flush request before giving
up. The default is
100s.
- --feedfile-poll=TIME
-
How often to poll the feedfile for new articles written by innd
if file monitoring
(inotify
or equivalent) is not available. (When file monitoring is available,
there is no need for periodic checks and we wake immediately up
whenever the feedfile changes.)
The default is
5s.
- --no-check-proportion=PERCENT
-
If the moving average of the proportion of articles being accepted
(rather than declined) by the peer exceeds this value, innduct uses
"no check mode" - ie it just sends the peer the articles with TAKETHIS
rather than checking first with CHECK whether the article is wanted.
This only affects streaming connections. The default is
95
(ie, 95%).
- --no-check-response-time=ARTICLES
-
The moving average mentioned above is an alpha-smoothed value with a
half-life of
ARTICLES.
The default is
100.
- --reconnect-interval=RECONNECT-PERIOD
-
Limits initiation of new connections to one each
RECONNECT-PERIOD.
This applies to reconnections if the peer has been down, and also to
ramping up the number of connections we are using after startup or in
response to an article flood. The default is
1000s.
- --flush-retry-interval=PERIOD
-
If our attempt to flush the feed failed (usually this will be because
innd is not running), try again after
PERIOD.
The default is
1000s.
- --earliest-deferred-retry=PERIOD
-
When the peer responds to our offer of an article with a 431 or 436
NNTP response code, indicating that the article has already been
offered to it by another of its peers, and that we should try again,
we wait at least
PERIOD.
before offering the article again. The default is
100s.
- --backlog-rescan-interval=BACKLOG-SCAN-PERIOD
-
We scan the directory containing
feedfile
for backlog files at least every
BACKLOG-SCAN-PERIOD,
in case the administrator has manually dropped in a file there for
processing.
The default is
300s.
- --max-flush-interval=PERIOD
-
We flush the feed and start a new feedfile at least every
PERIOD
even if the current instance of the feedfile has not reached the size
threshold.
The default is
100000s.
- --flush-finish-timeout=FLUSH-FINISH-PERIOD
-
If we flushed
FLUSH-FINISH-PERIOD
ago, and are still trying to finish processing articles that were
written to the old feed file, we forcibly and violently make sure that
we can finish the old feed file: we abandon and defer all the work,
which includes unceremoniously dropping any connections on which
we've sent some of those articles but not yet had replies, as they're
probably stuck somehow. The default is
2000s.
- --idle-timeout=PERIOD
-
Connections which have had no activity for
PERIOD
will be closed. This includes connections where we have sent commands
or articles but have not yet had the responses, so this same value
doubles as the timeout after which we conclude that the peer is
unresponsive or the connection has become broken.
The default is
1000s.
- --stats-log-interval=PERIOD
-
Log statistics at least every
PERIOD
The default is
2500s.
- --low-volume-thresh=WIN-THRESH --low-volume-window=PERIOD
-
If innduct has only one connection to the peer, and has processed
fewer than
WIN-THRESH
articles in the last
PERIOD
and also no articles in the last
PERIOD-INTERVAL
it will close the connection quickly. That is, innduct switches to a
mode where it opens a connection for each article (or, perhaps, each
handful of articles arriving together).
The default is to close if fewer than
3
articles in the last
1000s.
- --max-bad-input-data-ratio=PERCENT
-
We tolerate up to this proportion of badly-formatted lines in the
feedfile and other input files. Every badly-formatted line is logged,
but if there are too many we conclude that the corruption to our
on-disk data is too severe, and crash; to successfully restart,
administrator intervention will be required. This avoids flooding the
logs with warnings and also arranges to abort earlyish if an attempt
is made to process a file in the wrong format. We need to tolerate a
small proportion of broken lines, if for no other reason than that a
crash might leave a half-blanked-out entry. The default is
1
(ie, 1%).
- --max-bad-input-data-init=LINES
-
Additionally, we tolerate this number of additional badly-formatted
lines, so that if the badly-formatted lines are a few but at the start
of the file, we don't crash immediately.
The default is
30
(which would suffice to ignore one whole corrupt 4096-byte disk block
filled with random data, or one corrupt 1024-byte disk block filled
with an inappropriate text file with a mean line length of at least
35).
INNDUCT VS INNFEED/NNTPSEND/INNXMIT
- innfeed
-
does roughly the same thing as innduct. However, the way it receives
information from innd can result in articles being lost (not offered
to peers) if innfeed crashes for any reason. This is an inherent
defect in the innd channel feed protocol. innduct uses a file feed,
constantly "tailing" the feed file, and where implemented uses
inotify(2)
to reduce the latency which would come from having to constantly poll
the feed file. innduct is much smaller and simpler, at <5kloc to
innfeed's ~25kloc. innfeed needs a separate helper script or similar
infrastructure (of which there is an example in its manpage), whereas
innduct can be run directly and doesn't need help from shell scripts.
However, innfeed is capable of feeding multiple peers from a single
innfeed instance, whereas each innduct process handles exactly one
peer.
- nntpsend
-
processes feed files in batch mode. That is, you have to periodically
invoke nntpsend, and when you do, the feed is flushed and articles
which arrived before the flush are sent to the peer. This introduces
a batching delay, and also means that the NNTP connection to the peer
needs to be remade at each batch. nntpsend (which uses innxmit)
cannot make use of multiple connections to a single peer site.
However, nntpsend automatically find which sites need feeding by
looking in
pathoutgoing,
whereas the administrator needs to arrange to invoke innduct
separately for each peer.
- innxmit
-
is the actual NNTP feeder program used by nntpsend.
| innfeed | innduct | nntpsend/innxmit
|
realtime feed | Yes | Yes | No
|
reliable | No | Yes | Yes
|
source code size | 24kloc | 4.6kloc | 1.9kloc
|
invoke once for all sites | Yes | No | Yes
|
number of processes | one | 1/site | 2/site, intermittently
|
EXIT STATUS
- 0
-
An instance of innduct is already running for this
feedfile
and
-q
was specified.
- 4
-
The feed has been dropped by innd, and we (or previous innducts) have
successfully offered all the old articles to the peer site. Our work
is done.
- 8
-
innduct was invoked with bad options or command line arguments. The
error message will be printed to stderr, and also (if any options or
arguments were passed at all) to syslog with severity
crit.
- 12
-
Things are going wrong, hopefully shortage of memory, system file
table entries; disk IO problems; disk full; etc. The specifics of the
error will be logged to syslog with severity
err
(if syslog is working!)
- 16
-
Things are going badly wrong in an unexpected way: system calls which
are not expected to fail are doing so, or the protocol for
communicating with innd is being violated, or some such. Details will
be logged with severity
crit
(if syslog is working!)
- 24-27
-
These exit statuses are used by children forked by innduct to
communicate to the parent. You should not see them. If you do, it is
a bug.
FILES
innduct dances a somewhat complicated dance with innd to make sure
that everything goes smoothly and that there are no races. (See the
two ascii-art diagrams in README.states for details of the protocol.) Do
not mess with the feedfile and other associated files, other than as
explained here:
- pathrun
-
Default current working directory for innduct, and also default
grandparent directory for the command line socket.
- pathoutgoing/site
-
Default
feedfile.
- feedfile
-
Main feed file as specified in
newsfeeds(5).
This and other batchfiles used by innduct contains lines each of which
is of the form
token messageid
where token is the inn storage API token. Such lines can be
written by Tf,Wnm in a newsfeeds(5) entry. During
processing, innduct overwrites lines in the batch files which
correspond to articles it has processed: each such line is replaced
with one containing only spaces. Only innd should create
feedfile,
and only innduct should remove it.
- feedfile_lock
-
Lockfile, preventing multiple innduct invocations for the same
feed. A process holds this lock after it has opened the lockfile,
made an fcntl F_SETLK call, and then checked with stat and fstat that
the file it now has open and has locked still has the name
feedfile_lock. (Only) the lockholder may delete the lockfile.
For your convenience, after the lockfile is locked,
innfeed's
pid, the
site,
feedfile
and
fqdn
are all written to the lockfile. NB that stale lockfiles may contain
stale data so this information should not be relied on other than for
troubleshooting.
- feedfile_flushing
-
Batch file: the main feedfile is renamed to this filename by innduct
before it asks inn to flush the feed. Only innduct should create,
modify or remove this file.
- feedfile_defer
-
Batch file containing details of articles whose transmission has very
recently been deferred at the request of the recipient site. Created,
written, read and removed (only) by innduct.
- feedfile_backlog.time_t.inum
-
Batch file containing details of articles whose transmission has less
recently been deferred at the request of the recipient site. Created
by innduct, and will also be read, updated and removed by innduct.
However you (the administrator) may also safely remove backlog files.
- feedfile_backlogsomething
-
Batch file manually provided by the administrator. innduct will
automatically find, read and process any file matching this pattern
(blanking out entries for processed articles) and eventually remove
it. something may not contain # ~ or /.
-
Be sure to have finished writing the file before you rename it to
match the pattern feedfile_backlog*, as otherwise innduct
may find and process the file, and even think it has finished it,
before you have written the complete file. You may also safely remove
backlog files.
- pathrun/innduct/site
-
Default AF_UNIX listening socket for the control command line. See
CONTROLLING INNDUCT, above.
- feedfile_dump
-
On request via a control connection innduct dumps a summary of its
state to this text file. This is mostly useful for debugging.
- /etc/news/inn.conf
-
Used for
pathoutgoing
(to compute default
feedfile
and associated paths),
pathrun
(to compute default
PATHRUN
and hence effective default
CLI-DIR
and
CLI-PATH),
for finding how to communicate with innd, and also for
sourceaddress
and/or
sourceaddress6.
HISTORY
Written by Ian Jackson <ijackson@chiark.greenend.org.uk>
SEE ALSO
inn.conf(5),
innd(8),
newsfeeds(5)
Index
- NAME
-
- SYNOPSIS
-
- DESCRIPTION
-
- GENERAL OPTIONS
-
- CONTROLLING INNDUCT
-
- TUNING OPTIONS
-
- INNDUCT VS INNFEED/NNTPSEND/INNXMIT
-
- EXIT STATUS
-
- FILES
-
- HISTORY
-
- SEE ALSO
-
This document was created by
man2html,
using the manual pages.
Time: 14:47:09 GMT, April 30, 2025