CyberLeo.Net >> Repos - FreeBSD/FreeBSD.git/log

]> CyberLeo.Net >> Repos - FreeBSD/FreeBSD.git/log

projects / FreeBSD / FreeBSD.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 10:37:15 +0000 (10:37 +0000)]

vfs: lockless lookup

Provides full scalability as long as all visited filesystems support the
lookup and terminal vnodes are different.

Inner workings are explained in the comment above cache_fplookup.

Capabilities and fd-relative lookups are not supported and will result in
immediate fallback to regular code.

Symlinks, ".." in the path, mount points without support for lockless lookup
and mismatched counters will result in an attempt to get a reference to the
directory vnode and continue in regular lookup. If this fails, the entire
operation is aborted and regular lookup starts from scratch. However, care is
taken that data is not copied again from userspace.

Sample benchmark:
incremental -j 104 bzImage on tmpfs:
before: 142.96s user 1025.63s system 4924% cpu 23.731 total
after: 147.36s user 313.40s system 3216% cpu 14.326 total

Sample microbenchmark: access calls to separate files in /tmpfs, 104 workers, ops/s:
before:   2165816
after:  151216530

Reviewed by:    kib
Tested by:      pho (in a patchset)
Differential Revision: https://reviews.freebsd.org/D25578

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 10:32:45 +0000 (10:32 +0000)]

vfs: add the infrastructure for lockless lookup

Reviewed by: kib
Tested by: pho (in a patchset)
Differential Revision: https://reviews.freebsd.org/D25577

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 10:31:52 +0000 (10:31 +0000)]

vfs: introduce vnode sequence counters

Modified on each permission change and link/unlink.

Reviewed by: kib
Tested by: pho (in a patchset)
Differential Revision: https://reviews.freebsd.org/D25573

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 10:29:48 +0000 (10:29 +0000)]

seqc: add a sleepable variant and convert some routines to macros

This temporarily duplicates some code.

Macro conversion convinces clang to carry predicts into consumers.

commit | commitdiff | tree

Ruslan Bukin [Sat, 25 Jul 2020 09:28:38 +0000 (09:28 +0000)]

Split-out the Intel GAS (Guest Address Space) management component
from Intel DMAR support, so it can be used on other IOMMU systems.

Reviewed by: kib
Sponsored by: DARPA/AFRL
Differential Revision: https://reviews.freebsd.org/D25743

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 07:48:20 +0000 (07:48 +0000)]

Remove duplicated content from _eventhandler.h

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 07:45:44 +0000 (07:45 +0000)]

Remove leftover macros for long gone vmsize mtx

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 07:15:23 +0000 (07:15 +0000)]

Guard sbcompress_ktls_rx with KERN_TLS

Fixes a compilation warning after r363464

commit | commitdiff | tree

Mateusz Guzik [Sat, 25 Jul 2020 07:14:33 +0000 (07:14 +0000)]

Do a lockless check in kthread_suspend_check

Otherwise an idle system running lockstat sleep 10 reports contention on
process lock comming from bufdaemon.

While here fix a style nit.

commit | commitdiff | tree

Michal Meloun [Sat, 25 Jul 2020 06:32:23 +0000 (06:32 +0000)]

Revert r363123.
As Emanuel poited me the Linux processes these clock assignments in forward
order, not in reversed. I misread the original code.
Tha problem with wrong order for assigned clocks found in tegra (and some imx)
DT should be reanalyzed and solved by different way.

MFC with: r363123
Reported by; manu

commit | commitdiff | tree

Rick Macklem [Fri, 24 Jul 2020 23:17:09 +0000 (23:17 +0000)]

Add support for ext_pgs mbufs to nfsm_uiombuflist() and nfsm_split().

This patch uses a slightly different algorithm for nfsm_uiombuflist() for
the non-ext_pgs case, where a variable called "mcp" is maintained, pointing to
the current location that mbuf data can be filled into. This avoids use of
mtod(mp, char *) + mp->m_len to calculate the location, since this does
not work for ext_pgs mbufs and I think it makes the algorithm more readable.
This change should not result in semantic changes for the non-ext_pgs case.
The patch also deletes come unneeded code.

It also adds support for anonymous page ext_pgs mbufs to nfsm_split().

This is another in the series of commits that add support to the NFS client
and server for building RPC messages in ext_pgs mbufs with anonymous pages.
This is useful so that the entire mbuf list does not need to be
copied before calling sosend() when NFS over TLS is enabled.
At this time for this case, use of ext_pgs mbufs cannot be enabled, since
ktls_encrypt() replaces the unencrypted data with encrypted data in place.

Until such time as this can be enabled, there should be no semantic change.
Also, note that this code is only used by the NFS client for a mirrored pNFS
server.

commit | commitdiff | tree

Navdeep Parhar [Fri, 24 Jul 2020 23:15:42 +0000 (23:15 +0000)]

cxgbe(4): Some updates to the common code.

Obtained from: Chelsio Communications
MFC after: 1 week
Sponsored by: Chelsio Communications

commit | commitdiff | tree

Ilya Bakulin [Fri, 24 Jul 2020 21:14:59 +0000 (21:14 +0000)]

Make it possible to get/set MMC frequency from camcontrol

Enhance camcontrol(8) so that it's possible to manually set frequency for SD/MMC cards.
While here, display more information about the current controller, such as
supported operating modes and VCCQ voltages, as well as current VCCQ voltage.

Reviewed by: manu
Approved by: imp (mentor)
Differential Revision: https://reviews.freebsd.org/D25795

commit | commitdiff | tree

Alexander Motin [Fri, 24 Jul 2020 20:52:09 +0000 (20:52 +0000)]

Introduce ipi_self_from_nmi().

It allows safe IPI sending to current CPU from NMI context.

Unlike other ipi_*() functions this waits for delivery to leave LAPIC in
a state safe for interrupted code.

MFC after: 2 weeks
Sponsored by: iXsystems, Inc.

commit | commitdiff | tree

Alexander Motin [Fri, 24 Jul 2020 20:44:50 +0000 (20:44 +0000)]

Use APIC_IPI_DEST_OTHERS for bitmapped IPIs too.

It should save bunch of LAPIC register accesses.

MFC after: 2 weeks

commit | commitdiff | tree

Alexander Motin [Fri, 24 Jul 2020 19:54:15 +0000 (19:54 +0000)]

Make lapic_ipi_vectored(APIC_IPI_DEST_SELF) NMI safe.

Sending IPI to self or all CPUs does not require write into upper part of
the ICR, prone to races.  Previously the code disabled interrupts, but it
was not enough for NMIs.  Instead of that when possible write only lower
part of the register, or use special SELF IPI register in x2APIC mode.

This also removes ICR reads used to preserve reserved bits on write.
It was there from the beginning, but I failed to find explanation why,
neither I see Linux doing it.  Specification even tells that ICR content
may be lost in deep C-states, so if hardware does not bother to preserve
it, why should we?

MFC after: 2 weeks
Sponsored by: iXsystems, Inc.

commit | commitdiff | tree

Emmanuel Vadot [Fri, 24 Jul 2020 19:52:52 +0000 (19:52 +0000)]

dwmmc: Add MMCCAM part

Add support for MMCCAM for dwmmc

Submitted by: kibab
Tested On: Rock64, RockPro64

commit | commitdiff | tree

Emmanuel Vadot [Fri, 24 Jul 2020 18:44:50 +0000 (18:44 +0000)]

mmccam: aw_mmc: Only print the new ios value under bootverbose

commit | commitdiff | tree

Emmanuel Vadot [Fri, 24 Jul 2020 18:43:46 +0000 (18:43 +0000)]

mmccam: Make non bootverbose more readable

Remove some debug printfs.
Convert some to CAM_DEBUG
Only print some when bootverbose is set.

commit | commitdiff | tree

Conrad Meyer [Fri, 24 Jul 2020 17:34:44 +0000 (17:34 +0000)]

Use gbincore_unlocked for unprotected incore()

Reviewed by: markj
Sponsored by: Isilon
Differential Revision: https://reviews.freebsd.org/D25790

commit | commitdiff | tree

Conrad Meyer [Fri, 24 Jul 2020 17:34:04 +0000 (17:34 +0000)]

Add unlocked/SMR fast path to getblk()

Convert the bufobj tries to an SMR zone/PCTRIE and add a gbincore_unlocked()
API wrapping this functionality. Use it for a fast path in getblkx(),
falling back to locked lookup if we raced a thread changing the buf's
identity.

Reported by: Attilio
Reviewed by: kib, markj
Testing: pho (in progress)
Sponsored by: Isilon
Differential Revision: https://reviews.freebsd.org/D25782

commit | commitdiff | tree

Conrad Meyer [Fri, 24 Jul 2020 17:32:10 +0000 (17:32 +0000)]

Use SMR to provide safe unlocked lookup for pctries from SMR zones

Adapt r358130, for the almost identical vm_radix, to the pctrie subsystem.
Like that change, the tree is kept correct for readers with store barriers
and careful ordering. Existing locks serialize writers.

Add a PCTRIE_DEFINE_SMR() wrapper that takes an additional smr_t parameter
and instantiates a FOO_PCTRIE_LOOKUP_UNLOCKED() function, in addition to the
usual definitions created by PCTRIE_DEFINE().

Interface consumers will be introduced in later commits.

As future work, it might be nice to add vm_radix algorithms missing from
generic pctrie to the pctrie interface, and then adapt vm_radix to use
pctrie.

Reported by: Attilio
Reviewed by: markj
Sponsored by: Isilon
Differential Revision: https://reviews.freebsd.org/D25781

commit | commitdiff | tree

Mateusz Guzik [Fri, 24 Jul 2020 17:28:24 +0000 (17:28 +0000)]

lockmgr: add missing 'continue' to account for spuriously failed fcmpset

PR: 248245
Reported by: gbe
Noted by: markj
Fixes by: r363415 ("lockmgr: add adaptive spinning")

commit | commitdiff | tree

Emmanuel Vadot [Fri, 24 Jul 2020 17:11:14 +0000 (17:11 +0000)]

mmccam: Add some aliases for non-mmccam to mmccam transition

A new tunable is present, kern.cam.sdda.mmcsd_compat to enable
this feature or not (default is enabled)

commit | commitdiff | tree

Juli Mallett [Fri, 24 Jul 2020 16:58:13 +0000 (16:58 +0000)]

Remove reference to nlist(3) missed in SCCS revision 5.26 by mckusick
when converting rwhod(8) to using kern.boottime ather than extracting
the boot time from kernel memory directly.

Reviewed by: imp

commit | commitdiff | tree

Mateusz Piotrowski [Fri, 24 Jul 2020 15:04:34 +0000 (15:04 +0000)]

Fix grammar issues and typos

Reported by: ian
MFC after: 1 week

commit | commitdiff | tree

Mateusz Piotrowski [Fri, 24 Jul 2020 14:17:37 +0000 (14:17 +0000)]

Document that force_depend() supports only /etc/rc.d scripts

Currently, force_depend() from rc.subr(8) does not support depending on
scripts outside of /etc/rc.d (like /usr/local/etc/rc.d). The /etc/rc.d path
is hard-coded into force_depend().

MFC after: 1 week

commit | commitdiff | tree

Mateusz Guzik [Fri, 24 Jul 2020 13:23:32 +0000 (13:23 +0000)]

vm: fix swap reservation leak and clean up surrounding code

The code did not subtract from the global counter if per-uid reservation
failed.

Cleanup highlights:
- load overcommit once
- move per-uid manipulation to dedicated routines
- don't fetch wire count if requested size is below the limit
- convert return type from int to bool
- ifdef the routines with _KERNEL to keep vm.h compilable by userspace

Reviewed by: kib (previous version)
Differential Revision: https://reviews.freebsd.org/D25787

commit | commitdiff | tree

Alex Richardson [Fri, 24 Jul 2020 08:40:04 +0000 (08:40 +0000)]

Include TMPFS in all the GENERIC kernel configs

Being able to use tmpfs without kernel modules is very useful when building
small MFS_ROOT kernels without a real file system.
Including TMPFS also matches arm/GENERIC and the MIPS std.MALTA configs.

Compiling TMPFS only adds 4 .c files so this should not make much of a
difference to NO_MODULES build times (as we do for our minimal RISC-V
images).

Reviewed By: br (earlier version for riscv), brooks, emaste
Differential Revision: https://reviews.freebsd.org/D25317

commit | commitdiff | tree

John-Mark Gurney [Fri, 24 Jul 2020 00:47:14 +0000 (00:47 +0000)]

fix up docs for m_getjcl as well..

commit | commitdiff | tree

John-Mark Gurney [Fri, 24 Jul 2020 00:35:21 +0000 (00:35 +0000)]

document that m_get2 only accepts up to MJUMPAGESIZE..

commit | commitdiff | tree

John Baldwin [Thu, 23 Jul 2020 23:48:18 +0000 (23:48 +0000)]

Add support for KTLS RX via software decryption.

Allow TLS records to be decrypted in the kernel after being received
by a NIC.  At a high level this is somewhat similar to software KTLS
for the transmit path except in reverse.  Protocols enqueue mbufs
containing encrypted TLS records (or portions of records) into the
tail of a socket buffer and the KTLS layer decrypts those records
before returning them to userland applications.  However, there is an
important difference:

- In the transmit case, the socket buffer is always a single "record"
  holding a chain of mbufs.  Not-yet-encrypted mbufs are marked not
  ready (M_NOTREADY) and released to protocols for transmit by marking
  mbufs ready once their data is encrypted.

- In the receive case, incoming (encrypted) data appended to the
  socket buffer is still a single stream of data from the protocol,
  but decrypted TLS records are stored as separate records in the
  socket buffer and read individually via recvmsg().

Initially I tried to make this work by marking incoming mbufs as
M_NOTREADY, but there didn't seemed to be a non-gross way to deal with
picking a portion of the mbuf chain and turning it into a new record
in the socket buffer after decrypting the TLS record it contained
(along with prepending a control message).  Also, such mbufs would
also need to be "pinned" in some way while they are being decrypted
such that a concurrent sbcut() wouldn't free them out from under the
thread performing decryption.

As such, I settled on the following solution:

- Socket buffers now contain an additional chain of mbufs (sb_mtls,
  sb_mtlstail, and sb_tlscc) containing encrypted mbufs appended by
  the protocol layer.  These mbufs are still marked M_NOTREADY, but
  soreceive*() generally don't know about them (except that they will
  block waiting for data to be decrypted for a blocking read).

- Each time a new mbuf is appended to this TLS mbuf chain, the socket
  buffer peeks at the TLS record header at the head of the chain to
  determine the encrypted record's length.  If enough data is queued
  for the TLS record, the socket is placed on a per-CPU TLS workqueue
  (reusing the existing KTLS workqueues and worker threads).

- The worker thread loops over the TLS mbuf chain decrypting records
  until it runs out of data.  Each record is detached from the TLS
  mbuf chain while it is being decrypted to keep the mbufs "pinned".
  However, a new sb_dtlscc field tracks the character count of the
  detached record and sbcut()/sbdrop() is updated to account for the
  detached record.  After the record is decrypted, the worker thread
  first checks to see if sbcut() dropped the record.  If so, it is
  freed (can happen when a socket is closed with pending data).
  Otherwise, the header and trailer are stripped from the original
  mbufs, a control message is created holding the decrypted TLS
  header, and the decrypted TLS record is appended to the "normal"
  socket buffer chain.

(Side note: the SBCHECK() infrastucture was very useful as I was
able to add assertions there about the TLS chain that caught several
bugs during development.)

Tested by: rmacklem (various versions)
Relnotes: yes
Sponsored by: Chelsio Communications
Differential Revision: https://reviews.freebsd.org/D24628

commit | commitdiff | tree

Bryan Drewery [Thu, 23 Jul 2020 23:29:50 +0000 (23:29 +0000)]

Limit gmirror failpoint tests to the test worker

This avoids injecting errors into the test system's mirrors.

gnop seems like a good solution here but it injects errors at the wrong
place vs where these tests expect and does not support a 'max global count'
like the failpoints do with 'n*' syntax.

Reviewed by: cem, vangyzen
Sponsored by: Dell EMC Isilon

commit | commitdiff | tree

John-Mark Gurney [Thu, 23 Jul 2020 22:28:35 +0000 (22:28 +0000)]

update example to make it active when creating a new boot method...

Clean up some of the sentences and grammar...

make igor happy..

commit | commitdiff | tree

John Baldwin [Thu, 23 Jul 2020 21:43:06 +0000 (21:43 +0000)]

Consolidate duplicated code into a ktls_ocf_dispatch function.

This function manages the loop around crypto_dispatch and coordination
with ktls_ocf_callback.

Sponsored by: Netflix
Differential Revision: https://reviews.freebsd.org/D25757

commit | commitdiff | tree

John Baldwin [Thu, 23 Jul 2020 21:40:03 +0000 (21:40 +0000)]

Set si_trapno to the exception code from esr.

Reviewed by: kib
Sponsored by: DARPA
Differential Revision: https://reviews.freebsd.org/D25771

commit | commitdiff | tree

John Baldwin [Thu, 23 Jul 2020 21:33:10 +0000 (21:33 +0000)]

Pass the right size to memcpy() when copying the array of FP registers.

The size of the containing structure was passed instead of the size of
the array. This happened to be harmless as the extra word copied is
one we copy in the next line anyway.

Reported by: CHERI (bounds check violation)
Reviewed by: brooks, imp
Obtained from: CheriBSD
MFC after: 1 week
Sponsored by: DARPA
Differential Revision: https://reviews.freebsd.org/D25791

commit | commitdiff | tree

John Baldwin [Thu, 23 Jul 2020 20:08:42 +0000 (20:08 +0000)]

Set si_addr to badvaddr for TLB faults.

Reviewed by: kib
Sponsored by: DARPA
Differential Revision: https://reviews.freebsd.org/D25775

commit | commitdiff | tree

Ed Maste [Thu, 23 Jul 2020 20:06:24 +0000 (20:06 +0000)]

md5: return non-zero if built-in tests (-x) fail

MFC after: 1 week
Sponsored by: The FreeBSD Foundation

commit | commitdiff | tree

Michael Tuexen [Thu, 23 Jul 2020 19:43:49 +0000 (19:43 +0000)]

Clear the pointer to the socket when closing it also in case of
an ungraceful operation.
This fixes a use-after-free bug found and reported by Taylor
Brandstetter of Google by testing the userland stack.

MFC after: 1 week

commit | commitdiff | tree

Ed Maste [Thu, 23 Jul 2020 19:19:33 +0000 (19:19 +0000)]

modules/crypto: disable optimized assembly skein1024 implementation

It is presumably broken in the same way as userland skein1024 (see r363454)

PR: 248221

commit | commitdiff | tree

Ed Maste [Thu, 23 Jul 2020 18:55:47 +0000 (18:55 +0000)]

libmd: temporarily disable optimized assembly skein1024 implementation

It is apparently broken when assembled by contemporary GNU as as well as
Clang IAS (which is used in the default configuration).

PR: 248221
Reported by: pizzamig
Sponsored by: The FreeBSD Foundation

commit | commitdiff | tree

Cy Schubert [Thu, 23 Jul 2020 17:39:49 +0000 (17:39 +0000)]

Document the IPFILTER_PREDEFINED environment variable.

PR: 248088
Reported by: joeb1@a1poweruser.com
MFC after: 1 week

commit | commitdiff | tree

Cy Schubert [Thu, 23 Jul 2020 17:39:45 +0000 (17:39 +0000)]

Load ipfilter, ipnat, and ippool rules, and start ipmon in a vnet jail.

PR: 248109
Reported by: joeb1@a1poweruser.com
MFC after: 2 weeks

commit | commitdiff | tree

Mateusz Guzik [Thu, 23 Jul 2020 17:26:53 +0000 (17:26 +0000)]

locks: fix a long standing bug for primitives with kdtrace but without spinning

In such a case the second argument to lock_delay_arg_init was NULL which was
immediately causing a null pointer deref.

Since the sructure is only used for spin count, provide a dedicate routine
initializing it.

Reported by: andrew

commit | commitdiff | tree

Doug Moore [Thu, 23 Jul 2020 17:16:20 +0000 (17:16 +0000)]

Rank balanced (RB) trees are a class of balanced trees that includes
AVL trees, red-black trees, and others. Weak AVL (wavl) trees are a
recently discovered member of that class. This change replaces
red-black rebalancing with weak AVL rebalancing in the RB tree macros.

Wavl trees sit between AVL and red-black trees in terms of how
strictly balance is enforced. They have the stricter balance of AVL
trees as the tree is built - a wavl tree is an AVL tree until the
first deletion. Once removals start, wavl trees are lazier about
rebalancing than AVL trees, so that removals can be fast, but the
balance of the tree can decay to that of a red-black tree. Subsequent
insertions can push balance back toward the stricter AVL conditions.

Removing a node from a wavl tree never requires more than two
rotations, which is better than either red-black or AVL
trees. Inserting a node into a wavl tree never requires more than two
rotations, which matches red-black and AVL trees. The only
disadvantage of wavl trees to red-black trees is that more insertions
are likely to adjust the tree a bit. That's the cost of keeping the
tree more balanced.

Testing has shown that for the cases where red-black trees do worst,
wavl trees better balance leads to faster lookups, so that if lookups
outnumber insertions by a nontrivial amount, lookup time saved exceeds
the extra cost of balancing.

Reviewed by: alc, gbe, markj
Tested by: pho
Discussed with: emaste
Differential Revision: https://reviews.freebsd.org/D25480

commit | commitdiff | tree

Mark Johnston [Thu, 23 Jul 2020 15:03:28 +0000 (15:03 +0000)]

rc.firewall: Merge two identical conditions into one.

No functional change intended.

PR: 247949
Submitted by: Jose Luis Duran <jlduran@gmail.com>
MFC after: 1 week

commit | commitdiff | tree

Alexander Motin [Thu, 23 Jul 2020 14:33:25 +0000 (14:33 +0000)]

Add missing newlines.

MFC after: 3 days

commit | commitdiff | tree

Mark Johnston [Thu, 23 Jul 2020 14:21:45 +0000 (14:21 +0000)]

MFOpenZFS: Fix zpool history unbounded memory usage

In original implementation, zpool history will read the whole history
before printing anything, causing memory usage goes unbounded. We fix
this by breaking it into read-print iterations.

Reviewed-by: Tom Caputi <tcaputi@datto.com>
Reviewed-by: Matt Ahrens <matt@delphix.com>
Reviewed-by: Igor Kozhukhov <igor@dilos.org>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Signed-off-by: Chunwei Chen <david.chen@nutanix.com>
Closes #9516

Note, this change changes the libzfs.so ABI by modifying the prototype
of zpool_get_history(). Since libzfs is effectively private to the base
system it is anticipated that this will not be a problem.

PR: 247557
Obtained from: OpenZFS
Reported and tested by: Sam Vaughan <samjvaughan@gmail.com>
Discussed with: freqlabs
MFC after: 2 weeks
Differential Revision: https://reviews.freebsd.org/D25745
openzfs/zfs@7125a109dcc55628336ff3f58e59e503f4d7694d

commit | commitdiff | tree

Mark Johnston [Thu, 23 Jul 2020 14:03:37 +0000 (14:03 +0000)]

cuse: Stop checking for failures from malloc(M_WAITOK).

PR: 240545
Submitted by: Andrew Reiter <arr@watson.org>
Reviewed by: hselasky
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D25765

commit | commitdiff | tree

Mark Johnston [Thu, 23 Jul 2020 14:03:24 +0000 (14:03 +0000)]

ntb: Stop checking for failures from malloc(M_WAITOK).

PR: 240545
Submitted by: Andrew Reiter <arr@watson.org>
Reviewed by: cem, mav
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D25768

commit | commitdiff | tree

Mateusz Guzik [Thu, 23 Jul 2020 08:42:16 +0000 (08:42 +0000)]

vm: annotate swap_reserved with __exclusive_cache_line

The counter keeps being updated all the time and variables read afterwards
share the cacheline. Note this still fundamentally does not scale and needs
to be replaced, in the meantime gets a bandaid.

brk1_processes -t 52 ops/s:
before: 8598298
after: 9098080

commit | commitdiff | tree

Michael Tuexen [Thu, 23 Jul 2020 01:35:24 +0000 (01:35 +0000)]

Detect and handle an invalid reassembly constellation, which results in
a memory leak.

Thanks to Felix Weinrank for finding this issue using fuzz testing the
userland stack.

MFC after: 1 week

commit | commitdiff | tree

Brooks Davis [Wed, 22 Jul 2020 23:39:58 +0000 (23:39 +0000)]

Correct a type-mismatch between xdr_long and the variable "bad".

Way back in r28911 (August 1997, CVS rev 1.22) we imported a NetBSD
information leak fix via OpenBSD. Unfortunatly we failed to track the
followup commit that fixed the type of the error code. Apply the change
from int to long now.

Reviewed by: emaste
Found by: CHERI
Obtained from: CheriBSD
MFC after: 3 days
Sponsored by: DARPA
Differential Revision: https://reviews.freebsd.org/D25779

commit | commitdiff | tree

Brooks Davis [Wed, 22 Jul 2020 23:35:41 +0000 (23:35 +0000)]

Use SI_ORDER_(FOURTH|FIFTH) rather than bespoke versions.

No functional change.

When these SYSINITs were added these macros didn't exist.

Reviewed by: imp
Obtained from: CheriBSD
MFC after: 1 week
Sponsored by: DARPA
Differential Revision: https://reviews.freebsd.org/D25758

commit | commitdiff | tree

Rick Macklem [Wed, 22 Jul 2020 23:33:37 +0000 (23:33 +0000)]

Modify writing to mirrored pNFS DSs to prepare for use of ext_pgs mbufs.

This patch modifies writing to mirrored pNFS DSs slightly so that there is
only one m_copym() call for a mirrored pair instead of two of them.
This call replaces the custom nfsm_copym() call, which is no longer needed
and deleted by this patch. The patch does introduce a new nfsm_split()
function that only calls m_split() for the non-ext_pgs case.
The semantics of nfsm_uiombuflist() is changed to include code that nul
pads the generated mbuf list. This was done by nfsm_copym() prior to this patch.

The main reason for this change is that it allows the data to be a list
of ext_pgs mbufs, since the m_copym() is for the entire mbuf list.
This support will be added in a future commit.

This patch only affects writing to mirrored flexible file layout pNFS servers.

commit | commitdiff | tree

John Baldwin [Wed, 22 Jul 2020 22:51:14 +0000 (22:51 +0000)]

Add missing space after switch.

Reviewed by: br, emaste
Sponsored by: DARPA
Differential Revision: https://reviews.freebsd.org/D25778

commit | commitdiff | tree

Brooks Davis [Wed, 22 Jul 2020 21:44:51 +0000 (21:44 +0000)]

Avoid reading one byte before the path buffer.

This happens when there's only one component (e.g. "/foo"). This
(mostly-harmless) bug has been present since June 1990 when it was
commited to mountd.c SCCS version 5.9.

Note: the bug is on the second changed line, the first line is changed
for visual consistency.

Reviewed by: cem, emaste, mckusick, rmacklem
Found with: CHERI
Obtained from: CheriBSD
MFC after: 1 week
Sponsored by: DARPA
Differential Revision: https://reviews.freebsd.org/D25759

commit | commitdiff | tree

Alexander Motin [Wed, 22 Jul 2020 20:15:21 +0000 (20:15 +0000)]

Untie nmi_handle_intr() from DEV_ISA.

The only part of nmi_handle_intr() depending on ISA is isa_nmi(), which is
already wrapped. Entering debugger on NMI does not really depend on ISA.

MFC after: 2 weeks

commit | commitdiff | tree

Emmanuel Vadot [Wed, 22 Jul 2020 19:08:05 +0000 (19:08 +0000)]

mmccam: Add support for 1.2V and 1.8V eMMC

If the card reports that it support 1.2V or 1.8V signaling switch to this voltage.

Submitted by: kibab

commit | commitdiff | tree

Emmanuel Vadot [Wed, 22 Jul 2020 19:04:45 +0000 (19:04 +0000)]

mmccam: Add support for 1.8V sdcard

If the card reports that it support 1.8V signaling switch to this voltage.
While here update the list of mode for mmccam.

Submitted by: kibab

commit | commitdiff | tree

Emmanuel Vadot [Wed, 22 Jul 2020 18:33:36 +0000 (18:33 +0000)]

aw_mmc: Start a mmccam discovery when the CD handler is called.

Submitted by: kibab

commit | commitdiff | tree

Emmanuel Vadot [Wed, 22 Jul 2020 18:30:17 +0000 (18:30 +0000)]

mmccam: Add a generic mmccam_start_discovery function

This is a generic function start a scan request for the given
cam_sim.
Other driver can now just use this function to request a new rescan.

Submitted by: kibab

commit | commitdiff | tree

Emmanuel Vadot [Wed, 22 Jul 2020 18:21:37 +0000 (18:21 +0000)]

mmccam: Use a sbuf for the mmc ident function

While here fix a typo.

commit | commitdiff | tree

Li-Wen Hsu [Wed, 22 Jul 2020 17:37:11 +0000 (17:37 +0000)]

Fix sys.geom.class.eli.onetime_test.onetime after r363402

PR: 247954
X-MFC with: r363402
Sponsored by: The FreeBSD Foundation

commit | commitdiff | tree

Emmanuel Vadot [Wed, 22 Jul 2020 17:36:28 +0000 (17:36 +0000)]

mmc_xpt: Fix debug messages

PROBE_RESET was printed for PROBE_IDENTIFY, fix this.
While here add one for the PROBE_RESET.

Submitted by: kibab

commit | commitdiff | tree

Kyle Evans [Wed, 22 Jul 2020 17:33:35 +0000 (17:33 +0000)]

pkg-bootstrap: complain on improper `pkg bootstrap` usage

Right now, the bootstrap will gloss over things like pkg bootstrap -x or
pkg bootstrap -f pkg. Make it more clear that this is incorrect, and hint
at the correct formatting.

Reported by: jhb (IIRC via IRC)
Approved by: bapt, jhb, manu
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D24750

commit | commitdiff | tree

Mark Johnston [Wed, 22 Jul 2020 14:32:47 +0000 (14:32 +0000)]

usb(4): Stop checking for failures from malloc(M_WAITOK).

Handle the fact that parts of usb(4) can be compiled into the boot
loader, where M_WAITOK does not guarantee a successful allocation.

PR: 240545
Submitted by: Andrew Reiter <arr@watson.org> (original version)
Reviewed by: hselasky
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D25706

commit | commitdiff | tree

Tom Jones [Wed, 22 Jul 2020 13:49:54 +0000 (13:49 +0000)]

Add tests for "add", "change" and "delete" functionality of /sbin/route.

Add tests to cover "add", "change" and "delete" functionality of /sbin/route
for ipv4 and ipv6. These tests for the existing route tool are the first step
towards creating libroute.

Submitted by:   Ahsan Barkati
Sponsored by:   Google, Inc. (GSoC 2020)
Reviewed by:    kp, thj
Approved by:    bz (mentor)
MFC after:      1 month
Differential Revision:  https://reviews.freebsd.org/D25220

commit | commitdiff | tree

Gordon Bergling [Wed, 22 Jul 2020 13:00:56 +0000 (13:00 +0000)]

geli(8): Add missing commands in the EXAMPLES section

- Add a missing 'geli attach' command
- Fix the passphrase prompt for a 'geli attach' command

Reported by: Fabian Keil <freebsd-listen at fabiankeil dot de>
Reviewed by: bcr (mentor)
Approved by: bcr (mentor)
Differential Revision: https://reviews.freebsd.org/D25761

commit | commitdiff | tree

Mateusz Guzik [Wed, 22 Jul 2020 12:30:31 +0000 (12:30 +0000)]

lockmgr: add adaptive spinning

It is very conservative. Only spinning when LK_ADAPTIVE is passed, only on
exclusive lock and never when any waiters are present. buffer cache is remains
not spinning.

This reduces total sleep times during buildworld etc., but it does not shorten
total real time (culprits are contention in the vm subsystem along with slock +
upgrade which is not covered).

For microbenchmarks: open3_processes -t 52 (open/close of the same file for
writing) ops/s:
before: 258845
after: 801638

Reviewed by: kib
Tested by: pho
Differential Revision: https://reviews.freebsd.org/D25753

commit | commitdiff | tree

Xin LI [Wed, 22 Jul 2020 02:15:21 +0000 (02:15 +0000)]

Consistently use gctl_get_provider instead of home-grown variants.

Reviewed by: cem, imp
MFC after: 2 weeks
Differential revision: https://reviews.freebsd.org/D25739

commit | commitdiff | tree

Xin LI [Wed, 22 Jul 2020 02:14:27 +0000 (02:14 +0000)]

gctl_get_class, gctl_get_geom and gctl_get_provider: provide feedback
when the requested argument is missing.

Reviewed by: cem
MFC after: 2 weeks
Differential revision: https://reviews.freebsd.org/D25738

commit | commitdiff | tree

Kyle Evans [Wed, 22 Jul 2020 02:09:10 +0000 (02:09 +0000)]

libbe: annotate lbh as __unused in be_is_auto_snapshot_name

lbh is included for consistency with other functions and in case
future work needs to use it, but it is currently unused. Mark it,
and a post-OpenZFS-import world will be able to raise WARNS of
libbe to the default (pending some minor changes to openzfs libzfs).

MFC after: 3 days

commit | commitdiff | tree

Warner Losh [Wed, 22 Jul 2020 00:44:47 +0000 (00:44 +0000)]

getty appears to date from 3rd edition research unix. That's the oldest man page
on TUHS and its 'unix 1972' restoration effort has assembler sources that look
like simpler version of what's in the 5th edition.

commit | commitdiff | tree

Mitchell Horne [Tue, 21 Jul 2020 22:47:02 +0000 (22:47 +0000)]

INTRNG: only shuffle for !EARLY_AP_STARTUP

During device attachment, all interrupt sources will bind to the BSP,
as it is the only processor online. This means interrupts must be
redistributed ("shuffled") later, during SI_SUB_SMP.

For the EARLY_AP_STARTUP case, this is no longer true. SI_SUB_SMP will
execute much earlier, meaning APs will be online and available before
devices begin attachment, and there will therefore be nothing to
shuffle.

All PIC-conforming interrupt controllers will handle this early
distribution properly, except for RISC-V's PLIC. Make the necessary
tweak to the PLIC driver.

While here, convert irq_assign_cpu from a boolean_t to a bool.

Reviewed by: markj
Differential Revision: https://reviews.freebsd.org/D25693

commit | commitdiff | tree

Alexander V. Chernikov [Tue, 21 Jul 2020 19:56:13 +0000 (19:56 +0000)]

Transition from rtrequest1_fib() to rib_action().

Remove all variations of rtrequest <rtrequest1_fib, rtrequest_fib,
in6_rtrequest, rtrequest_fib> and their uses and switch to
to rib_action(). This is part of the new routing KPI.

Submitted by: Neel Chauhan <neel AT neelc DOT org>
Differential Revision: https://reviews.freebsd.org/D25546

commit | commitdiff | tree

Alan Somers [Tue, 21 Jul 2020 19:18:29 +0000 (19:18 +0000)]

Fix geli's null cipher, and add a test case

PR: 247954
Submitted by: jhb (sys), asomers (tests)
Reviewed by: jhb (tests), asomers (sys)
MFC after: 2 weeks
Sponsored by: Axcient

commit | commitdiff | tree

Dimitry Andric [Tue, 21 Jul 2020 17:34:05 +0000 (17:34 +0000)]

Merge llvm, clang, compiler-rt, libc++, libunwind, lld, lldb and openmp
10.0.1 final (aka llvmorg-10.0.1-0-gef32c611aa2).

There were no changes since rc2, except in the upstream regression
tests, which we do not ship.

Relnotes: yes
MFC after: immediately (no material changes except tag)

commit | commitdiff | tree

Alexander Motin [Tue, 21 Jul 2020 17:18:38 +0000 (17:18 +0000)]

Avoid code duplicaiton by using ipi_selected().

MFC after: 2 weeks

commit | commitdiff | tree

Alan Somers [Tue, 21 Jul 2020 16:46:40 +0000 (16:46 +0000)]

[skip ci] document close_range(2) as async-signal-safe

Reviewed by: bcr (manpages)
MFC after: 2 weeks
Sponsored by: Axcient
Differential Revision: https://reviews.freebsd.org/D25513

commit | commitdiff | tree

Richard Scheffenegger [Tue, 21 Jul 2020 16:21:52 +0000 (16:21 +0000)]

Fix style and comment around concave/convex regions in TCP cubic.

In cubic, the concave region is when snd_cwnd starts growing slower
towards max_cwnd (cwnd at the time of the congestion event), and
the convex region is when snd_cwnd starts to grow faster and
eventually appearing like slow-start like growth.

PR: 238478
Reviewed by: tuexen (mentor), rgrimes (mentor)
Approved by: tuexen (mentor), rgrimes (mentor)
MFC after: 2 weeks
Sponsored by: NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D24657

commit | commitdiff | tree

Fernando Apesteguía [Tue, 21 Jul 2020 16:17:23 +0000 (16:17 +0000)]

netstat(1): Add EXAMPLES section

* Add small EXAMPLES section
* Fix warning reported by mandoc (./netstat.1:747:2: WARNING: skipping paragraph
macro: Pp before Ss)

Approved by: manpages (gbe)
Differential Revision: https://reviews.freebsd.org/D25212

commit | commitdiff | tree

Mark Johnston [Tue, 21 Jul 2020 15:03:36 +0000 (15:03 +0000)]

traceroute6: Fix most warnings at the default WARNS level.

Fix some style issues as well. Leave -Wno-cast-aligned set for now, as
most of the warnings come casts of CMSG_DATA(), which does provide
sufficient alignment in practice.

Submitted by: Shubh Gupta <shubh@freebsd.org>
Sponsored by: Google (GSOC 2020)
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D25603

commit | commitdiff | tree

Mateusz Guzik [Tue, 21 Jul 2020 14:42:22 +0000 (14:42 +0000)]

lockmgr: denote recursion with a bit in lock value

This reduces excessive reads from the lock.

Tested by: pho

commit | commitdiff | tree

Mateusz Guzik [Tue, 21 Jul 2020 14:41:25 +0000 (14:41 +0000)]

lockmgr: rewrite upgrade to stop always dropping the lock

This matches rw and sx locks.

commit | commitdiff | tree

Mateusz Guzik [Tue, 21 Jul 2020 14:39:20 +0000 (14:39 +0000)]

lockmgr: add a helper for reading the lock value

commit | commitdiff | tree

Andrew Turner [Tue, 21 Jul 2020 14:25:36 +0000 (14:25 +0000)]

Only write to VIRTIO_MMIO_GUEST_PAGE_SIZE with virtio mmio version 1

This register is only defined for the legacy v1 interface so only write
to it when interacting with a legacy device.

Sponsored by: Innovate UK

commit | commitdiff | tree

Ruslan Bukin [Tue, 21 Jul 2020 13:50:10 +0000 (13:50 +0000)]

Move sys/iommu.h to dev/iommu/ as a part of generic IOMMU busdma backend.

Reviewed by: kib
Sponsored by: DARPA/AFRL
Differential Revision: https://reviews.freebsd.org/D25750

commit | commitdiff | tree

Ruslan Bukin [Tue, 21 Jul 2020 10:38:51 +0000 (10:38 +0000)]

Move the Intel DMAR busdma backend to a generic place so
it can be used on other IOMMU systems.

Reviewed by: kib
Sponsored by: DARPA/AFRL
Differential Revision: https://reviews.freebsd.org/D25720

commit | commitdiff | tree

Andriy Gapon [Tue, 21 Jul 2020 07:41:36 +0000 (07:41 +0000)]

dtrace/fbt: fix return probe arguments on arm

arg0 should be an offset of the return point within the function, arg1
should be the return value. Previously the return probe had arguments as
if for the entry probe.

Tested on armv7.

andrew noted that the same problem seems to be present on arm64, mips,
and riscv.
I am not sure if I will get around to fixing those. So, platform users
or anyone looking to make a contribution please be aware of this
opportunity.

Reviewed by: markj
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D25685

commit | commitdiff | tree

Andriy Gapon [Tue, 21 Jul 2020 07:35:03 +0000 (07:35 +0000)]

gpioiic: never drive lines active high

I2C communication is done by a combination of driving a line low or
letting it float, so that it is either pulled up or driven low by
another party.

r355276 besides the stated goal of the change -- using the new GPIO API
-- also changed the logic, so that active state is signaled by actively
driving a line.

That worked with iicbb prior to r362042, but stopped working after that
commit on at least some hardware. My guess that the breakage was
related to getting an ACK bit. A device expected to be able to drive
SDA actively low, but controller was actively driving it high for some
time.

Anyway, this change seems to fix the problem.
Tested using gpioiic on Orange Pi PC Plus with HTU21 sensor.

Reported by: Nick Kostirya <nikolay.kostirya@i11.co>
Reviewed by: manu
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D25684

commit | commitdiff | tree

Craig Leres [Mon, 20 Jul 2020 23:57:53 +0000 (23:57 +0000)]

Fix some regressions with the zgrep(1) wrapper.

- Handle whitespace with long flags that take arguments:

echo 'foo bar' > test
zgrep --regexp='foo bar' test

- Do not hang reading from stdin with patterns in a file:

echo foobar > test
echo foo > pattern
zgrep -f pattern test
zgrep --file=pattern test

- Handle any flags after -e:

echo foobar > test
zgrep -e foo --ignore-case < test

These two are still outstanding problems:

- Does not handle flags that take an argument if there is no
   whitespace:

zgrep -enfs /etc/rpc

- When more than one -e pattern used matching should occur for all
   patterns (similar to multiple patterns supplied with -f file).
   Instead only the last pattern is used for matching:

zgrep -e rex -e nfs /etc/rpc

   (This problem is masked in the unpatched version by the "any
   flags after -e" problem.)

Add tests for the above problems.

Update the mange and add references to gzip(1) and zstd(1) and also
document the remaining known problems.

PR: 247126
Approved by: markj
MFC after: 2 weeks
Differential Revision: https://reviews.freebsd.org/D25613

commit | commitdiff | tree

Richard Scheffenegger [Mon, 20 Jul 2020 23:47:27 +0000 (23:47 +0000)]

Add MODULE_VERSION to TCP loadable congestion control modules.

Without versioning information, using preexisting loader /
linker code is not easily possible when another module may
have dependencies on pre-loaded modules, and also doesn't
allow the automatic loading of dependent modules.

No functional change of the actual modules.

Reviewed by: tuexen (mentor), rgrimes (mentor)
Approved by: tuexen (mentor), rgrimes (mentor)
MFC after: 2 weeks
Sponsored by: NetApp, Inc.
Differential Revision: https://reviews.freebsd.org/D25744

commit | commitdiff | tree

John Baldwin [Mon, 20 Jul 2020 22:32:39 +0000 (22:32 +0000)]

Don't dynamically allocate data structures for KTLS crypto requests.

Allocate iovec arrays and struct cryptop and struct ocf_operation
objects on the stack to reduce avoid the overhead of malloc().

These structures are all small enough to fit on the stack of the KTLS
worker threads.

Reviewed by: gallatin
Sponsored by: Netflix
Differential Revision: https://reviews.freebsd.org/D25692

commit | commitdiff | tree

Vincenzo Maffione [Mon, 20 Jul 2020 21:08:56 +0000 (21:08 +0000)]

iflib: initialize netmap with the correct number of descriptors

In case the network device has a RX or TX control queue, the correct
number of TX/RX descriptors is contained in the second entry of the
isc_ntxd (or isc_nrxd) array, rather than in the first entry.
This case is correctly handled by iflib_device_register() and
iflib_pseudo_register(), but not by iflib_netmap_attach().
If the first entry is larger than the second, this can result in a
panic. This change fixes the bug by introducing two helper functions
that also lead to some code simplification.

PR: 247647
MFC after: 3 weeks
Differential Revision: https://reviews.freebsd.org/D25541

commit | commitdiff | tree

Chuck Silvers [Mon, 20 Jul 2020 20:36:32 +0000 (20:36 +0000)]

add a few tests for sendfile.

Reviewed by: markj
Sponsored by: Netflix
Differential Revision: https://reviews.freebsd.org/D25431

commit | commitdiff | tree

Mark Johnston [Mon, 20 Jul 2020 18:22:38 +0000 (18:22 +0000)]

libdwarf: Hide SHT_NOBITS sections.

gnu_debuglink external debug files will contain an .eh_frame section of
type SHT_NOBITS. libdwarf does not handle such sections (or rather, it
expects all debug sections to not have type SHT_NOBITS). Avoid loading
SHT_NOBITS sections, to be consistent with SGI libdwarf's handling of
this case.

PR: 239516
Diagnosed by: Paco Pascal <me@pacopascal.com>
Reviewed by: emaste (previous version)
Event: July 2020 Bugathon
MFC after: 1 week
Differential Revision: https://reviews.freebsd.org/D25626

commit | commitdiff | tree

Mark Johnston [Mon, 20 Jul 2020 17:44:13 +0000 (17:44 +0000)]

crypto(9): Stop checking for failures from malloc(M_WAITOK).

PR: 240545
Submitted by: Andrew Reiter <arr@watson.org>
Reviewed by: cem, delphij, jhb
MFC after: 1 week
Event: July 2020 Bugathon

commit | commitdiff | tree

Mark Johnston [Mon, 20 Jul 2020 17:05:44 +0000 (17:05 +0000)]

Fix a memory leak in dsl_scan_visitbp().

This should be triggered only if arc_read() fails, i.e., quite rarely.
The same logic is already present in OpenZFS.

PR: 247445
Submitted by: jdolecek@NetBSD.org
MFC after: 1 week

Unnamed repository; edit this file 'description' to name the repository.