Changelog in Linux kernel 6.12.78

ACPI: OSI: Add DMI quirk for Acer Aspire One D255 [+ + +]

Author: Sofia Schneider <[email protected]>
Date:   Sun Feb 22 23:52:40 2026 -0300

    ACPI: OSI: Add DMI quirk for Acer Aspire One D255
    
    [ Upstream commit 5ede90206273ff156a778254f0f972a55e973c89 ]
    
    The screen backlight turns off during boot (specifically during udev device
    initialization) when returning true for _OSI("Windows 2009").
    
    Analyzing the device's DSDT reveals that the firmware takes a different
    code path when Windows 7 is reported, which leads to the backlight shutoff.
    Add a DMI quirk to invoke dmi_disable_osi_win7 for this model.
    
    Signed-off-by: Sofia Schneider <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ACPI: OSL: fix __iomem type on return from acpi_os_map_generic_address() [+ + +]

Author: Ben Dooks <[email protected]>
Date:   Wed Mar 11 10:58:35 2026 +0000

    ACPI: OSL: fix __iomem type on return from acpi_os_map_generic_address()
    
    [ Upstream commit 393815f57651101f1590632092986d1d5a3a41bd ]
    
    The pointer returned from acpi_os_map_generic_address() is
    tagged with __iomem, so make the rv it is returned to also
    of void __iomem * type.
    
    Fixes the following sparse warning:
    
    drivers/acpi/osl.c:1686:20: warning: incorrect type in assignment (different address spaces)
    drivers/acpi/osl.c:1686:20:    expected void *rv
    drivers/acpi/osl.c:1686:20:    got void [noderef] __iomem *
    
    Fixes: 6915564dc5a8 ("ACPI: OSL: Change the type of acpi_os_map_generic_address() return value")
    Signed-off-by: Ben Dooks <[email protected]>
    [ rjw: Subject tweak, added Fixes tag ]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ACPI: PM: Save NVS memory on Lenovo G70-35 [+ + +]

Author: Piotr Mazek <[email protected]>
Date:   Thu Feb 5 23:05:02 2026 +0100

    ACPI: PM: Save NVS memory on Lenovo G70-35
    
    [ Upstream commit 023cd6d90f8aa2ef7b72d84be84a18e61ecebd64 ]
    
    [821d6f0359b0614792ab8e2fb93b503e25a65079] prevented machines
    produced later than 2012 from saving NVS region to accelerate S3.
    
    Despite being made after 2012, Lenovo G70-35 still needs NVS memory
    saving during S3. A quirk is introduced for this platform.
    
    Signed-off-by: Piotr Mazek <[email protected]>
    [ rjw: Subject adjustment ]
    Link: https://patch.msgid.link/GV2PPF3CD5B63CC2442EE3F76F8443EAD90D499A@GV2PPF3CD5B63CC.EURP251.PROD.OUTLOOK.COM
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ACPI: processor: Fix previous acpi_processor_errata_piix4() fix [+ + +]

Author: Rafael J. Wysocki <[email protected]>
Date:   Tue Mar 17 21:39:05 2026 +0100

    ACPI: processor: Fix previous acpi_processor_errata_piix4() fix
    
    [ Upstream commit bf504b229cb8d534eccbaeaa23eba34c05131e25 ]
    
    After commi f132e089fe89 ("ACPI: processor: Fix NULL-pointer dereference
    in acpi_processor_errata_piix4()"), device pointers may be dereferenced
    after dropping references to the device objects pointed to by them,
    which may cause a use-after-free to occur.
    
    Moreover, debug messages about enabling the errata may be printed
    if the errata flags corresponding to them are unset.
    
    Address all of these issues by moving message printing to the points
    in the code where the errata flags are set.
    
    Fixes: f132e089fe89 ("ACPI: processor: Fix NULL-pointer dereference in acpi_processor_errata_piix4()")
    Reported-by: Guenter Roeck <[email protected]>
    Closes: https://lore.kernel.org/linux-acpi/[email protected]/
    Reviewed-by: Guenter Roeck <[email protected]>
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: hda/realtek: Fix speaker pop on Star Labs StarFighter [+ + +]

Author: Sean Rhodes <[email protected]>
Date:   Thu Feb 19 20:14:26 2026 +0000

    ALSA: hda/realtek: Fix speaker pop on Star Labs StarFighter
    
    [ Upstream commit 1cb3c20688fc8380c9b365d03aea7e84faf6a9fd ]
    
    On Star Labs StarFighter (Realtek ALC233/235), the internal speakers can
    emit an audible pop when entering or leaving runtime suspend.
    
    Mute the speaker output paths via snd_hda_gen_shutup_speakers() in the
    Realtek shutup callback before the codec is powered down.
    
    This is enough to avoid the pop without special EAPD handling.
    
    Test results:
    - runtime PM pop fixed
    - still reaches D3 (PCI 0000:00:1f.3 power_state=D3hot)
    - does not address pops on cold boot (G3 exit) or around display manager
      start/shutdown
    
    journalctl -k (boot):
    - snd_hda_codec_alc269 hdaudioC0D0: ALC233: picked fixup for PCI SSID
      7017:2014
    - snd_hda_codec_alc269 hdaudioC0D0: autoconfig for ALC233: line_outs=1
      (0x1b/0x0/0x0/0x0/0x0) type:speaker
    
    Suggested-by: Takashi Iwai <[email protected]>
    Tested-by: Sean Rhodes <[email protected]>
    Signed-off-by: Sean Rhodes <[email protected]>
    Link: https://patch.msgid.link/4d5fb71b132bb283fd41c622b8413770b2065242.1771532060.git.sean@starlabs.systems
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: pcm: fix use-after-free on linked stream runtime in snd_pcm_drain() [+ + +]

Author: Mehul Rao <[email protected]>
Date:   Thu Mar 5 14:35:07 2026 -0500

    ALSA: pcm: fix use-after-free on linked stream runtime in snd_pcm_drain()
    
    commit 9b1dbd69ba6f8f8c69bc7b77c2ce3b9c6ed05ba6 upstream.
    
    In the drain loop, the local variable 'runtime' is reassigned to a
    linked stream's runtime (runtime = s->runtime at line 2157).  After
    releasing the stream lock at line 2169, the code accesses
    runtime->no_period_wakeup, runtime->rate, and runtime->buffer_size
    (lines 2170-2178) — all referencing the linked stream's runtime without
    any lock or refcount protecting its lifetime.
    
    A concurrent close() on the linked stream's fd triggers
    snd_pcm_release_substream() → snd_pcm_drop() → pcm_release_private()
    → snd_pcm_unlink() → snd_pcm_detach_substream() → kfree(runtime).
    No synchronization prevents kfree(runtime) from completing while the
    drain path dereferences the stale pointer.
    
    Fix by caching the needed runtime fields (no_period_wakeup, rate,
    buffer_size) into local variables while still holding the stream lock,
    and using the cached values after the lock is released.
    
    Fixes: f2b3614cefb6 ("ALSA: PCM - Don't check DMA time-out too shortly")
    Cc: [email protected]
    Signed-off-by: Mehul Rao <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ALSA: usb-audio: Avoid implicit feedback mode on DIYINHK USB Audio 2.0 [+ + +]

Author: Takashi Iwai <[email protected]>
Date:   Wed Feb 25 09:52:30 2026 +0100

    ALSA: usb-audio: Avoid implicit feedback mode on DIYINHK USB Audio 2.0
    
    [ Upstream commit c5bf24c8aba1ff711226ee0f039ff01a5754692b ]
    
    Although DIYINHK USB Audio 2.0 (ID 20b1:2009) shows the implicit
    feedback source for the capture stream, this would cause several
    problems for the playback.  Namely, the device can get wMaxPackSize
    1024 for 24/32 bit format with 6 channels, and when a high sample rate
    like 352.8kHz or 384kHz is played, the packet size overflows the max
    limit.  Also, the device has another two playback altsets, and those
    aren't properly handled with the implicit feedback.
    
    Since the device has been working well even before introducing the
    implicit feedback, we can assume that it works fine in the async mode.
    This patch adds the explicit skip of the implicit fb detection to make
    the playback running in the async mode.
    
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=221076
    Signed-off-by: Takashi Iwai <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: usb-audio: Check endpoint numbers at parsing Scarlett2 mixer interfaces [+ + +]

Author: Takashi Iwai <[email protected]>
Date:   Mon Mar 9 11:46:27 2026 +0100

    ALSA: usb-audio: Check endpoint numbers at parsing Scarlett2 mixer interfaces
    
    commit df1d8abf36ca3681c21a6809eaa9a1e01ef897a6 upstream.
    
    The Scarlett2 mixer quirk in USB-audio driver may hit a NULL
    dereference when a malformed USB descriptor is passed, since it
    assumes the presence of an endpoint in the parsed interface in
    scarlett2_find_fc_interface(), as reported by fuzzer.
    
    For avoiding the NULL dereference, just add the sanity check of
    bNumEndpoints and skip the invalid interface.
    
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/[email protected]
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/[email protected]
    Cc: <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ALSA: usb-audio: Check max frame size for implicit feedback mode, too [+ + +]

Author: Takashi Iwai <[email protected]>
Date:   Wed Feb 25 09:52:29 2026 +0100

    ALSA: usb-audio: Check max frame size for implicit feedback mode, too
    
    [ Upstream commit 7cb2a5422f5bbdf1cf32eae0eda41000485b9346 ]
    
    When the packet sizes are taken from the capture stream in the
    implicit feedback mode, the sizes might be larger than the upper
    boundary defined by the descriptor.  As already done for other
    transfer modes, we have to cap the sizes accordingly at sending,
    otherwise this would lead to an error in USB core at submission of
    URBs.
    
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=221076
    Signed-off-by: Takashi Iwai <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

amd-xgbe: fix link status handling in xgbe_rx_adaptation [+ + +]

Author: Raju Rangoju <[email protected]>
Date:   Fri Mar 6 16:46:27 2026 +0530

    amd-xgbe: fix link status handling in xgbe_rx_adaptation
    
    [ Upstream commit 6485cb96be5cd0f4bf39554737ba11322cc9b053 ]
    
    The link status bit is latched low to allow detection of momentary
    link drops. If the status indicates that the link is already down,
    read it again to obtain the current state.
    
    Fixes: 4f3b20bfbb75 ("amd-xgbe: add support for rx-adaptation")
    Signed-off-by: Raju Rangoju <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

amd-xgbe: prevent CRC errors during RX adaptation with AN disabled [+ + +]

Author: Raju Rangoju <[email protected]>
Date:   Fri Mar 6 16:46:28 2026 +0530

    amd-xgbe: prevent CRC errors during RX adaptation with AN disabled
    
    [ Upstream commit 27a4dd0c702b3b2b9cf2c045d100cc2fe8720b81 ]
    
    When operating in 10GBASE-KR mode with auto-negotiation disabled and RX
    adaptation enabled, CRC errors can occur during the RX adaptation
    process. This happens because the driver continues transmitting and
    receiving packets while adaptation is in progress.
    
    Fix this by stopping TX/RX immediately when the link goes down and RX
    adaptation needs to be re-triggered, and only re-enabling TX/RX after
    adaptation completes and the link is confirmed up. Introduce a flag to
    track whether TX/RX was disabled for adaptation so it can be restored
    correctly.
    
    This prevents packets from being transmitted or received during the RX
    adaptation window and avoids CRC errors from corrupted frames.
    
    The flag tracking the data path state is synchronized with hardware
    state in xgbe_start() to prevent stale state after device restarts.
    This ensures that after a restart cycle (where xgbe_stop disables
    TX/RX and xgbe_start re-enables them), the flag correctly reflects
    that the data path is active.
    
    Fixes: 4f3b20bfbb75 ("amd-xgbe: add support for rx-adaptation")
    Signed-off-by: Raju Rangoju <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

arm64: contpte: fix set_access_flags() no-op check for SMMU/ATS faults [+ + +]

Author: Piotr Jaroszynski <[email protected]>
Date:   Thu Mar 5 15:26:29 2026 -0800

    arm64: contpte: fix set_access_flags() no-op check for SMMU/ATS faults
    
    commit 97c5550b763171dbef61e6239cab372b9f9cd4a2 upstream.
    
    contpte_ptep_set_access_flags() compared the gathered ptep_get() value
    against the requested entry to detect no-ops. ptep_get() ORs AF/dirty
    from all sub-PTEs in the CONT block, so a dirty sibling can make the
    target appear already-dirty. When the gathered value matches entry, the
    function returns 0 even though the target sub-PTE still has PTE_RDONLY
    set in hardware.
    
    For a CPU with FEAT_HAFDBS this gathered view is fine, since hardware may
    set AF/dirty on any sub-PTE and CPU TLB behavior is effectively gathered
    across the CONT range. But page-table walkers that evaluate each
    descriptor individually (e.g. a CPU without DBM support, or an SMMU
    without HTTU, or with HA/HD disabled in CD.TCR) can keep faulting on the
    unchanged target sub-PTE, causing an infinite fault loop.
    
    Gathering can therefore cause false no-ops when only a sibling has been
    updated:
     - write faults: target still has PTE_RDONLY (needs PTE_RDONLY cleared)
     - read faults:  target still lacks PTE_AF
    
    Fix by checking each sub-PTE against the requested AF/dirty/write state
    (the same bits consumed by __ptep_set_access_flags()), using raw
    per-PTE values rather than the gathered ptep_get() view, before
    returning no-op. Keep using the raw target PTE for the write-bit unfold
    decision.
    
    Per Arm ARM (DDI 0487) D8.7.1 ("The Contiguous bit"), any sub-PTE in a CONT
    range may become the effective cached translation and software must
    maintain consistent attributes across the range.
    
    Fixes: 4602e5757bcc ("arm64/mm: wire up PTE_CONT for user mappings")
    Cc: Ryan Roberts <[email protected]>
    Cc: Catalin Marinas <[email protected]>
    Cc: Will Deacon <[email protected]>
    Cc: Jason Gunthorpe <[email protected]>
    Cc: John Hubbard <[email protected]>
    Cc: Zi Yan <[email protected]>
    Cc: Breno Leitao <[email protected]>
    Cc: [email protected]
    Reviewed-by: Alistair Popple <[email protected]>
    Reviewed-by: James Houghton <[email protected]>
    Reviewed-by: Ryan Roberts <[email protected]>
    Reviewed-by: Catalin Marinas <[email protected]>
    Tested-by: Breno Leitao <[email protected]>
    Signed-off-by: Piotr Jaroszynski <[email protected]>
    Acked-by: Balbir Singh <[email protected]>
    Signed-off-by: Will Deacon <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

arm64: dts: renesas: r9a09g057: Add RTC node [+ + +]

Author: Ovidiu Panait <[email protected]>
Date:   Fri Nov 7 21:07:05 2025 +0000

    arm64: dts: renesas: r9a09g057: Add RTC node
    
    [ Upstream commit cfc733da4e79018f88d8ac5f3a5306abbba8ef89 ]
    
    Add RTC node to Renesas RZ/V2H ("R9A09G057") SoC DTSI.
    
    Signed-off-by: Ovidiu Panait <[email protected]>
    Reviewed-by: Geert Uytterhoeven <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Geert Uytterhoeven <[email protected]>
    Stable-dep-of: a3f34651de42 ("arm64: dts: renesas: r9a09g057: Remove wdt{0,2,3} nodes")
    Signed-off-by: Sasha Levin <[email protected]>

arm64: dts: renesas: r9a09g057: Remove wdt{0,2,3} nodes [+ + +]

Author: Fabrizio Castro <[email protected]>
Date:   Tue Feb 3 12:42:46 2026 +0000

    arm64: dts: renesas: r9a09g057: Remove wdt{0,2,3} nodes
    
    [ Upstream commit a3f34651de4287138c0da19ba321ad72622b4af3 ]
    
    The HW user manual for the Renesas RZ/V2H(P) SoC (a.k.a r9a09g057)
    states that only WDT1 is supposed to be accessed by the CA55 cores.
    WDT0 is supposed to be used by the CM33 core, WDT2 is supposed
    to be used by the CR8 core 0, and WDT3 is supposed to be used
    by the CR8 core 1.
    
    Remove wdt{0,2,3} from the SoC specific device tree to make it
    compliant with the specification from the HW manual.
    
    This change is harmless as there are currently no users of the
    wdt{0,2,3} device tree nodes, only the wdt1 node is actually used.
    
    Fixes: 095105496e7d ("arm64: dts: renesas: r9a09g057: Add WDT0-WDT3 nodes")
    Signed-off-by: Fabrizio Castro <[email protected]>
    Reviewed-by: Geert Uytterhoeven <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Geert Uytterhoeven <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

arm64: mm: Add PTE_DIRTY back to PAGE_KERNEL* to fix kexec/hibernation [+ + +]

Author: Catalin Marinas <[email protected]>
Date:   Fri Feb 27 18:53:06 2026 +0000

    arm64: mm: Add PTE_DIRTY back to PAGE_KERNEL* to fix kexec/hibernation
    
    commit c25c4aa3f79a488cc270507935a29c07dc6bddfc upstream.
    
    Commit 143937ca51cc ("arm64, mm: avoid always making PTE dirty in
    pte_mkwrite()") changed pte_mkwrite_novma() to only clear PTE_RDONLY
    when PTE_DIRTY is set. This was to allow writable-clean PTEs for swap
    pages that haven't actually been written.
    
    However, this broke kexec and hibernation for some platforms. Both go
    through trans_pgd_create_copy() -> _copy_pte(), which calls
    pte_mkwrite_novma() to make the temporary linear-map copy fully
    writable. With the updated pte_mkwrite_novma(), read-only kernel pages
    (without PTE_DIRTY) remain read-only in the temporary mapping.
    While such behaviour is fine for user pages where hardware DBM or
    trapping will make them writeable, subsequent in-kernel writes by the
    kexec relocation code will fault.
    
    Add PTE_DIRTY back to all _PAGE_KERNEL* protection definitions. This was
    the case prior to 5.4, commit aa57157be69f ("arm64: Ensure
    VM_WRITE|VM_SHARED ptes are clean by default"). With the kernel
    linear-map PTEs always having PTE_DIRTY set, pte_mkwrite_novma()
    correctly clears PTE_RDONLY.
    
    Fixes: 143937ca51cc ("arm64, mm: avoid always making PTE dirty in pte_mkwrite()")
    Signed-off-by: Catalin Marinas <[email protected]>
    Cc: [email protected]
    Reported-by: Jianpeng Chang <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Cc: Will Deacon <[email protected]>
    Cc: Huang, Ying <[email protected]>
    Cc: Guenter Roeck <[email protected]>
    Reviewed-by: Huang Ying <[email protected]>
    Signed-off-by: Will Deacon <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: amd: acp3x-rt5682-max9836: Add missing error check for clock acquisition [+ + +]

Author: Chen Ni <[email protected]>
Date:   Tue Mar 10 10:42:46 2026 +0800

    ASoC: amd: acp3x-rt5682-max9836: Add missing error check for clock acquisition
    
    [ Upstream commit 53f3a900e9a383d47af7253076e19f510c5708d0 ]
    
    The acp3x_5682_init() function did not check the return value of
    clk_get(), which could lead to dereferencing error pointers in
    rt5682_clk_enable().
    
    Fix this by:
    1. Changing clk_get() to the device-managed devm_clk_get().
    2. Adding proper IS_ERR() checks for both clock acquisitions.
    
    Fixes: 6b8e4e7db3cd ("ASoC: amd: Add machine driver for Raven based platform")
    Signed-off-by: Chen Ni <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: amd: yc: Add ASUS EXPERTBOOK BM1503CDA to quirk table [+ + +]

Author: Azamat Almazbek uulu <[email protected]>
Date:   Sat Feb 21 12:48:13 2026 +0100

    ASoC: amd: yc: Add ASUS EXPERTBOOK BM1503CDA to quirk table
    
    [ Upstream commit 32fc4168fa56f6301d858c778a3d712774e9657e ]
    
    The ASUS ExpertBook BM1503CDA (Ryzen 5 7535U, Barcelo-R) has an
    internal DMIC connected through the AMD ACP (Audio CoProcessor)
    but is missing from the DMI quirk table, so the acp6x machine
    driver probe returns -ENODEV and no DMIC capture device is created.
    
    Add the DMI entry so the internal microphone works out of the box.
    
    Signed-off-by: Azamat Almazbek uulu <[email protected]>
    Reviewed-by: Vijendar Mukunda <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: amd: yc: Add DMI quirk for ASUS EXPERTBOOK PM1503CDA [+ + +]

Author: Zhang Heng <[email protected]>
Date:   Wed Mar 4 14:32:55 2026 +0800

    ASoC: amd: yc: Add DMI quirk for ASUS EXPERTBOOK PM1503CDA
    
    commit 325291b20f8a6f14b9c82edbf5d12e4e71f6adaa upstream.
    
    Add a DMI quirk for the ASUS EXPERTBOOK PM1503CDA fixing the
    issue where the internal microphone was not detected.
    
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=221070
    Cc: [email protected]
    Signed-off-by: Zhang Heng <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: cs42l43: Report insert for exotic peripherals [+ + +]

Author: Charles Keepax <[email protected]>
Date:   Mon Feb 23 09:36:16 2026 +0000

    ASoC: cs42l43: Report insert for exotic peripherals
    
    [ Upstream commit 6510e1324bcdc8caf21f6d17efe27604c48f0d64 ]
    
    For some exotic peripherals the type detect can return a reserved value
    of 0x4. This will currently return an error and not report anything to
    user-space, update this to report the insert normally.
    
    Signed-off-by: Charles Keepax <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: detect empty DMI strings [+ + +]

Author: Casey Connolly <[email protected]>
Date:   Fri Mar 6 18:47:07 2026 +0100

    ASoC: detect empty DMI strings
    
    [ Upstream commit a9683730e8b1d632674f81844ed03ddfbe4821c0 ]
    
    Some bootloaders like recent versions of U-Boot may install some DMI
    properties with empty values rather than not populate them. This manages
    to make its way through the validator and cleanup resulting in a rogue
    hyphen being appended to the card longname.
    
    Fixes: 4e01e5dbba96 ("ASoC: improve the DMI long card code in asoc-core")
    Signed-off-by: Casey Connolly <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: qcom: qdsp6: Fix q6apm remove ordering during ADSP stop and start [+ + +]

Author: Ravi Hothi <[email protected]>
Date:   Fri Feb 27 20:15:34 2026 +0530

    ASoC: qcom: qdsp6: Fix q6apm remove ordering during ADSP stop and start
    
    commit d6db827b430bdcca3976cebca7bd69cca03cde2c upstream.
    
    During ADSP stop and start, the kernel crashes due to the order in which
    ASoC components are removed.
    
    On ADSP stop, the q6apm-audio .remove callback unloads topology and removes
    PCM runtimes during ASoC teardown. This deletes the RTDs that contain the
    q6apm DAI components before their removal pass runs, leaving those
    components still linked to the card and causing crashes on the next rebind.
    
    Fix this by ensuring that all dependent (child) components are removed
    first, and the q6apm component is removed last.
    
    [   48.105720] Unable to handle kernel NULL pointer dereference at virtual address 00000000000000d0
    [   48.114763] Mem abort info:
    [   48.117650]   ESR = 0x0000000096000004
    [   48.121526]   EC = 0x25: DABT (current EL), IL = 32 bits
    [   48.127010]   SET = 0, FnV = 0
    [   48.130172]   EA = 0, S1PTW = 0
    [   48.133415]   FSC = 0x04: level 0 translation fault
    [   48.138446] Data abort info:
    [   48.141422]   ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
    [   48.147079]   CM = 0, WnR = 0, TnD = 0, TagAccess = 0
    [   48.152354]   GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
    [   48.157859] user pgtable: 4k pages, 48-bit VAs, pgdp=00000001173cf000
    [   48.164517] [00000000000000d0] pgd=0000000000000000, p4d=0000000000000000
    [   48.171530] Internal error: Oops: 0000000096000004 [#1]  SMP
    [   48.177348] Modules linked in: q6prm_clocks q6apm_lpass_dais q6apm_dai snd_q6dsp_common q6prm snd_q6apm 8021q garp mrp stp llc snd_soc_hdmi_codec apr pdr_interface phy_qcom_edp fastrpc qcom_pd_mapper rpmsg_ctrl qrtr_smd rpmsg_char qcom_pdr_msg qcom_iris v4l2_mem2mem videobuf2_dma_contig ath11k_pci msm ubwc_config at24 ath11k videobuf2_memops mac80211 ocmem videobuf2_v4l2 libarc4 drm_gpuvm mhi qrtr videodev drm_exec snd_soc_sc8280xp gpu_sched videobuf2_common nvmem_qcom_spmi_sdam snd_soc_qcom_sdw drm_dp_aux_bus qcom_q6v5_pas qcom_spmi_temp_alarm snd_soc_qcom_common rtc_pm8xxx qcom_pon drm_display_helper cec qcom_pil_info qcom_stats soundwire_bus drm_client_lib mc dispcc0_sa8775p videocc_sa8775p qcom_q6v5 camcc_sa8775p snd_soc_dmic phy_qcom_sgmii_eth snd_soc_max98357a i2c_qcom_geni snd_soc_core dwmac_qcom_ethqos llcc_qcom icc_bwmon qcom_sysmon snd_compress qcom_refgen_regulator coresight_stm stmmac_platform snd_pcm_dmaengine qcom_common coresight_tmc stmmac coresight_replicator qcom_glink_smem coresight_cti stm_core
    [   48.177444]  coresight_funnel snd_pcm ufs_qcom phy_qcom_qmp_usb gpi phy_qcom_snps_femto_v2 coresight phy_qcom_qmp_ufs qcom_wdt gpucc_sa8775p pcs_xpcs mdt_loader qcom_ice icc_osm_l3 qmi_helpers snd_timer snd soundcore display_connector qcom_rng nvmem_reboot_mode drm_kms_helper phy_qcom_qmp_pcie sha256 cfg80211 rfkill socinfo fuse drm backlight ipv6
    [   48.301059] CPU: 2 UID: 0 PID: 293 Comm: kworker/u32:2 Not tainted 6.19.0-rc6-dirty #10 PREEMPT
    [   48.310081] Hardware name: Qualcomm Technologies, Inc. Lemans EVK (DT)
    [   48.316782] Workqueue: pdr_notifier_wq pdr_notifier_work [pdr_interface]
    [   48.323672] pstate: 20400005 (nzCv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
    [   48.330825] pc : mutex_lock+0xc/0x54
    [   48.334514] lr : soc_dapm_shutdown_dapm+0x44/0x174 [snd_soc_core]
    [   48.340794] sp : ffff800084ddb7b0
    [   48.344207] x29: ffff800084ddb7b0 x28: ffff00009cd9cf30 x27: ffff00009cd9cc00
    [   48.351544] x26: ffff000099610190 x25: ffffa31d2f19c810 x24: ffffa31d2f185098
    [   48.358869] x23: ffff800084ddb7f8 x22: 0000000000000000 x21: 00000000000000d0
    [   48.366198] x20: ffff00009ba6c338 x19: ffff00009ba6c338 x18: 00000000ffffffff
    [   48.373528] x17: 000000040044ffff x16: ffffa31d4ae6dca8 x15: 072007740775076f
    [   48.380853] x14: 0765076d07690774 x13: 00313a323a656369 x12: 767265733a637673
    [   48.388182] x11: 00000000000003f9 x10: ffffa31d4c7dea98 x9 : 0000000000000001
    [   48.395519] x8 : ffff00009a2aadc0 x7 : 0000000000000003 x6 : 0000000000000000
    [   48.402854] x5 : 0000000000000000 x4 : 0000000000000028 x3 : ffff000ef397a698
    [   48.410180] x2 : ffff00009a2aadc0 x1 : 0000000000000000 x0 : 00000000000000d0
    [   48.417506] Call trace:
    [   48.420025]  mutex_lock+0xc/0x54 (P)
    [   48.423712]  snd_soc_dapm_shutdown+0x44/0xbc [snd_soc_core]
    [   48.429447]  soc_cleanup_card_resources+0x30/0x2c0 [snd_soc_core]
    [   48.435719]  snd_soc_bind_card+0x4dc/0xcc0 [snd_soc_core]
    [   48.441278]  snd_soc_add_component+0x27c/0x2c8 [snd_soc_core]
    [   48.447192]  snd_soc_register_component+0x9c/0xf4 [snd_soc_core]
    [   48.453371]  devm_snd_soc_register_component+0x64/0xc4 [snd_soc_core]
    [   48.459994]  apm_probe+0xb4/0x110 [snd_q6apm]
    [   48.464479]  apr_device_probe+0x24/0x40 [apr]
    [   48.468964]  really_probe+0xbc/0x298
    [   48.472651]  __driver_probe_device+0x78/0x12c
    [   48.477132]  driver_probe_device+0x40/0x160
    [   48.481435]  __device_attach_driver+0xb8/0x134
    [   48.486011]  bus_for_each_drv+0x80/0xdc
    [   48.489964]  __device_attach+0xa8/0x1b0
    [   48.493916]  device_initial_probe+0x50/0x54
    [   48.498219]  bus_probe_device+0x38/0xa0
    [   48.502170]  device_add+0x590/0x760
    [   48.505761]  device_register+0x20/0x30
    [   48.509623]  of_register_apr_devices+0x1d8/0x318 [apr]
    [   48.514905]  apr_pd_status+0x2c/0x54 [apr]
    [   48.519114]  pdr_notifier_work+0x8c/0xe0 [pdr_interface]
    [   48.524570]  process_one_work+0x150/0x294
    [   48.528692]  worker_thread+0x2d8/0x3d8
    [   48.532551]  kthread+0x130/0x204
    [   48.535874]  ret_from_fork+0x10/0x20
    [   48.539559] Code: d65f03c0 d5384102 d503201f d2800001 (c8e17c02)
    [   48.545823] ---[ end trace 0000000000000000 ]---
    
    Fixes: 5477518b8a0e ("ASoC: qdsp6: audioreach: add q6apm support")
    Cc: [email protected]
    Signed-off-by: Ravi Hothi <[email protected]>
    Reviewed-by: Srinivas Kandagatla <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: simple-card-utils: fix graph_util_is_ports0() for DT overlays [+ + +]

Author: Sen Wang <[email protected]>
Date:   Sun Mar 8 23:21:09 2026 -0500

    ASoC: simple-card-utils: fix graph_util_is_ports0() for DT overlays
    
    [ Upstream commit 4185b95f8a42d92d68c49289b4644546b51e252b ]
    
    graph_util_is_ports0() identifies DPCM front-end (ports@0) vs back-end
    (ports@1) by calling of_get_child_by_name() to find the first "ports"
    child and comparing pointers. This relies on child iteration order
    matching DTS source order.
    
    When the DPCM topology comes from a DT overlay, __of_attach_node()
    inserts new children at the head of the sibling list, reversing the
    order. of_get_child_by_name() then returns ports@1 instead of ports@0,
    causing all front-end links to be classified as back-ends. The card
    registers with no PCM devices.
    
    Fix this by matching the unit address directly from the node name
    instead of relying on sibling order.
    
    Fixes: 92939252458f ("ASoC: simple-card-utils: add asoc_graph_is_ports0()")
    Signed-off-by: Sen Wang <[email protected]>
    Acked-by: Kuninori Morimoto <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: simple-card-utils: use __free(device_node) for device node [+ + +]

Author: Kuninori Morimoto <[email protected]>
Date:   Tue Dec 3 02:10:23 2024 +0000

    ASoC: simple-card-utils: use __free(device_node) for device node
    
    [ Upstream commit 419d1918105e5d9926ab02f1f834bb416dc76f65 ]
    
    simple-card-utils handles many type of device_node, thus need to
    use of_node_put() in many place. Let's use __free(device_node)
    and avoid it.
    
    Signed-off-by: Kuninori Morimoto <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Stable-dep-of: 4185b95f8a42 ("ASoC: simple-card-utils: fix graph_util_is_ports0() for DT overlays")
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: soc-core: drop delayed_work_pending() check before flush [+ + +]

Author: matteo.cotifava <[email protected]>
Date:   Mon Mar 9 22:54:11 2026 +0100

    ASoC: soc-core: drop delayed_work_pending() check before flush
    
    [ Upstream commit 3c99c9f0ed60582c1c9852b685d78d5d3a50de63 ]
    
    The delayed_work_pending() check before flush_delayed_work() in
    soc_free_pcm_runtime() is unnecessary and racy. flush_delayed_work()
    is safe to call unconditionally - it is a no-op when no work is
    pending. Remove the check.
    
    The original check was added by commit 9c9b65203492 ("ASoC: core:
    only flush inited work during free") but delayed_work_pending()
    followed by flush_delayed_work() has a time-of-check/time-of-use
    window where work can become pending between the two calls.
    
    Fixes: 9c9b65203492 ("ASoC: core: only flush inited work during free")
    Signed-off-by: Matteo Cotifava <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: soc-core: flush delayed work before removing DAIs and widgets [+ + +]

Author: matteo.cotifava <[email protected]>
Date:   Mon Mar 9 22:54:12 2026 +0100

    ASoC: soc-core: flush delayed work before removing DAIs and widgets
    
    [ Upstream commit 95bc5c225513fc3c4ce169563fb5e3929fbb938b ]
    
    When a sound card is unbound while a PCM stream is open, a
    use-after-free can occur in snd_soc_dapm_stream_event(), called from
    the close_delayed_work workqueue handler.
    
    During unbind, snd_soc_unbind_card() flushes delayed work and then
    calls soc_cleanup_card_resources(). Inside cleanup,
    snd_card_disconnect_sync() releases all PCM file descriptors, and
    the resulting PCM close path can call snd_soc_dapm_stream_stop()
    which schedules new delayed work with a pmdown_time timer delay.
    Since this happens after the flush in snd_soc_unbind_card(), the
    new work is not caught. soc_remove_link_components() then frees
    DAPM widgets before this work fires, leading to the use-after-free.
    
    The existing flush in soc_free_pcm_runtime() also cannot help as it
    runs after soc_remove_link_components() has already freed the widgets.
    
    Add a flush in soc_cleanup_card_resources() after
    snd_card_disconnect_sync() (after which no new PCM closes can
    schedule further delayed work) and before soc_remove_link_dais()
    and soc_remove_link_components() (which tear down the structures the
    delayed work accesses).
    
    Fixes: e894efef9ac7 ("ASoC: core: add support to card rebind")
    Signed-off-by: Matteo Cotifava <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ata: libata-core: Add BRIDGE_OK quirk for QEMU drives [+ + +]

Author: Pedro Falcato <[email protected]>
Date:   Thu Mar 5 14:53:12 2026 +0000

    ata: libata-core: Add BRIDGE_OK quirk for QEMU drives
    
    commit b92b0075ee1870f78f59ab1f7da7dbfdd718ad7a upstream.
    
    Currently, whenever you boot with a QEMU drive over an AHCI interface,
    you get:
    [    1.632121] ata1.00: applying bridge limits
    
    This happens due to the kernel not believing the given drive is SATA,
    since word 93 of IDENTIFY (ATA_ID_HW_CONFIG) is non-zero. The result is
    a pretty severe limit in max_hw_sectors_kb, which limits our IO sizes.
    
    QEMU has set word 93 erroneously for SATA drives but does not, in any
    way, emulate any of these real hardware details. There is no PATA
    drive and no SATA cable.
    
    As such, add a BRIDGE_OK quirk for QEMU HARDDISK. Special care is taken
    to limit this quirk to "2.5+", to allow for fixed future versions.
    
    This results in the max_hw_sectors being limited solely by the
    controller interface's limits. Which, for AHCI controllers, takes it
    from 128KB to 32767KB.
    
    Cc: [email protected]
    Signed-off-by: Pedro Falcato <[email protected]>
    Reviewed-by: Damien Le Moal <[email protected]>
    Reviewed-by: Hannes Reinecke <[email protected]>
    Signed-off-by: Niklas Cassel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ata: libata-core: disable LPM on ADATA SU680 SSD [+ + +]

Author: Damien Le Moal <[email protected]>
Date:   Sun Mar 15 07:24:15 2026 +0900

    ata: libata-core: disable LPM on ADATA SU680 SSD
    
    commit ce5ae93d1a216680460040c7c0465a6e3b629dec upstream.
    
    ADATA SU680 SSDs suffer from NCQ read and write commands timeouts or bus
    errors when link power management (LPM) is enabled. Flag these devices
    with the ATA_QUIRK_NOLPM quirk to prevent the use of LPM and avoid these
    command failures.
    
    Reported-by: Mohammad Khaled Bayan <[email protected]>
    Closes: https://bugs.launchpad.net/ubuntu/+source/linux-hwe-6.17/+bug/2144060
    Cc: [email protected]
    Tested-by: Mohammad-Khaled Bayan <[email protected]>
    Signed-off-by: Damien Le Moal <[email protected]>
    Reviewed-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Niklas Cassel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ata: libata-core: Disable LPM on ST1000DM010-2EP102 [+ + +]

Author: Maximilian Pezzullo <[email protected]>
Date:   Wed Mar 4 08:22:59 2026 +0100

    ata: libata-core: Disable LPM on ST1000DM010-2EP102
    
    commit b3b1d3ae1d87bc9398fb715c945968bf4c75a09a upstream.
    
    According to a user report, the ST1000DM010-2EP102 has problems with LPM,
    causing random system freezes. The drive belongs to the same BarraCuda
    family as the ST2000DM008-2FR102 which has the same issue.
    
    Cc: [email protected]
    Fixes: 7627a0edef54 ("ata: ahci: Drop low power policy board type")
    Reported-by: Filippo Baiamonte <[email protected]>
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=221163
    Signed-off-by: Maximilian Pezzullo <[email protected]>
    Reviewed-by: Damien Le Moal <[email protected]>
    Signed-off-by: Niklas Cassel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ata: libata-scsi: report correct sense field pointer in ata_scsiop_maint_in() [+ + +]

Author: Damien Le Moal <[email protected]>
Date:   Fri Mar 20 17:54:45 2026 -0400

    ata: libata-scsi: report correct sense field pointer in ata_scsiop_maint_in()
    
    [ Upstream commit e6d7eba23b666d85cacee0643be280d6ce1ebffc ]
    
    Commit 4ab7bb976343 ("ata: libata-scsi: Refactor ata_scsiop_maint_in()")
    modified ata_scsiop_maint_in() to directly call
    ata_scsi_set_invalid_field() to set the field pointer of the sense data
    of a failed MAINTENANCE IN command. However, in the case of an invalid
    command format, the sense data field incorrectly indicates byte 1 of
    the CDB. Fix this to indicate byte 2 of the command.
    
    Reported-by: Guenter Roeck <[email protected]>
    Fixes: 4ab7bb976343 ("ata: libata-scsi: Refactor ata_scsiop_maint_in()")
    Cc: [email protected]
    Signed-off-by: Damien Le Moal <[email protected]>
    Signed-off-by: Niklas Cassel <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ata: libata-scsi: Return residual for emulated SCSI commands [+ + +]

Author: Damien Le Moal <[email protected]>
Date:   Fri Mar 20 17:54:44 2026 -0400

    ata: libata-scsi: Return residual for emulated SCSI commands
    
    [ Upstream commit 5251ae224d8d3caa21b28d12408062b6e75cffad ]
    
    The function ata_scsi_rbuf_fill() used to fill the reply buffer of
    emulated SCSI commands always copies the ATA reply buffer
    (ata_scsi_rbuf) up to the size of the SCSI command buffer (the transfer
    length for the command), even if the reply is shorter than the SCSI
    command buffer. This leads to issuers of the SCSI command to always get
    a result without any residual (resid is always 0) despite the
    potentially shorter reply for the command.
    
    Modify all fill actors used by ata_scsi_rbuf_fill() to return the number
    of bytes filled for the reply and 0 in case of error. Using this value,
    add a call to scsi_set_resid() in ata_scsi_rbuf_fill() to set the
    correct residual for the SCSI command when the reply length is shorter
    than the command buffer.
    
    Signed-off-by: Damien Le Moal <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Niklas Cassel <[email protected]>
    Stable-dep-of: e6d7eba23b66 ("ata: libata-scsi: report correct sense field pointer in ata_scsiop_maint_in()")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

batman-adv: Avoid double-rtnl_lock ELP metric worker [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Mon Feb 16 11:20:29 2026 +0100

    batman-adv: Avoid double-rtnl_lock ELP metric worker
    
    commit cfc83a3c71517b59c1047db57da31e26a9dc2f33 upstream.
    
    batadv_v_elp_get_throughput() might be called when the RTNL lock is already
    held. This could be problematic when the work queue item is cancelled via
    cancel_delayed_work_sync() in batadv_v_elp_iface_disable(). In this case,
    an rtnl_lock() would cause a deadlock.
    
    To avoid this, rtnl_trylock() was used in this function to skip the
    retrieval of the ethtool information in case the RTNL lock was already
    held.
    
    But for cfg80211 interfaces, batadv_get_real_netdev() was called - which
    also uses rtnl_lock(). The approach for __ethtool_get_link_ksettings() must
    also be used instead and the lockless version __batadv_get_real_netdev()
    has to be called.
    
    Cc: [email protected]
    Fixes: 8c8ecc98f5c6 ("batman-adv: Drop unmanaged ELP metric worker")
    Reported-by: Christian Schmidbauer <[email protected]>
    Signed-off-by: Sven Eckelmann <[email protected]>
    Tested-by: Sören Skaarup <[email protected]>
    Signed-off-by: Simon Wunderlich <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

batman-adv: avoid OGM aggregation when skb tailroom is insufficient [+ + +]

Author: Yang Yang <[email protected]>
Date:   Fri Mar 20 11:20:40 2026 +0100

    batman-adv: avoid OGM aggregation when skb tailroom is insufficient
    
    commit 0d4aef630be9d5f9c1227d07669c26c4383b5ad0 upstream.
    
    When OGM aggregation state is toggled at runtime, an existing forwarded
    packet may have been allocated with only packet_len bytes, while a later
    packet can still be selected for aggregation. Appending in this case can
    hit skb_put overflow conditions.
    
    Reject aggregation when the target skb tailroom cannot accommodate the new
    packet. The caller then falls back to creating a new forward packet
    instead of appending.
    
    Fixes: c6c8fea29769 ("net: Add batman-adv meshing protocol")
    Cc: [email protected]
    Reported-by: Yifan Wu <[email protected]>
    Reported-by: Juefei Pu <[email protected]>
    Signed-off-by: Yuan Tan <[email protected]>
    Signed-off-by: Xin Liu <[email protected]>
    Signed-off-by: Ao Zhou <[email protected]>
    Signed-off-by: Yang Yang <[email protected]>
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Simon Wunderlich <[email protected]>
    [ Adjust context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

binfmt_misc: restore write access before closing files opened by open_exec() [+ + +]

Author: Zilin Guan <[email protected]>
Date:   Wed Mar 4 13:43:11 2026 +0800

    binfmt_misc: restore write access before closing files opened by open_exec()
    
    [ Upstream commit 90f601b497d76f40fa66795c3ecf625b6aced9fd ]
    
    bm_register_write() opens an executable file using open_exec(), which
    internally calls do_open_execat() and denies write access on the file to
    avoid modification while it is being executed.
    
    However, when an error occurs, bm_register_write() closes the file using
    filp_close() directly. This does not restore the write permission, which
    may cause subsequent write operations on the same file to fail.
    
    Fix this by calling exe_file_allow_write_access() before filp_close() to
    restore the write permission properly.
    
    Fixes: e7850f4d844e ("binfmt_misc: fix possible deadlock in bm_register_write")
    Signed-off-by: Zilin Guan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Christian Brauner <[email protected]>
    [ Use allow_write_access() instead of exe_file_allow_write_access()
    according to commit 0357ef03c94ef
    ("fs: don't block write during exec on pre-content watched files"). ]
    Signed-off-by: Robert Garcia <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

blk-throttle: fix access race during throttle policy activation [+ + +]

Author: Han Guangjiang <[email protected]>
Date:   Fri Feb 27 13:42:23 2026 +0800

    blk-throttle: fix access race during throttle policy activation
    
    [ Upstream commit bd9fd5be6bc0836820500f68fff144609fbd85a9 ]
    
    On repeated cold boots we occasionally hit a NULL pointer crash in
    blk_should_throtl() when throttling is consulted before the throttle
    policy is fully enabled for the queue. Checking only q->td != NULL is
    insufficient during early initialization, so blkg_to_pd() for the
    throttle policy can still return NULL and blkg_to_tg() becomes NULL,
    which later gets dereferenced.
    
     Unable to handle kernel NULL pointer dereference
     at virtual address 0000000000000156
     ...
     pc : submit_bio_noacct+0x14c/0x4c8
     lr : submit_bio_noacct+0x48/0x4c8
     sp : ffff800087f0b690
     x29: ffff800087f0b690 x28: 0000000000005f90 x27: ffff00068af393c0
     x26: 0000000000080000 x25: 000000000002fbc0 x24: ffff000684ddcc70
     x23: 0000000000000000 x22: 0000000000000000 x21: 0000000000000000
     x20: 0000000000080000 x19: ffff000684ddcd08 x18: ffffffffffffffff
     x17: 0000000000000000 x16: ffff80008132a550 x15: 0000ffff98020fff
     x14: 0000000000000000 x13: 1fffe000d11d7021 x12: ffff000688eb810c
     x11: ffff00077ec4bb80 x10: ffff000688dcb720 x9 : ffff80008068ef60
     x8 : 00000a6fb8a86e85 x7 : 000000000000111e x6 : 0000000000000002
     x5 : 0000000000000246 x4 : 0000000000015cff x3 : 0000000000394500
     x2 : ffff000682e35e40 x1 : 0000000000364940 x0 : 000000000000001a
     Call trace:
      submit_bio_noacct+0x14c/0x4c8
      verity_map+0x178/0x2c8
      __map_bio+0x228/0x250
      dm_submit_bio+0x1c4/0x678
      __submit_bio+0x170/0x230
      submit_bio_noacct_nocheck+0x16c/0x388
      submit_bio_noacct+0x16c/0x4c8
      submit_bio+0xb4/0x210
      f2fs_submit_read_bio+0x4c/0xf0
      f2fs_mpage_readpages+0x3b0/0x5f0
      f2fs_readahead+0x90/0xe8
    
    Tighten blk_throtl_activated() to also require that the throttle policy
    bit is set on the queue:
    
      return q->td != NULL &&
             test_bit(blkcg_policy_throtl.plid, q->blkcg_pols);
    
    This prevents blk_should_throtl() from accessing throttle group state
    until policy data has been attached to blkgs.
    
    Fixes: a3166c51702b ("blk-throttle: delay initialization until configuration")
    Co-developed-by: Liang Jie <[email protected]>
    Signed-off-by: Liang Jie <[email protected]>
    Signed-off-by: Han Guangjiang <[email protected]>
    Reviewed-by: Yu Kuai <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Robert Garcia <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Bluetooth: hci_sync: Fix hci_le_create_conn_sync [+ + +]

Author: Michael Grzeschik <[email protected]>
Date:   Thu Mar 5 14:50:52 2026 +0100

    Bluetooth: hci_sync: Fix hci_le_create_conn_sync
    
    [ Upstream commit 2cabe7ff1001b7a197009cf50ba71701f9cbd354 ]
    
    While introducing hci_le_create_conn_sync the functionality
    of hci_connect_le was ported to hci_le_create_conn_sync including
    the disable of the scan before starting the connection.
    
    When this code was run non synchronously the immediate call that was
    setting the flag HCI_LE_SCAN_INTERRUPTED had an impact. Since the
    completion handler for the LE_SCAN_DISABLE was not immediately called.
    In the completion handler of the LE_SCAN_DISABLE event, this flag is
    checked to set the state of the hdev to DISCOVERY_STOPPED.
    
    With the synchronised approach the later setting of the
    HCI_LE_SCAN_INTERRUPTED flag has not the same effect. The completion
    handler would immediately fire in the LE_SCAN_DISABLE call, check for
    the flag, which is then not yet set and do nothing.
    
    To fix this issue and make the function call work as before, we move the
    setting of the flag HCI_LE_SCAN_INTERRUPTED before disabling the scan.
    
    Fixes: 8e8b92ee60de ("Bluetooth: hci_sync: Add hci_le_create_conn_sync")
    Signed-off-by: Michael Grzeschik <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: HIDP: Fix possible UAF [+ + +]

Author: Luiz Augusto von Dentz <[email protected]>
Date:   Thu Mar 5 10:17:47 2026 -0500

    Bluetooth: HIDP: Fix possible UAF
    
    [ Upstream commit dbf666e4fc9bdd975a61bf682b3f75cb0145eedd ]
    
    This fixes the following trace caused by not dropping l2cap_conn
    reference when user->remove callback is called:
    
    [   97.809249] l2cap_conn_free: freeing conn ffff88810a171c00
    [   97.809907] CPU: 1 UID: 0 PID: 1419 Comm: repro_standalon Not tainted 7.0.0-rc1-dirty #14 PREEMPT(lazy)
    [   97.809935] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-debian-1.17.0-1 04/01/2014
    [   97.809947] Call Trace:
    [   97.809954]  <TASK>
    [   97.809961]  dump_stack_lvl (lib/dump_stack.c:122)
    [   97.809990]  l2cap_conn_free (net/bluetooth/l2cap_core.c:1808)
    [   97.810017]  l2cap_conn_del (./include/linux/kref.h:66 net/bluetooth/l2cap_core.c:1821 net/bluetooth/l2cap_core.c:1798)
    [   97.810055]  l2cap_disconn_cfm (net/bluetooth/l2cap_core.c:7347 (discriminator 1) net/bluetooth/l2cap_core.c:7340 (discriminator 1))
    [   97.810086]  ? __pfx_l2cap_disconn_cfm (net/bluetooth/l2cap_core.c:7341)
    [   97.810117]  hci_conn_hash_flush (./include/net/bluetooth/hci_core.h:2152 (discriminator 2) net/bluetooth/hci_conn.c:2644 (discriminator 2))
    [   97.810148]  hci_dev_close_sync (net/bluetooth/hci_sync.c:5360)
    [   97.810180]  ? __pfx_hci_dev_close_sync (net/bluetooth/hci_sync.c:5285)
    [   97.810212]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810242]  ? up_write (./arch/x86/include/asm/atomic64_64.h:87 (discriminator 5) ./include/linux/atomic/atomic-arch-fallback.h:2852 (discriminator 5) ./include/linux/atomic/atomic-long.h:268 (discriminator 5) ./include/linux/atomic/atomic-instrumented.h:3391 (discriminator 5) kernel/locking/rwsem.c:1385 (discriminator 5) kernel/locking/rwsem.c:1643 (discriminator 5))
    [   97.810267]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810290]  ? rcu_is_watching (./arch/x86/include/asm/atomic.h:23 ./include/linux/atomic/atomic-arch-fallback.h:457 ./include/linux/context_tracking.h:128 kernel/rcu/tree.c:752)
    [   97.810320]  hci_unregister_dev (net/bluetooth/hci_core.c:504 net/bluetooth/hci_core.c:2716)
    [   97.810346]  vhci_release (drivers/bluetooth/hci_vhci.c:691)
    [   97.810375]  ? __pfx_vhci_release (drivers/bluetooth/hci_vhci.c:678)
    [   97.810404]  __fput (fs/file_table.c:470)
    [   97.810430]  task_work_run (kernel/task_work.c:235)
    [   97.810451]  ? __pfx_task_work_run (kernel/task_work.c:201)
    [   97.810472]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810495]  ? do_raw_spin_unlock (./include/asm-generic/qspinlock.h:128 (discriminator 5) kernel/locking/spinlock_debug.c:142 (discriminator 5))
    [   97.810527]  do_exit (kernel/exit.c:972)
    [   97.810547]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810574]  ? __pfx_do_exit (kernel/exit.c:897)
    [   97.810594]  ? lock_acquire (kernel/locking/lockdep.c:470 (discriminator 6) kernel/locking/lockdep.c:5870 (discriminator 6) kernel/locking/lockdep.c:5825 (discriminator 6))
    [   97.810616]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810639]  ? do_raw_spin_lock (kernel/locking/spinlock_debug.c:95 (discriminator 4) kernel/locking/spinlock_debug.c:118 (discriminator 4))
    [   97.810664]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810688]  ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
    [   97.810721]  do_group_exit (kernel/exit.c:1093)
    [   97.810745]  get_signal (kernel/signal.c:3007 (discriminator 1))
    [   97.810772]  ? security_file_permission (./arch/x86/include/asm/jump_label.h:37 security/security.c:2366)
    [   97.810803]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810826]  ? vfs_read (fs/read_write.c:555)
    [   97.810854]  ? __pfx_get_signal (kernel/signal.c:2800)
    [   97.810880]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810905]  ? __pfx_vfs_read (fs/read_write.c:555)
    [   97.810932]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.810960]  arch_do_signal_or_restart (arch/x86/kernel/signal.c:337 (discriminator 1))
    [   97.810990]  ? __pfx_arch_do_signal_or_restart (arch/x86/kernel/signal.c:334)
    [   97.811021]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.811055]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.811078]  ? ksys_read (fs/read_write.c:707)
    [   97.811106]  ? __pfx_ksys_read (fs/read_write.c:707)
    [   97.811137]  exit_to_user_mode_loop (kernel/entry/common.c:66 kernel/entry/common.c:98)
    [   97.811169]  ? rcu_is_watching (./arch/x86/include/asm/atomic.h:23 ./include/linux/atomic/atomic-arch-fallback.h:457 ./include/linux/context_tracking.h:128 kernel/rcu/tree.c:752)
    [   97.811192]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.811215]  ? trace_hardirqs_off (./include/trace/events/preemptirq.h:36 (discriminator 33) kernel/trace/trace_preemptirq.c:95 (discriminator 33) kernel/trace/trace_preemptirq.c:90 (discriminator 33))
    [   97.811240]  do_syscall_64 (./include/linux/irq-entry-common.h:226 ./include/linux/irq-entry-common.h:256 ./include/linux/entry-common.h:325 arch/x86/entry/syscall_64.c:100)
    [   97.811268]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   97.811292]  ? exc_page_fault (arch/x86/mm/fault.c:1480 (discriminator 3) arch/x86/mm/fault.c:1527 (discriminator 3))
    [   97.811318]  entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130)
    [   97.811338] RIP: 0033:0x445cfe
    [   97.811352] Code: Unable to access opcode bytes at 0x445cd4.
    
    Code starting with the faulting instruction
    ===========================================
    [   97.811360] RSP: 002b:00007f65c41c6dc8 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
    [   97.811378] RAX: fffffffffffffe00 RBX: 00007f65c41c76c0 RCX: 0000000000445cfe
    [   97.811391] RDX: 0000000000000400 RSI: 00007f65c41c6e40 RDI: 0000000000000004
    [   97.811403] RBP: 00007f65c41c7250 R08: 0000000000000000 R09: 0000000000000000
    [   97.811415] R10: 0000000000000000 R11: 0000000000000246 R12: ffffffffffffffe8
    [   97.811428] R13: 0000000000000000 R14: 00007fff780a8c00 R15: 00007f65c41c76c0
    [   97.811453]  </TASK>
    [   98.402453] ==================================================================
    [   98.403560] BUG: KASAN: use-after-free in __mutex_lock (kernel/locking/mutex.c:199 kernel/locking/mutex.c:694 kernel/locking/mutex.c:776)
    [   98.404541] Read of size 8 at addr ffff888113ee40a8 by task khidpd_00050004/1430
    [   98.405361]
    [   98.405563] CPU: 1 UID: 0 PID: 1430 Comm: khidpd_00050004 Not tainted 7.0.0-rc1-dirty #14 PREEMPT(lazy)
    [   98.405588] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-debian-1.17.0-1 04/01/2014
    [   98.405600] Call Trace:
    [   98.405607]  <TASK>
    [   98.405614]  dump_stack_lvl (lib/dump_stack.c:122)
    [   98.405641]  print_report (mm/kasan/report.c:379 mm/kasan/report.c:482)
    [   98.405667]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.405691]  ? __virt_addr_valid (arch/x86/mm/physaddr.c:55)
    [   98.405724]  ? __mutex_lock (kernel/locking/mutex.c:199 kernel/locking/mutex.c:694 kernel/locking/mutex.c:776)
    [   98.405748]  kasan_report (mm/kasan/report.c:221 mm/kasan/report.c:597)
    [   98.405778]  ? __mutex_lock (kernel/locking/mutex.c:199 kernel/locking/mutex.c:694 kernel/locking/mutex.c:776)
    [   98.405807]  __mutex_lock (kernel/locking/mutex.c:199 kernel/locking/mutex.c:694 kernel/locking/mutex.c:776)
    [   98.405832]  ? do_raw_spin_lock (kernel/locking/spinlock_debug.c:95 (discriminator 4) kernel/locking/spinlock_debug.c:118 (discriminator 4))
    [   98.405859]  ? l2cap_unregister_user (./include/linux/list.h:381 (discriminator 2) net/bluetooth/l2cap_core.c:1723 (discriminator 2))
    [   98.405888]  ? __pfx_do_raw_spin_lock (kernel/locking/spinlock_debug.c:114)
    [   98.405915]  ? __pfx___mutex_lock (kernel/locking/mutex.c:775)
    [   98.405939]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.405963]  ? lock_acquire (kernel/locking/lockdep.c:470 (discriminator 6) kernel/locking/lockdep.c:5870 (discriminator 6) kernel/locking/lockdep.c:5825 (discriminator 6))
    [   98.405984]  ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
    [   98.406015]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406038]  ? lock_release (kernel/locking/lockdep.c:5536 kernel/locking/lockdep.c:5889 kernel/locking/lockdep.c:5875)
    [   98.406061]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406085]  ? _raw_spin_unlock_irqrestore (./arch/x86/include/asm/irqflags.h:42 ./arch/x86/include/asm/irqflags.h:119 ./arch/x86/include/asm/irqflags.h:159 ./include/linux/spinlock_api_smp.h:178 kernel/locking/spinlock.c:194)
    [   98.406107]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406130]  ? __timer_delete_sync (kernel/time/timer.c:1592)
    [   98.406158]  ? l2cap_unregister_user (./include/linux/list.h:381 (discriminator 2) net/bluetooth/l2cap_core.c:1723 (discriminator 2))
    [   98.406186]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406210]  l2cap_unregister_user (./include/linux/list.h:381 (discriminator 2) net/bluetooth/l2cap_core.c:1723 (discriminator 2))
    [   98.406263]  hidp_session_thread (./include/linux/instrumented.h:112 ./include/linux/atomic/atomic-instrumented.h:400 ./include/linux/refcount.h:389 ./include/linux/refcount.h:432 ./include/linux/refcount.h:450 ./include/linux/kref.h:64 net/bluetooth/hidp/core.c:996 net/bluetooth/hidp/core.c:1305)
    [   98.406293]  ? __pfx_hidp_session_thread (net/bluetooth/hidp/core.c:1264)
    [   98.406323]  ? kthread (kernel/kthread.c:433)
    [   98.406340]  ? __pfx_hidp_session_wake_function (net/bluetooth/hidp/core.c:1251)
    [   98.406370]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406393]  ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
    [   98.406424]  ? __pfx_hidp_session_wake_function (net/bluetooth/hidp/core.c:1251)
    [   98.406453]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406476]  ? trace_hardirqs_on (kernel/trace/trace_preemptirq.c:79 (discriminator 1))
    [   98.406499]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406523]  ? kthread (kernel/kthread.c:433)
    [   98.406539]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406565]  ? kthread (kernel/kthread.c:433)
    [   98.406581]  ? __pfx_hidp_session_thread (net/bluetooth/hidp/core.c:1264)
    [   98.406610]  kthread (kernel/kthread.c:467)
    [   98.406627]  ? __pfx_kthread (kernel/kthread.c:412)
    [   98.406645]  ret_from_fork (arch/x86/kernel/process.c:164)
    [   98.406674]  ? __pfx_ret_from_fork (arch/x86/kernel/process.c:153)
    [   98.406704]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.406728]  ? __pfx_kthread (kernel/kthread.c:412)
    [   98.406747]  ret_from_fork_asm (arch/x86/entry/entry_64.S:258)
    [   98.406774]  </TASK>
    [   98.406780]
    [   98.433693] The buggy address belongs to the physical page:
    [   98.434405] page: refcount:0 mapcount:0 mapping:0000000000000000 index:0xffff888113ee7c40 pfn:0x113ee4
    [   98.435557] flags: 0x200000000000000(node=0|zone=2)
    [   98.436198] raw: 0200000000000000 ffffea0004244308 ffff8881f6f3ebc0 0000000000000000
    [   98.437195] raw: ffff888113ee7c40 0000000000000000 00000000ffffffff 0000000000000000
    [   98.438115] page dumped because: kasan: bad access detected
    [   98.438951]
    [   98.439211] Memory state around the buggy address:
    [   98.439871]  ffff888113ee3f80: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [   98.440714]  ffff888113ee4000: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
    [   98.441580] >ffff888113ee4080: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
    [   98.442458]                                   ^
    [   98.443011]  ffff888113ee4100: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
    [   98.443889]  ffff888113ee4180: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
    [   98.444768] ==================================================================
    [   98.445719] Disabling lock debugging due to kernel taint
    [   98.448074] l2cap_conn_free: freeing conn ffff88810c22b400
    [   98.450012] CPU: 1 UID: 0 PID: 1430 Comm: khidpd_00050004 Tainted: G    B               7.0.0-rc1-dirty #14 PREEMPT(lazy)
    [   98.450040] Tainted: [B]=BAD_PAGE
    [   98.450047] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-debian-1.17.0-1 04/01/2014
    [   98.450059] Call Trace:
    [   98.450065]  <TASK>
    [   98.450071]  dump_stack_lvl (lib/dump_stack.c:122)
    [   98.450099]  l2cap_conn_free (net/bluetooth/l2cap_core.c:1808)
    [   98.450125]  l2cap_conn_put (net/bluetooth/l2cap_core.c:1822)
    [   98.450154]  session_free (net/bluetooth/hidp/core.c:990)
    [   98.450181]  hidp_session_thread (net/bluetooth/hidp/core.c:1307)
    [   98.450213]  ? __pfx_hidp_session_thread (net/bluetooth/hidp/core.c:1264)
    [   98.450271]  ? kthread (kernel/kthread.c:433)
    [   98.450293]  ? __pfx_hidp_session_wake_function (net/bluetooth/hidp/core.c:1251)
    [   98.450339]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.450368]  ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
    [   98.450406]  ? __pfx_hidp_session_wake_function (net/bluetooth/hidp/core.c:1251)
    [   98.450442]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.450471]  ? trace_hardirqs_on (kernel/trace/trace_preemptirq.c:79 (discriminator 1))
    [   98.450499]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.450528]  ? kthread (kernel/kthread.c:433)
    [   98.450547]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.450578]  ? kthread (kernel/kthread.c:433)
    [   98.450598]  ? __pfx_hidp_session_thread (net/bluetooth/hidp/core.c:1264)
    [   98.450637]  kthread (kernel/kthread.c:467)
    [   98.450657]  ? __pfx_kthread (kernel/kthread.c:412)
    [   98.450680]  ret_from_fork (arch/x86/kernel/process.c:164)
    [   98.450715]  ? __pfx_ret_from_fork (arch/x86/kernel/process.c:153)
    [   98.450752]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [   98.450782]  ? __pfx_kthread (kernel/kthread.c:412)
    [   98.450804]  ret_from_fork_asm (arch/x86/entry/entry_64.S:258)
    [   98.450836]  </TASK>
    
    Fixes: b4f34d8d9d26 ("Bluetooth: hidp: add new session-management helpers")
    Reported-by: soufiane el hachmi <[email protected]>
    Tested-by: soufiane el hachmi <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: ISO: Fix defer tests being unstable [+ + +]

Author: Luiz Augusto von Dentz <[email protected]>
Date:   Fri Feb 27 15:23:01 2026 -0500

    Bluetooth: ISO: Fix defer tests being unstable
    
    [ Upstream commit 62bcaa6b351b6dc400f6c6b83762001fd9f5c12d ]
    
    iso-tester defer tests seem to fail with hci_conn_hash_lookup_cig
    being unable to resolve a cig in set_cig_params_sync due a race
    where it is run immediatelly before hci_bind_cis is able to set
    the QoS settings into the hci_conn object.
    
    So this moves the assigning of the QoS settings to be done directly
    by hci_le_set_cig_params to prevent that from happening again.
    
    Fixes: 26afbd826ee3 ("Bluetooth: Add initial implementation of CIS connections")
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: L2CAP: Fix accepting multiple L2CAP_ECRED_CONN_REQ [+ + +]

Author: Luiz Augusto von Dentz <[email protected]>
Date:   Tue Mar 3 13:29:53 2026 -0500

    Bluetooth: L2CAP: Fix accepting multiple L2CAP_ECRED_CONN_REQ
    
    commit 5b3e2052334f2ff6d5200e952f4aa66994d09899 upstream.
    
    Currently the code attempts to accept requests regardless of the
    command identifier which may cause multiple requests to be marked
    as pending (FLAG_DEFER_SETUP) which can cause more than
    L2CAP_ECRED_MAX_CID(5) to be allocated in l2cap_ecred_rsp_defer
    causing an overflow.
    
    The spec is quite clear that the same identifier shall not be used on
    subsequent requests:
    
    'Within each signaling channel a different Identifier shall be used
    for each successive request or indication.'
    https://www.bluetooth.com/wp-content/uploads/Files/Specification/HTML/Core-62/out/en/host/logical-link-control-and-adaptation-protocol-specification.html#UUID-32a25a06-4aa4-c6c7-77c5-dcfe3682355d
    
    So this attempts to check if there are any channels pending with the
    same identifier and rejects if any are found.
    
    Fixes: 15f02b910562 ("Bluetooth: L2CAP: Add initial code for Enhanced Credit Based Mode")
    Reported-by: Yiming Qian <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Bluetooth: L2CAP: Fix type confusion in l2cap_ecred_reconf_rsp() [+ + +]

Author: Lukas Johannes Möller <[email protected]>
Date:   Tue Mar 10 21:59:46 2026 +0000

    Bluetooth: L2CAP: Fix type confusion in l2cap_ecred_reconf_rsp()
    
    commit 15145675690cab2de1056e7ed68e59cbd0452529 upstream.
    
    l2cap_ecred_reconf_rsp() casts the incoming data to struct
    l2cap_ecred_conn_rsp (the ECRED *connection* response, 8 bytes with
    result at offset 6) instead of struct l2cap_ecred_reconf_rsp (2 bytes
    with result at offset 0).
    
    This causes two problems:
    
     - The sizeof(*rsp) length check requires 8 bytes instead of the
       correct 2, so valid L2CAP_ECRED_RECONF_RSP packets are rejected
       with -EPROTO.
    
     - rsp->result reads from offset 6 instead of offset 0, returning
       wrong data when the packet is large enough to pass the check.
    
    Fix by using the correct type.  Also pass the already byte-swapped
    result variable to BT_DBG instead of the raw __le16 field.
    
    Fixes: 15f02b910562 ("Bluetooth: L2CAP: Add initial code for Enhanced Credit Based Mode")
    Cc: [email protected]
    Signed-off-by: Lukas Johannes Möller <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Bluetooth: L2CAP: Fix use-after-free in l2cap_unregister_user [+ + +]

Author: Shaurya Rane <[email protected]>
Date:   Thu Nov 6 23:50:16 2025 +0530

    Bluetooth: L2CAP: Fix use-after-free in l2cap_unregister_user
    
    [ Upstream commit 752a6c9596dd25efd6978a73ff21f3b592668f4a ]
    
    After commit ab4eedb790ca ("Bluetooth: L2CAP: Fix corrupted list in
    hci_chan_del"), l2cap_conn_del() uses conn->lock to protect access to
    conn->users. However, l2cap_register_user() and l2cap_unregister_user()
    don't use conn->lock, creating a race condition where these functions can
    access conn->users and conn->hchan concurrently with l2cap_conn_del().
    
    This can lead to use-after-free and list corruption bugs, as reported
    by syzbot.
    
    Fix this by changing l2cap_register_user() and l2cap_unregister_user()
    to use conn->lock instead of hci_dev_lock(), ensuring consistent locking
    for the l2cap_conn structure.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=14b6d57fb728e27ce23c
    Fixes: ab4eedb790ca ("Bluetooth: L2CAP: Fix corrupted list in hci_chan_del")
    Signed-off-by: Shaurya Rane <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: L2CAP: Validate L2CAP_INFO_RSP payload length before access [+ + +]

Author: Lukas Johannes Möller <[email protected]>
Date:   Tue Mar 10 21:59:47 2026 +0000

    Bluetooth: L2CAP: Validate L2CAP_INFO_RSP payload length before access
    
    commit dd815e6e3918dc75a49aaabac36e4f024d675101 upstream.
    
    l2cap_information_rsp() checks that cmd_len covers the fixed
    l2cap_info_rsp header (type + result, 4 bytes) but then reads
    rsp->data without verifying that the payload is present:
    
     - L2CAP_IT_FEAT_MASK calls get_unaligned_le32(rsp->data), which reads
       4 bytes past the header (needs cmd_len >= 8).
    
     - L2CAP_IT_FIXED_CHAN reads rsp->data[0], 1 byte past the header
       (needs cmd_len >= 5).
    
    A truncated L2CAP_INFO_RSP with result == L2CAP_IR_SUCCESS triggers an
    out-of-bounds read of adjacent skb data.
    
    Guard each data access with the required payload length check.  If the
    payload is too short, skip the read and let the state machine complete
    with safe defaults (feat_mask and remote_fixed_chan remain zero from
    kzalloc), so the info timer cleanup and l2cap_conn_start() still run
    and the connection is not stalled.
    
    Fixes: 4e8402a3f884 ("[Bluetooth] Retrieve L2CAP features mask on connection setup")
    Cc: [email protected]
    Signed-off-by: Lukas Johannes Möller <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Bluetooth: LE L2CAP: Disconnect if received packet's SDU exceeds IMTU [+ + +]

Author: Christian Eggers <[email protected]>
Date:   Wed Feb 25 18:07:25 2026 +0100

    Bluetooth: LE L2CAP: Disconnect if received packet's SDU exceeds IMTU
    
    [ Upstream commit e1d9a66889867c232657a9b6f25d451d7c3ab96f ]
    
    Core 6.0, Vol 3, Part A, 3.4.3:
    "If the SDU length field value exceeds the receiver's MTU, the receiver
    shall disconnect the channel..."
    
    This fixes L2CAP/LE/CFC/BV-26-C (running together with 'l2test -r -P
    0x0027 -V le_public -I 100').
    
    Fixes: aac23bf63659 ("Bluetooth: Implement LE L2CAP reassembly")
    Signed-off-by: Christian Eggers <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: LE L2CAP: Disconnect if sum of payload sizes exceed SDU [+ + +]

Author: Christian Eggers <[email protected]>
Date:   Wed Feb 25 18:07:27 2026 +0100

    Bluetooth: LE L2CAP: Disconnect if sum of payload sizes exceed SDU
    
    [ Upstream commit b6a2bf43aa37670432843bc73ae2a6288ba4d6f8 ]
    
    Core 6.0, Vol 3, Part A, 3.4.3:
    "... If the sum of the payload sizes for the K-frames exceeds the
    specified SDU length, the receiver shall disconnect the channel."
    
    This fixes L2CAP/LE/CFC/BV-27-C (running together with 'l2test -r -P
    0x0027 -V le_public').
    
    Fixes: aac23bf63659 ("Bluetooth: Implement LE L2CAP reassembly")
    Signed-off-by: Christian Eggers <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: MGMT: Fix list corruption and UAF in command complete handlers [+ + +]

Author: Wang Tao <[email protected]>
Date:   Fri Feb 27 11:03:39 2026 +0000

    Bluetooth: MGMT: Fix list corruption and UAF in command complete handlers
    
    [ Upstream commit 17f89341cb4281d1da0e2fb0de5406ab7c4e25ef ]
    
    Commit 302a1f674c00 ("Bluetooth: MGMT: Fix possible UAFs") introduced
    mgmt_pending_valid(), which not only validates the pending command but
    also unlinks it from the pending list if it is valid. This change in
    semantics requires updates to several completion handlers to avoid list
    corruption and memory safety issues.
    
    This patch addresses two left-over issues from the aforementioned rework:
    
    1. In mgmt_add_adv_patterns_monitor_complete(), mgmt_pending_remove()
    is replaced with mgmt_pending_free() in the success path. Since
    mgmt_pending_valid() already unlinks the command at the beginning of
    the function, calling mgmt_pending_remove() leads to a double list_del()
    and subsequent list corruption/kernel panic.
    
    2. In set_mesh_complete(), the use of mgmt_pending_foreach() in the error
    path is removed. Since the current command is already unlinked by
    mgmt_pending_valid(), this foreach loop would incorrectly target other
    pending mesh commands, potentially freeing them while they are still being
    processed concurrently (leading to UAFs). The redundant mgmt_cmd_status()
    is also simplified to use cmd->opcode directly.
    
    Fixes: 302a1f674c00 ("Bluetooth: MGMT: Fix possible UAFs")
    Signed-off-by: Wang Tao <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: qca: fix ROM version reading on WCN3998 chips [+ + +]

Author: Dmitry Baryshkov <[email protected]>
Date:   Wed Mar 11 01:02:57 2026 +0200

    Bluetooth: qca: fix ROM version reading on WCN3998 chips
    
    [ Upstream commit 99b2c531e0e797119ae1b9195a8764ee98b00e65 ]
    
    WCN3998 uses a bit different format for rom version:
    
    [    5.479978] Bluetooth: hci0: setting up wcn399x
    [    5.633763] Bluetooth: hci0: QCA Product ID   :0x0000000a
    [    5.645350] Bluetooth: hci0: QCA SOC Version  :0x40010224
    [    5.650906] Bluetooth: hci0: QCA ROM Version  :0x00001001
    [    5.665173] Bluetooth: hci0: QCA Patch Version:0x00006699
    [    5.679356] Bluetooth: hci0: QCA controller version 0x02241001
    [    5.691109] Bluetooth: hci0: QCA Downloading qca/crbtfw21.tlv
    [    6.680102] Bluetooth: hci0: QCA Downloading qca/crnv21.bin
    [    6.842948] Bluetooth: hci0: QCA setup on UART is completed
    
    Fixes: 523760b7ff88 ("Bluetooth: hci_qca: Added support for WCN3998")
    Reviewed-by: Bartosz Golaszewski <[email protected]>
    Signed-off-by: Dmitry Baryshkov <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: SMP: make SM/PER/KDU/BI-04-C happy [+ + +]

Author: Christian Eggers <[email protected]>
Date:   Wed Feb 25 18:07:28 2026 +0100

    Bluetooth: SMP: make SM/PER/KDU/BI-04-C happy
    
    [ Upstream commit 0e4d4dcc1a6e82cc6f9abf32193558efa7e1613d ]
    
    The last test step ("Test with Invalid public key X and Y, all set to
    0") expects to get an "DHKEY check failed" instead of "unspecified".
    
    Fixes: 6d19628f539f ("Bluetooth: SMP: Fail if remote and local public keys are identical")
    Signed-off-by: Christian Eggers <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bnxt_en: Fix RSS table size check when changing ethtool channels [+ + +]

Author: Pavan Chebbi <[email protected]>
Date:   Fri Mar 6 14:58:54 2026 -0800

    bnxt_en: Fix RSS table size check when changing ethtool channels
    
    [ Upstream commit 0d9a60a0618d255530ca56072c5f39eb58e1ed4a ]
    
    When changing channels, the current check in bnxt_set_channels()
    is not checking for non-default RSS contexts when the RSS table size
    changes. The current check for IFF_RXFH_CONFIGURED is only sufficient
    for the default RSS context. Expand the check to include the presence
    of any non-default RSS contexts.
    
    Allowing such change will result in incorrect configuration of the
    context's RSS table when the table size changes.
    
    Fixes: b3d0083caf9a ("bnxt_en: Support RSS contexts in ethtool .{get|set}_rxfh()")
    Reported-by: Björn Töpel <[email protected]>
    Link: https://lore.kernel.org/netdev/[email protected]/
    Reviewed-by: Andy Gospodarek <[email protected]>
    Signed-off-by: Pavan Chebbi <[email protected]>
    Signed-off-by: Michael Chan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bonding: add ESP offload features when slaves support [+ + +]

Author: Jianbo Liu <[email protected]>
Date:   Tue Nov 5 21:27:21 2024 +0200

    bonding: add ESP offload features when slaves support
    
    [ Upstream commit 4861333b42178fa3d8fd1bb4e2cfb2fedc968dba ]
    
    Add NETIF_F_GSO_ESP bit to bond's gso_partial_features if all slaves
    support it, such that ESP segmentation is handled by hardware if possible.
    
    Signed-off-by: Jianbo Liu <[email protected]>
    Reviewed-by: Boris Pismenny <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: 950803f72547 ("bonding: fix type confusion in bond_setup_by_slave()")
    Signed-off-by: Sasha Levin <[email protected]>

bonding: Correctly support GSO ESP offload [+ + +]

Author: Cosmin Ratiu <[email protected]>
Date:   Mon Jan 27 12:41:47 2025 +0200

    bonding: Correctly support GSO ESP offload
    
    [ Upstream commit 9e6c4e6b605c1fa3e24f74ee0b641e95f090188a ]
    
    The referenced fix is incomplete. It correctly computes
    bond_dev->gso_partial_features across slaves, but unfortunately
    netdev_fix_features discards gso_partial_features from the feature set
    if NETIF_F_GSO_PARTIAL isn't set in bond_dev->features.
    
    This is visible with ethtool -k bond0 | grep esp:
    tx-esp-segmentation: off [requested on]
    esp-hw-offload: on
    esp-tx-csum-hw-offload: on
    
    This patch reworks the bonding GSO offload support by:
    - making aggregating gso_partial_features across slaves similar to the
      other feature sets (this part is a no-op).
    - advertising the default partial gso features on empty bond devs, same
      as with other feature sets (also a no-op).
    - adding NETIF_F_GSO_PARTIAL to hw_enc_features filtered across slaves.
    - adding NETIF_F_GSO_PARTIAL to features in bond_setup()
    
    With all of these, 'ethtool -k bond0 | grep esp' now reports:
    tx-esp-segmentation: on
    esp-hw-offload: on
    esp-tx-csum-hw-offload: on
    
    Fixes: 4861333b4217 ("bonding: add ESP offload features when slaves support")
    Signed-off-by: Hangbin Liu <[email protected]>
    Signed-off-by: Cosmin Ratiu <[email protected]>
    Acked-by: Jay Vosburgh <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Stable-dep-of: 950803f72547 ("bonding: fix type confusion in bond_setup_by_slave()")
    Signed-off-by: Sasha Levin <[email protected]>

bonding: fix type confusion in bond_setup_by_slave() [+ + +]

Author: Jiayuan Chen <[email protected]>
Date:   Fri Mar 6 10:15:07 2026 +0800

    bonding: fix type confusion in bond_setup_by_slave()
    
    [ Upstream commit 950803f7254721c1c15858fbbfae3deaaeeecb11 ]
    
    kernel BUG at net/core/skbuff.c:2306!
    Oops: invalid opcode: 0000 [#1] SMP KASAN NOPTI
    RIP: 0010:pskb_expand_head+0xa08/0xfe0 net/core/skbuff.c:2306
    RSP: 0018:ffffc90004aff760 EFLAGS: 00010293
    RAX: 0000000000000000 RBX: ffff88807e3c8780 RCX: ffffffff89593e0e
    RDX: ffff88807b7c4900 RSI: ffffffff89594747 RDI: ffff88807b7c4900
    RBP: 0000000000000820 R08: 0000000000000005 R09: 0000000000000000
    R10: 00000000961a63e0 R11: 0000000000000000 R12: ffff88807e3c8780
    R13: 00000000961a6560 R14: dffffc0000000000 R15: 00000000961a63e0
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 00007fe1a0ed8df0 CR3: 000000002d816000 CR4: 00000000003526f0
    Call Trace:
     <TASK>
     ipgre_header+0xdd/0x540 net/ipv4/ip_gre.c:900
     dev_hard_header include/linux/netdevice.h:3439 [inline]
     packet_snd net/packet/af_packet.c:3028 [inline]
     packet_sendmsg+0x3ae5/0x53c0 net/packet/af_packet.c:3108
     sock_sendmsg_nosec net/socket.c:727 [inline]
     __sock_sendmsg net/socket.c:742 [inline]
     ____sys_sendmsg+0xa54/0xc30 net/socket.c:2592
     ___sys_sendmsg+0x190/0x1e0 net/socket.c:2646
     __sys_sendmsg+0x170/0x220 net/socket.c:2678
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0x106/0xf80 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    RIP: 0033:0x7fe1a0e6c1a9
    
    When a non-Ethernet device (e.g. GRE tunnel) is enslaved to a bond,
    bond_setup_by_slave() directly copies the slave's header_ops to the
    bond device:
    
        bond_dev->header_ops = slave_dev->header_ops;
    
    This causes a type confusion when dev_hard_header() is later called
    on the bond device. Functions like ipgre_header(), ip6gre_header(),all use
    netdev_priv(dev) to access their device-specific private data. When
    called with the bond device, netdev_priv() returns the bond's private
    data (struct bonding) instead of the expected type (e.g. struct
    ip_tunnel), leading to garbage values being read and kernel crashes.
    
    Fix this by introducing bond_header_ops with wrapper functions that
    delegate to the active slave's header_ops using the slave's own
    device. This ensures netdev_priv() in the slave's header functions
    always receives the correct device.
    
    The fix is placed in the bonding driver rather than individual device
    drivers, as the root cause is bond blindly inheriting header_ops from
    the slave without considering that these callbacks expect a specific
    netdev_priv() layout.
    
    The type confusion can be observed by adding a printk in
    ipgre_header() and running the following commands:
    
        ip link add dummy0 type dummy
        ip addr add 10.0.0.1/24 dev dummy0
        ip link set dummy0 up
        ip link add gre1 type gre local 10.0.0.1
        ip link add bond1 type bond mode active-backup
        ip link set gre1 master bond1
        ip link set gre1 up
        ip link set bond1 up
        ip addr add fe80::1/64 dev bond1
    
    Fixes: 1284cd3a2b74 ("bonding: two small fixes for IPoIB support")
    Suggested-by: Jay Vosburgh <[email protected]>
    Reviewed-by: Eric Dumazet <[email protected]>
    Signed-off-by: Jiayuan Chen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bonding: handle BOND_LINK_FAIL, BOND_LINK_BACK as valid link states [+ + +]

Author: Hangbin Liu <[email protected]>
Date:   Wed Mar 4 15:13:54 2026 +0800

    bonding: handle BOND_LINK_FAIL, BOND_LINK_BACK as valid link states
    
    [ Upstream commit 3348be7978f450ede0c308a4e8416ac716cf1015 ]
    
    Before the fixed commit, we check slave->new_link during commit
    state, which values are only BOND_LINK_{NOCHANGE, UP, DOWN}. After
    the commit, we start using slave->link_new_state, which state also could
    be BOND_LINK_{FAIL, BACK}.
    
    For example, when we set updelay/downdelay, after a failover,
    the slave->link_new_state could be set to BOND_LINK_{FAIL, BACK} in
    bond_miimon_inspect(). And later in bond_miimon_commit(), it will treat
    it as invalid and print an error, which would cause confusion for users.
    
    [  106.440254] bond0: (slave veth2): link status down for interface, disabling it in 200 ms
    [  106.440265] bond0: (slave veth2): invalid new link 1 on slave
    [  106.648276] bond0: (slave veth2): link status definitely down, disabling slave
    [  107.480271] bond0: (slave veth2): link status up, enabling it in 200 ms
    [  107.480288] bond0: (slave veth2): invalid new link 3 on slave
    [  107.688302] bond0: (slave veth2): link status definitely up, 10000 Mbps full duplex
    
    Let's handle BOND_LINK_{FAIL, BACK} as valid link states.
    
    Fixes: 1899bb325149 ("bonding: fix state transition issue in link monitoring")
    Signed-off-by: Hangbin Liu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bonding: prevent potential infinite loop in bond_header_parse() [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Sun Mar 15 10:41:52 2026 +0000

    bonding: prevent potential infinite loop in bond_header_parse()
    
    [ Upstream commit b7405dcf7385445e10821777143f18c3ce20fa04 ]
    
    bond_header_parse() can loop if a stack of two bonding devices is setup,
    because skb->dev always points to the hierarchy top.
    
    Add new "const struct net_device *dev" parameter to
    (struct header_ops)->parse() method to make sure the recursion
    is bounded, and that the final leaf parse method is called.
    
    Fixes: 950803f72547 ("bonding: fix type confusion in bond_setup_by_slave()")
    Signed-off-by: Eric Dumazet <[email protected]>
    Reviewed-by: Jiayuan Chen <[email protected]>
    Tested-by: Jiayuan Chen <[email protected]>
    Cc: Jay Vosburgh <[email protected]>
    Cc: Andrew Lunn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bonding: use common function to compute the features [+ + +]

Author: Hangbin Liu <[email protected]>
Date:   Fri Oct 17 03:41:53 2025 +0000

    bonding: use common function to compute the features
    
    [ Upstream commit d4fde269a970666a30dd3abd0413273a06dd972d ]
    
    Use the new functon netdev_compute_master_upper_features() to compute the bonding
    features.
    
    Note that bond_compute_features() currently uses bond_for_each_slave()
    to traverse the lower devices list, and that is just a macro wrapper of
    netdev_for_each_lower_private(). We use similar helper
    netdev_for_each_lower_dev() in netdev_compute_master_upper_features() to
    iterate the slave device, as there is not need to get the private data.
    
    No functional change intended.
    
    Signed-off-by: Hangbin Liu <[email protected]>
    Reviewed-by: Sabrina Dubroca <[email protected]>
    Reviewed-by: Jiri Pirko <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: 950803f72547 ("bonding: fix type confusion in bond_setup_by_slave()")
    Signed-off-by: Sasha Levin <[email protected]>

bridge: cfm: Fix race condition in peer_mep deletion [+ + +]

Author: Hyunwoo Kim <[email protected]>
Date:   Wed Mar 11 03:18:09 2026 +0900

    bridge: cfm: Fix race condition in peer_mep deletion
    
    [ Upstream commit 3715a00855316066cdda69d43648336367422127 ]
    
    When a peer MEP is being deleted, cancel_delayed_work_sync() is called
    on ccm_rx_dwork before freeing. However, br_cfm_frame_rx() runs in
    softirq context under rcu_read_lock (without RTNL) and can re-schedule
    ccm_rx_dwork via ccm_rx_timer_start() between cancel_delayed_work_sync()
    returning and kfree_rcu() being called.
    
    The following is a simple race scenario:
    
               cpu0                                     cpu1
    
    mep_delete_implementation()
      cancel_delayed_work_sync(ccm_rx_dwork);
                                               br_cfm_frame_rx()
                                                 // peer_mep still in hlist
                                                 if (peer_mep->ccm_defect)
                                                   ccm_rx_timer_start()
                                                     queue_delayed_work(ccm_rx_dwork)
      hlist_del_rcu(&peer_mep->head);
      kfree_rcu(peer_mep, rcu);
                                               ccm_rx_work_expired()
                                                 // on freed peer_mep
    
    To prevent this, cancel_delayed_work_sync() is replaced with
    disable_delayed_work_sync() in both peer MEP deletion paths, so
    that subsequent queue_delayed_work() calls from br_cfm_frame_rx()
    are silently rejected.
    
    The cc_peer_disable() helper retains cancel_delayed_work_sync()
    because it is also used for the CC enable/disable toggle path where
    the work must remain re-schedulable.
    
    Fixes: dc32cbb3dbd7 ("bridge: cfm: Kernel space implementation of CFM. CCM frame RX added.")
    Signed-off-by: Hyunwoo Kim <[email protected]>
    Reviewed-by: Ido Schimmel <[email protected]>
    Link: https://patch.msgid.link/abBgYT5K_FI9rD1a@v4bel
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

btrfs: abort transaction on failure to update root in the received subvol ioctl [+ + +]

Author: Filipe Manana <[email protected]>
Date:   Fri Feb 27 00:02:33 2026 +0000

    btrfs: abort transaction on failure to update root in the received subvol ioctl
    
    commit 0f475ee0ebce5c9492b260027cd95270191675fa upstream.
    
    If we failed to update the root we don't abort the transaction, which is
    wrong since we already used the transaction to remove an item from the
    uuid tree.
    
    Fixes: dd5f9615fc5c ("Btrfs: maintain subvolume items in the UUID tree")
    CC: [email protected] # 3.12+
    Reviewed-by: Anand Jain <[email protected]>
    Signed-off-by: Filipe Manana <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: do not strictly require dirty metadata threshold for metadata writepages [+ + +]

Author: Qu Wenruo <[email protected]>
Date:   Fri Feb 27 09:56:58 2026 +0800

    btrfs: do not strictly require dirty metadata threshold for metadata writepages
    
    [ Upstream commit 4e159150a9a56d66d247f4b5510bed46fe58aa1c ]
    
    [BUG]
    There is an internal report that over 1000 processes are
    waiting at the io_schedule_timeout() of balance_dirty_pages(), causing
    a system hang and trigger a kernel coredump.
    
    The kernel is v6.4 kernel based, but the root problem still applies to
    any upstream kernel before v6.18.
    
    [CAUSE]
    >From Jan Kara for his wisdom on the dirty page balance behavior first.
    
      This cgroup dirty limit was what was actually playing the role here
      because the cgroup had only a small amount of memory and so the dirty
      limit for it was something like 16MB.
    
      Dirty throttling is responsible for enforcing that nobody can dirty
      (significantly) more dirty memory than there's dirty limit. Thus when
      a task is dirtying pages it periodically enters into balance_dirty_pages()
      and we let it sleep there to slow down the dirtying.
    
      When the system is over dirty limit already (either globally or within
      a cgroup of the running task), we will not let the task exit from
      balance_dirty_pages() until the number of dirty pages drops below the
      limit.
    
      So in this particular case, as I already mentioned, there was a cgroup
      with relatively small amount of memory and as a result with dirty limit
      set at 16MB. A task from that cgroup has dirtied about 28MB worth of
      pages in btrfs btree inode and these were practically the only dirty
      pages in that cgroup.
    
    So that means the only way to reduce the dirty pages of that cgroup is
    to writeback the dirty pages of btrfs btree inode, and only after that
    those processes can exit balance_dirty_pages().
    
    Now back to the btrfs part, btree_writepages() is responsible for
    writing back dirty btree inode pages.
    
    The problem here is, there is a btrfs internal threshold that if the
    btree inode's dirty bytes are below the 32M threshold, it will not
    do any writeback.
    
    This behavior is to batch as much metadata as possible so we won't write
    back those tree blocks and then later re-COW them again for another
    modification.
    
    This internal 32MiB is higher than the existing dirty page size (28MiB),
    meaning no writeback will happen, causing a deadlock between btrfs and
    cgroup:
    
    - Btrfs doesn't want to write back btree inode until more dirty pages
    
    - Cgroup/MM doesn't want more dirty pages for btrfs btree inode
      Thus any process touching that btree inode is put into sleep until
      the number of dirty pages is reduced.
    
    Thanks Jan Kara a lot for the analysis of the root cause.
    
    [ENHANCEMENT]
    Since kernel commit b55102826d7d ("btrfs: set AS_KERNEL_FILE on the
    btree_inode"), btrfs btree inode pages will only be charged to the root
    cgroup which should have a much larger limit than btrfs' 32MiB
    threshold.
    So it should not affect newer kernels.
    
    But for all current LTS kernels, they are all affected by this problem,
    and backporting the whole AS_KERNEL_FILE may not be a good idea.
    
    Even for newer kernels I still think it's a good idea to get
    rid of the internal threshold at btree_writepages(), since for most cases
    cgroup/MM has a better view of full system memory usage than btrfs' fixed
    threshold.
    
    For internal callers using btrfs_btree_balance_dirty() since that
    function is already doing internal threshold check, we don't need to
    bother them.
    
    But for external callers of btree_writepages(), just respect their
    requests and write back whatever they want, ignoring the internal
    btrfs threshold to avoid such deadlock on btree inode dirty page
    balancing.
    
    CC: [email protected]
    CC: Jan Kara <[email protected]>
    Reviewed-by: Boris Burkov <[email protected]>
    Signed-off-by: Qu Wenruo <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    [ The context change is due to the commit 5e121ae687b8
    ("btrfs: use buffer xarray for extent buffer writeback operations")
    in v6.16 which is irrelevant to the logic of this patch. ]
    Signed-off-by: Rahul Sharma <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: fix chunk map leak in btrfs_map_block() after btrfs_chunk_map_num_copies() [+ + +]

Author: Mark Harmstone <[email protected]>
Date:   Fri Feb 20 12:53:17 2026 +0000

    btrfs: fix chunk map leak in btrfs_map_block() after btrfs_chunk_map_num_copies()
    
    commit f15fb3d41543244d1179f423da4a4832a55bc050 upstream.
    
    Fix a chunk map leak in btrfs_map_block(): if we return early with -EINVAL,
    we're not freeing the chunk map that we've just looked up.
    
    Fixes: 0ae653fbec2b ("btrfs: reduce chunk_map lookups in btrfs_map_block()")
    CC: [email protected] # 6.12+
    Reviewed-by: Filipe Manana <[email protected]>
    Signed-off-by: Mark Harmstone <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: fix transaction abort on file creation due to name hash collision [+ + +]

Author: Filipe Manana <[email protected]>
Date:   Thu Feb 26 11:05:43 2026 +0000

    btrfs: fix transaction abort on file creation due to name hash collision
    
    commit 2d1ababdedd4ba38867c2500eb7f95af5ddeeef7 upstream.
    
    If we attempt to create several files with names that result in the same
    hash, we have to pack them in same dir item and that has a limit inherent
    to the leaf size. However if we reach that limit, we trigger a transaction
    abort and turns the filesystem into RO mode. This allows for a malicious
    user to disrupt a system, without the need to have administration
    privileges/capabilities.
    
    Reproducer:
    
      $ cat exploit-hash-collisions.sh
      #!/bin/bash
    
      DEV=/dev/sdi
      MNT=/mnt/sdi
    
      # Use smallest node size to make the test faster and require fewer file
      # names that result in hash collision.
      mkfs.btrfs -f --nodesize 4K $DEV
      mount $DEV $MNT
    
      # List of names that result in the same crc32c hash for btrfs.
      declare -a names=(
       'foobar'
       '%a8tYkxfGMLWRGr55QSeQc4PBNH9PCLIvR6jZnkDtUUru1t@RouaUe_L:@xGkbO3nCwvLNYeK9vhE628gss:T$yZjZ5l-Nbd6CbC$M=hqE-ujhJICXyIxBvYrIU9-TDC'
       'AQci3EUB%shMsg-N%frgU:02ByLs=IPJU0OpgiWit5nexSyxZDncY6WB:=zKZuk5Zy0DD$Ua78%MelgBuMqaHGyKsJUFf9s=UW80PcJmKctb46KveLSiUtNmqrMiL9-Y0I_l5Fnam04CGIg=8@U:Z'
       'CvVqJpJzueKcuA$wqwePfyu7VxuWNN3ho$p0zi2H8QFYK$7YlEqOhhb%:hHgjhIjW5vnqWHKNP4'
       'ET:vk@rFU4tsvMB0$C_p=xQHaYZjvoF%-BTc%wkFW8yaDAPcCYoR%x$FH5O:'
       'HwTon%v7SGSP4FE08jBwwiu5aot2CFKXHTeEAa@38fUcNGOWvE@Mz6WBeDH_VooaZ6AgsXPkVGwy9l@@ZbNXabUU9csiWrrOp0MWUdfi$EZ3w9GkIqtz7I_eOsByOkBOO'
       'Ij%2VlFGXSuPvxJGf5UWy6O@1svxGha%b@=%wjkq:CIgE6u7eJOjmQY5qTtxE2Rjbis9@us'
       'KBkjG5%9R8K9sOG8UTnAYjxLNAvBmvV5vz3IiZaPmKuLYO03-6asI9lJ_j4@6Xo$KZicaLWJ3Pv8XEwVeUPMwbHYWwbx0pYvNlGMO9F:ZhHAwyctnGy%_eujl%WPd4U2BI7qooOSr85J-C2V$LfY'
       'NcRfDfuUQ2=zP8K3CCF5dFcpfiOm6mwenShsAb_F%n6GAGC7fT2JFFn:c35X-3aYwoq7jNX5$ZJ6hI3wnZs$7KgGi7wjulffhHNUxAT0fRRLF39vJ@NvaEMxsMO'
       'Oj42AQAEzRoTxa5OuSKIr=A_lwGMy132v4g3Pdq1GvUG9874YseIFQ6QU'
       'Ono7avN5GjC:_6dBJ_'
       'WHmN2gnmaN-9dVDy4aWo:yNGFzz8qsJyJhWEWcud7$QzN2D9R0efIWWEdu5kwWr73NZm4=@CoCDxrrZnRITr-kGtU_cfW2:%2_am'
       'WiFnuTEhAG9FEC6zopQmj-A-$LDQ0T3WULz%ox3UZAPybSV6v1Z$b4L_XBi4M4BMBtJZpz93r9xafpB77r:lbwvitWRyo$odnAUYlYMmU4RvgnNd--e=I5hiEjGLETTtaScWlQp8mYsBovZwM2k'
       'XKyH=OsOAF3p%uziGF_ZVr$ivrvhVgD@1u%5RtrV-gl_vqAwHkK@x7YwlxX3qT6WKKQ%PR56NrUBU2dOAOAdzr2=5nJuKPM-T-$ZpQfCL7phxQbUcb:BZOTPaFExc-qK-gDRCDW2'
       'd3uUR6OFEwZr%ns1XH_@tbxA@cCPmbBRLdyh7p6V45H$P2$F%w0RqrD3M0g8aGvWpoTFMiBdOTJXjD:JF7=h9a_43xBywYAP%r$SPZi%zDg%ql-KvkdUCtF9OLaQlxmd'
       'ePTpbnit%hyNm@WELlpKzNZYOzOTf8EQ$sEfkMy1VOfIUu3coyvIr13-Y7Sv5v-Ivax2Go_GQRFMU1b3362nktT9WOJf3SpT%z8sZmM3gvYQBDgmKI%%RM-G7hyrhgYflOw%z::ZRcv5O:lDCFm'
       'evqk743Y@dvZAiG5J05L_ROFV@$2%rVWJ2%3nxV72-W7$e$-SK3tuSHA2mBt$qloC5jwNx33GmQUjD%akhBPu=VJ5g$xhlZiaFtTrjeeM5x7dt4cHpX0cZkmfImndYzGmvwQG:$euFYmXn$_2rA9mKZ'
       'gkgUtnihWXsZQTEkrMAWIxir09k3t7jk_IK25t1:cy1XWN0GGqC%FrySdcmU7M8MuPO_ppkLw3=Dfr0UuBAL4%GFk2$Ma10V1jDRGJje%Xx9EV2ERaWKtjpwiZwh0gCSJsj5UL7CR8RtW5opCVFKGGy8Cky'
       'hNgsG_8lNRik3PvphqPm0yEH3P%%fYG:kQLY=6O-61Wa6nrV_WVGR6TLB09vHOv%g4VQRP8Gzx7VXUY1qvZyS'
       'isA7JVzN12xCxVPJZ_qoLm-pTBuhjjHMvV7o=F:EaClfYNyFGlsfw-Kf%uxdqW-kwk1sPl2vhbjyHU1A6$hz'
       'kiJ_fgcdZFDiOptjgH5PN9-PSyLO4fbk_:u5_2tz35lV_iXiJ6cx7pwjTtKy-XGaQ5IefmpJ4N_ZqGsqCsKuqOOBgf9LkUdffHet@Wu'
       'lvwtxyhE9:%Q3UxeHiViUyNzJsy:fm38pg_b6s25JvdhOAT=1s0$pG25x=LZ2rlHTszj=gN6M4zHZYr_qrB49i=pA--@WqWLIuX7o1S_SfS@2FSiUZN'
       'rC24cw3UBDZ=5qJBUMs9e$=S4Y94ni%Z8639vnrGp=0Hv4z3dNFL0fBLmQ40=EYIY:Z=SLc@QLMSt2zsss2ZXrP7j4='
       'uwGl2s-fFrf@GqS=DQqq2I0LJSsOmM%xzTjS:lzXguE3wChdMoHYtLRKPvfaPOZF2fER@j53evbKa7R%A7r4%YEkD=kicJe@SFiGtXHbKe4gCgPAYbnVn'
       'UG37U6KKua2bgc:IHzRs7BnB6FD:2Mt5Cc5NdlsW%$1tyvnfz7S27FvNkroXwAW:mBZLA1@qa9WnDbHCDmQmfPMC9z-Eq6QT0jhhPpqyymaD:R02ghwYo%yx7SAaaq-:x33LYpei$5g8DMl3C'
       'y2vjek0FE1PDJC0qpfnN:x8k2wCFZ9xiUF2ege=JnP98R%wxjKkdfEiLWvQzmnW'
       '8-HCSgH5B%K7P8_jaVtQhBXpBk:pE-$P7ts58U0J@iR9YZntMPl7j$s62yAJO@_9eanFPS54b=UTw$94C-t=HLxT8n6o9P=QnIxq-f1=Ne2dvhe6WbjEQtc'
       'YPPh:IFt2mtR6XWSmjHptXL_hbSYu8bMw-JP8@PNyaFkdNFsk$M=xfL6LDKCDM-mSyGA_2MBwZ8Dr4=R1D%7-mCaaKGxb990jzaagRktDTyp'
       '9hD2ApKa_t_7x-a@GCG28kY:7$M@5udI1myQ$x5udtggvagmCQcq9QXWRC5hoB0o-_zHQUqZI5rMcz_kbMgvN5jr63LeYA4Cj-c6F5Ugmx6DgVf@2Jqm%MafecpgooqreJ53P-QTS'
      )
    
      # Now create files with all those names in the same parent directory.
      # It should not fail since a 4K leaf has enough space for them.
      for name in "${names[@]}"; do
           touch $MNT/$name
      done
    
      # Now add one more file name that causes a crc32c hash collision.
      # This should fail, but it should not turn the filesystem into RO mode
      # (which could be exploited by malicious users) due to a transaction
      # abort.
      touch $MNT/'W6tIm-VK2@BGC@IBfcgg6j_p:pxp_QUqtWpGD5Ok_GmijKOJJt'
    
      # Check that we are able to create another file, with a name that does not cause
      # a crc32c hash collision.
      echo -n "hello world" > $MNT/baz
    
      # Unmount and mount again, verify file baz exists and with the right content.
      umount $MNT
      mount $DEV $MNT
      echo "File baz content: $(cat $MNT/baz)"
    
      umount $MNT
    
    When running the reproducer:
    
      $ ./exploit-hash-collisions.sh
      (...)
      touch: cannot touch '/mnt/sdi/W6tIm-VK2@BGC@IBfcgg6j_p:pxp_QUqtWpGD5Ok_GmijKOJJt': Value too large for defined data type
      ./exploit-hash-collisions.sh: line 57: /mnt/sdi/baz: Read-only file system
      cat: /mnt/sdi/baz: No such file or directory
      File baz content:
    
    And the transaction abort stack trace in dmesg/syslog:
    
      $ dmesg
      (...)
      [758240.509761] ------------[ cut here ]------------
      [758240.510668] BTRFS: Transaction aborted (error -75)
      [758240.511577] WARNING: fs/btrfs/inode.c:6854 at btrfs_create_new_inode+0x805/0xb50 [btrfs], CPU#6: touch/888644
      [758240.513513] Modules linked in: btrfs dm_zero (...)
      [758240.523221] CPU: 6 UID: 0 PID: 888644 Comm: touch Tainted: G        W           6.19.0-rc8-btrfs-next-225+ #1 PREEMPT(full)
      [758240.524621] Tainted: [W]=WARN
      [758240.525037] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.2-0-gea1b7a073390-prebuilt.qemu.org 04/01/2014
      [758240.526331] RIP: 0010:btrfs_create_new_inode+0x80b/0xb50 [btrfs]
      [758240.527093] Code: 0f 82 cf (...)
      [758240.529211] RSP: 0018:ffffce64418fbb48 EFLAGS: 00010292
      [758240.529935] RAX: 00000000ffffffd3 RBX: 0000000000000000 RCX: 00000000ffffffb5
      [758240.531040] RDX: 0000000d04f33e06 RSI: 00000000ffffffb5 RDI: ffffffffc0919dd0
      [758240.531920] RBP: ffffce64418fbc10 R08: 0000000000000000 R09: 00000000ffffffb5
      [758240.532928] R10: 0000000000000000 R11: ffff8e52c0000000 R12: ffff8e53eee7d0f0
      [758240.533818] R13: ffff8e57f70932a0 R14: ffff8e5417629568 R15: 0000000000000000
      [758240.534664] FS:  00007f1959a2a740(0000) GS:ffff8e5b27cae000(0000) knlGS:0000000000000000
      [758240.535821] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [758240.536644] CR2: 00007f1959b10ce0 CR3: 000000012a2cc005 CR4: 0000000000370ef0
      [758240.537517] Call Trace:
      [758240.537828]  <TASK>
      [758240.538099]  btrfs_create_common+0xbf/0x140 [btrfs]
      [758240.538760]  path_openat+0x111a/0x15b0
      [758240.539252]  do_filp_open+0xc2/0x170
      [758240.539699]  ? preempt_count_add+0x47/0xa0
      [758240.540200]  ? __virt_addr_valid+0xe4/0x1a0
      [758240.540800]  ? __check_object_size+0x1b3/0x230
      [758240.541661]  ? alloc_fd+0x118/0x180
      [758240.542315]  do_sys_openat2+0x70/0xd0
      [758240.543012]  __x64_sys_openat+0x50/0xa0
      [758240.543723]  do_syscall_64+0x50/0xf20
      [758240.544462]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
      [758240.545397] RIP: 0033:0x7f1959abc687
      [758240.546019] Code: 48 89 fa (...)
      [758240.548522] RSP: 002b:00007ffe16ff8690 EFLAGS: 00000202 ORIG_RAX: 0000000000000101
      [758240.566278] RAX: ffffffffffffffda RBX: 00007f1959a2a740 RCX: 00007f1959abc687
      [758240.567068] RDX: 0000000000000941 RSI: 00007ffe16ffa333 RDI: ffffffffffffff9c
      [758240.567860] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
      [758240.568707] R10: 00000000000001b6 R11: 0000000000000202 R12: 0000561eec7c4b90
      [758240.569712] R13: 0000561eec7c311f R14: 00007ffe16ffa333 R15: 0000000000000000
      [758240.570758]  </TASK>
      [758240.571040] ---[ end trace 0000000000000000 ]---
      [758240.571681] BTRFS: error (device sdi state A) in btrfs_create_new_inode:6854: errno=-75 unknown
      [758240.572899] BTRFS info (device sdi state EA): forced readonly
    
    Fix this by checking for hash collision, and if the adding a new name is
    possible, early in btrfs_create_new_inode() before we do any tree updates,
    so that we don't need to abort the transaction if we cannot add the new
    name due to the leaf size limit.
    
    A test case for fstests will be sent soon.
    
    Fixes: caae78e03234 ("btrfs: move common inode creation code into btrfs_create_new_inode()")
    CC: [email protected] # 6.1+
    Reviewed-by: Boris Burkov <[email protected]>
    Reviewed-by: Qu Wenruo <[email protected]>
    Signed-off-by: Filipe Manana <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: fix transaction abort on set received ioctl due to item overflow [+ + +]

Author: Filipe Manana <[email protected]>
Date:   Thu Feb 26 23:41:07 2026 +0000

    btrfs: fix transaction abort on set received ioctl due to item overflow
    
    commit 87f2c46003fce4d739138aab4af1942b1afdadac upstream.
    
    If the set received ioctl fails due to an item overflow when attempting to
    add the BTRFS_UUID_KEY_RECEIVED_SUBVOL we have to abort the transaction
    since we did some metadata updates before.
    
    This means that if a user calls this ioctl with the same received UUID
    field for a lot of subvolumes, we will hit the overflow, trigger the
    transaction abort and turn the filesystem into RO mode. A malicious user
    could exploit this, and this ioctl does not even requires that a user
    has admin privileges (CAP_SYS_ADMIN), only that he/she owns the subvolume.
    
    Fix this by doing an early check for item overflow before starting a
    transaction. This is also race safe because we are holding the subvol_sem
    semaphore in exclusive (write) mode.
    
    A test case for fstests will follow soon.
    
    Fixes: dd5f9615fc5c ("Btrfs: maintain subvolume items in the UUID tree")
    CC: [email protected] # 3.12+
    Reviewed-by: Anand Jain <[email protected]>
    Signed-off-by: Filipe Manana <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: fix transaction abort when snapshotting received subvolumes [+ + +]

Author: Filipe Manana <[email protected]>
Date:   Thu Mar 19 13:04:44 2026 -0400

    btrfs: fix transaction abort when snapshotting received subvolumes
    
    [ Upstream commit e1b18b959025e6b5dbad668f391f65d34b39595a ]
    
    Currently a user can trigger a transaction abort by snapshotting a
    previously received snapshot a bunch of times until we reach a
    BTRFS_UUID_KEY_RECEIVED_SUBVOL item overflow (the maximum item size we
    can store in a leaf). This is very likely not common in practice, but
    if it happens, it turns the filesystem into RO mode. The snapshot, send
    and set_received_subvol and subvol_setflags (used by receive) don't
    require CAP_SYS_ADMIN, just inode_owner_or_capable(). A malicious user
    could use this to turn a filesystem into RO mode and disrupt a system.
    
    Reproducer script:
    
      $ cat test.sh
      #!/bin/bash
    
      DEV=/dev/sdi
      MNT=/mnt/sdi
    
      # Use smallest node size to make the test faster.
      mkfs.btrfs -f --nodesize 4K $DEV
      mount $DEV $MNT
    
      # Create a subvolume and set it to RO so that it can be used for send.
      btrfs subvolume create $MNT/sv
      touch $MNT/sv/foo
      btrfs property set $MNT/sv ro true
    
      # Send and receive the subvolume into snaps/sv.
      mkdir $MNT/snaps
      btrfs send $MNT/sv | btrfs receive $MNT/snaps
    
      # Now snapshot the received subvolume, which has a received_uuid, a
      # lot of times to trigger the leaf overflow.
      total=500
      for ((i = 1; i <= $total; i++)); do
          echo -ne "\rCreating snapshot $i/$total"
          btrfs subvolume snapshot -r $MNT/snaps/sv $MNT/snaps/sv_$i > /dev/null
      done
      echo
    
      umount $MNT
    
    When running the test:
    
      $ ./test.sh
      (...)
      Create subvolume '/mnt/sdi/sv'
      At subvol /mnt/sdi/sv
      At subvol sv
      Creating snapshot 496/500ERROR: Could not create subvolume: Value too large for defined data type
      Creating snapshot 497/500ERROR: Could not create subvolume: Read-only file system
      Creating snapshot 498/500ERROR: Could not create subvolume: Read-only file system
      Creating snapshot 499/500ERROR: Could not create subvolume: Read-only file system
      Creating snapshot 500/500ERROR: Could not create subvolume: Read-only file system
    
    And in dmesg/syslog:
    
      $ dmesg
      (...)
      [251067.627338] BTRFS warning (device sdi): insert uuid item failed -75 (0x4628b21c4ac8d898, 0x2598bee2b1515c91) type 252!
      [251067.629212] ------------[ cut here ]------------
      [251067.630033] BTRFS: Transaction aborted (error -75)
      [251067.630871] WARNING: fs/btrfs/transaction.c:1907 at create_pending_snapshot.cold+0x52/0x465 [btrfs], CPU#10: btrfs/615235
      [251067.632851] Modules linked in: btrfs dm_zero (...)
      [251067.644071] CPU: 10 UID: 0 PID: 615235 Comm: btrfs Tainted: G        W           6.19.0-rc8-btrfs-next-225+ #1 PREEMPT(full)
      [251067.646165] Tainted: [W]=WARN
      [251067.646733] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.2-0-gea1b7a073390-prebuilt.qemu.org 04/01/2014
      [251067.648735] RIP: 0010:create_pending_snapshot.cold+0x55/0x465 [btrfs]
      [251067.649984] Code: f0 48 0f (...)
      [251067.653313] RSP: 0018:ffffce644908fae8 EFLAGS: 00010292
      [251067.653987] RAX: 00000000ffffff01 RBX: ffff8e5639e63a80 RCX: 00000000ffffffd3
      [251067.655042] RDX: ffff8e53faa76b00 RSI: 00000000ffffffb5 RDI: ffffffffc0919750
      [251067.656077] RBP: ffffce644908fbd8 R08: 0000000000000000 R09: ffffce644908f820
      [251067.657068] R10: ffff8e5adc1fffa8 R11: 0000000000000003 R12: ffff8e53c0431bd0
      [251067.658050] R13: ffff8e5414593600 R14: ffff8e55efafd000 R15: 00000000ffffffb5
      [251067.659019] FS:  00007f2a4944b3c0(0000) GS:ffff8e5b27dae000(0000) knlGS:0000000000000000
      [251067.660115] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [251067.660943] CR2: 00007ffc5aa57898 CR3: 00000005813a2003 CR4: 0000000000370ef0
      [251067.661972] Call Trace:
      [251067.662292]  <TASK>
      [251067.662653]  create_pending_snapshots+0x97/0xc0 [btrfs]
      [251067.663413]  btrfs_commit_transaction+0x26e/0xc00 [btrfs]
      [251067.664257]  ? btrfs_qgroup_convert_reserved_meta+0x35/0x390 [btrfs]
      [251067.665238]  ? _raw_spin_unlock+0x15/0x30
      [251067.665837]  ? record_root_in_trans+0xa2/0xd0 [btrfs]
      [251067.666531]  btrfs_mksubvol+0x330/0x580 [btrfs]
      [251067.667145]  btrfs_mksnapshot+0x74/0xa0 [btrfs]
      [251067.667827]  __btrfs_ioctl_snap_create+0x194/0x1d0 [btrfs]
      [251067.668595]  btrfs_ioctl_snap_create_v2+0x107/0x130 [btrfs]
      [251067.669479]  btrfs_ioctl+0x1580/0x2690 [btrfs]
      [251067.670093]  ? count_memcg_events+0x6d/0x180
      [251067.670849]  ? handle_mm_fault+0x1a0/0x2a0
      [251067.671652]  __x64_sys_ioctl+0x92/0xe0
      [251067.672406]  do_syscall_64+0x50/0xf20
      [251067.673129]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
      [251067.674096] RIP: 0033:0x7f2a495648db
      [251067.674812] Code: 00 48 89 (...)
      [251067.678227] RSP: 002b:00007ffc5aa57840 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
      [251067.679691] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007f2a495648db
      [251067.681145] RDX: 00007ffc5aa588b0 RSI: 0000000050009417 RDI: 0000000000000004
      [251067.682511] RBP: 0000000000000002 R08: 0000000000000000 R09: 0000000000000000
      [251067.683842] R10: 000000000000000a R11: 0000000000000246 R12: 00007ffc5aa59910
      [251067.685176] R13: 00007ffc5aa588b0 R14: 0000000000000004 R15: 0000000000000006
      [251067.686524]  </TASK>
      [251067.686972] ---[ end trace 0000000000000000 ]---
      [251067.687890] BTRFS: error (device sdi state A) in create_pending_snapshot:1907: errno=-75 unknown
      [251067.689049] BTRFS info (device sdi state EA): forced readonly
      [251067.689054] BTRFS warning (device sdi state EA): Skipping commit of aborted transaction.
      [251067.690119] BTRFS: error (device sdi state EA) in cleanup_transaction:2043: errno=-75 unknown
      [251067.702028] BTRFS info (device sdi state EA): last unmount of filesystem 46dc3975-30a2-4a69-a18f-418b859cccda
    
    Fix this by ignoring -EOVERFLOW errors from btrfs_uuid_tree_add() in the
    snapshot creation code when attempting to add the
    BTRFS_UUID_KEY_RECEIVED_SUBVOL item. This is OK because it's not critical
    and we are still able to delete the snapshot, as snapshot/subvolume
    deletion ignores if a BTRFS_UUID_KEY_RECEIVED_SUBVOL is missing (see
    inode.c:btrfs_delete_subvolume()). As for send/receive, we can still do
    send/receive operations since it always peeks the first root ID in the
    existing BTRFS_UUID_KEY_RECEIVED_SUBVOL (it could peek any since all
    snapshots have the same content), and even if the key is missing, it
    falls back to searching by BTRFS_UUID_KEY_SUBVOL key.
    
    A test case for fstests will be sent soon.
    
    Fixes: dd5f9615fc5c ("Btrfs: maintain subvolume items in the UUID tree")
    CC: [email protected] # 3.12+
    Reviewed-by: Boris Burkov <[email protected]>
    Reviewed-by: Qu Wenruo <[email protected]>
    Signed-off-by: Filipe Manana <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    [ adapted error check condition to omit unlikely() wrapper ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: hold space_info->lock when clearing periodic reclaim ready [+ + +]

Author: Sun YangKai <[email protected]>
Date:   Mon Feb 9 20:53:39 2026 +0800

    btrfs: hold space_info->lock when clearing periodic reclaim ready
    
    [ Upstream commit b8883b61f2fc50dcf22938cbed40fec05020552f ]
    
    btrfs_set_periodic_reclaim_ready() requires space_info->lock to be held,
    as enforced by lockdep_assert_held(). However, btrfs_reclaim_sweep() was
    calling it after do_reclaim_sweep() returns, at which point
    space_info->lock is no longer held.
    
    Fix this by explicitly acquiring space_info->lock before clearing the
    periodic reclaim ready flag in btrfs_reclaim_sweep().
    
    Reported-by: Chris Mason <[email protected]>
    Link: https://lore.kernel.org/linux-btrfs/[email protected]/
    Fixes: 19eff93dc738 ("btrfs: fix periodic reclaim condition")
    Reviewed-by: Boris Burkov <[email protected]>
    Signed-off-by: Sun YangKai <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

btrfs: log new dentries when logging parent dir of a conflicting inode [+ + +]

Author: Filipe Manana <[email protected]>
Date:   Tue Mar 3 16:57:43 2026 +0000

    btrfs: log new dentries when logging parent dir of a conflicting inode
    
    [ Upstream commit 9573a365ff9ff45da9222d3fe63695ce562beb24 ]
    
    If we log the parent directory of a conflicting inode, we are not logging
    the new dentries of the directory, so when we finish we have the parent
    directory's inode marked as logged but we did not log its new dentries.
    As a consequence if the parent directory is explicitly fsynced later and
    it does not have any new changes since we logged it, the fsync is a no-op
    and after a power failure the new dentries are missing.
    
    Example scenario:
    
      $ mkdir foo
    
      $ sync
    
      $rmdir foo
    
      $ mkdir dir1
      $ mkdir dir2
    
      # A file with the same name and parent as the directory we just deleted
      # and was persisted in a past transaction. So the deleted directory's
      # inode is a conflicting inode of this new file's inode.
      $ touch foo
    
      $ ln foo dir2/link
    
      # The fsync on dir2 will log the parent directory (".") because the
      # conflicting inode (deleted directory) does not exists anymore, but it
      # it does not log its new dentries (dir1).
      $ xfs_io -c "fsync" dir2
    
      # This fsync on the parent directory is no-op, since the previous fsync
      # logged it (but without logging its new dentries).
      $ xfs_io -c "fsync" .
    
      <power failure>
    
      # After log replay dir1 is missing.
    
    Fix this by ensuring we log new dir dentries whenever we log the parent
    directory of a no longer existing conflicting inode.
    
    A test case for fstests will follow soon.
    
    Reported-by: Vyacheslav Kovalevsky <[email protected]>
    Link: https://lore.kernel.org/linux-btrfs/[email protected]/
    Fixes: a3baaf0d786e ("Btrfs: fix fsync after succession of renames and unlink/rmdir")
    Reviewed-by: Boris Burkov <[email protected]>
    Signed-off-by: Filipe Manana <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

btrfs: tree-checker: fix misleading root drop_level error message [+ + +]

Author: ZhengYuan Huang <[email protected]>
Date:   Thu Mar 12 08:33:21 2026 +0800

    btrfs: tree-checker: fix misleading root drop_level error message
    
    [ Upstream commit fc1cd1f18c34f91e78362f9629ab9fd43b9dcab9 ]
    
    Fix tree-checker error message to report "invalid root drop_level"
    instead of the misleading "invalid root level".
    
    Fixes: 259ee7754b67 ("btrfs: tree-checker: Add ROOT_ITEM check")
    Reviewed-by: Qu Wenruo <[email protected]>
    Signed-off-by: ZhengYuan Huang <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

cache: ax45mp: Fix device node reference leak in ax45mp_cache_init() [+ + +]

Author: Felix Gu <[email protected]>
Date:   Sat Jan 31 01:49:09 2026 +0800

    cache: ax45mp: Fix device node reference leak in ax45mp_cache_init()
    
    [ Upstream commit 0528a348b04b327a4611e29589beb4c9ae81304a ]
    
    In ax45mp_cache_init(), of_find_matching_node() returns a device node
    with an incremented reference count that must be released with
    of_node_put(). The current code fails to call of_node_put() which
    causes a reference leak.
    
    Use the __free(device_node) attribute to ensure automatic cleanup when
    the variable goes out of scope.
    
    Fixes: d34599bcd2e4 ("cache: Add L2 cache management for Andes AX45MP RISC-V core")
    Signed-off-by: Felix Gu <[email protected]>
    Signed-off-by: Conor Dooley <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

cache: starfive: fix device node leak in starlink_cache_init() [+ + +]

Author: Felix Gu <[email protected]>
Date:   Sat Jan 31 01:13:45 2026 +0800

    cache: starfive: fix device node leak in starlink_cache_init()
    
    [ Upstream commit 3c85234b979af71cb9db5eb976ea08a468415767 ]
    
    of_find_matching_node() returns a device_node with refcount incremented.
    
    Use __free(device_node) attribute to automatically call of_node_put()
    when the variable goes out of scope, preventing the refcount leak.
    
    Fixes: cabff60ca77d ("cache: Add StarFive StarLink cache management")
    Signed-off-by: Felix Gu <[email protected]>
    Reviewed-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Conor Dooley <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

can: gs_usb: gs_can_open(): always configure bitrates before starting device [+ + +]

Author: Marc Kleine-Budde <[email protected]>
Date:   Mon Mar 16 13:08:39 2026 -0400

    can: gs_usb: gs_can_open(): always configure bitrates before starting device
    
    [ Upstream commit 2df6162785f31f1bbb598cfc3b08e4efc88f80b6 ]
    
    So far the driver populated the struct can_priv::do_set_bittiming() and
    struct can_priv::fd::do_set_data_bittiming() callbacks.
    
    Before bringing up the interface, user space has to configure the bitrates.
    With these callbacks the configuration is directly forwarded into the CAN
    hardware. Then the interface can be brought up.
    
    An ifdown-ifup cycle (without changing the bit rates) doesn't re-configure
    the bitrates in the CAN hardware. This leads to a problem with the
    CANable-2.5 [1] firmware, which resets the configured bit rates during
    ifdown.
    
    To fix the problem remove both bit timing callbacks and always configure
    the bitrates in the struct net_device_ops::ndo_open() callback.
    
    [1] https://github.com/Elmue/CANable-2.5-firmware-Slcan-and-Candlelight
    
    Cc: [email protected]
    Fixes: d08e973a77d1 ("can: gs_usb: Added support for the GS_USB CAN devices")
    Link: https://patch.msgid.link/20260219-gs_usb-always-configure-bitrates-v2-1-671f8ba5b0a5@pengutronix.de
    Signed-off-by: Marc Kleine-Budde <[email protected]>
    [ adapted the `.fd` sub-struct ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

can: hi311x: hi3110_open(): add check for hi3110_power_enable() return value [+ + +]

Author: Wenyuan Li <[email protected]>
Date:   Tue Mar 10 13:08:44 2026 +0800

    can: hi311x: hi3110_open(): add check for hi3110_power_enable() return value
    
    [ Upstream commit 47bba09b14fa21712398febf36cb14fd4fc3bded ]
    
    In hi3110_open(), the return value of hi3110_power_enable() is not checked.
    If power enable fails, the device may not function correctly, while the
    driver still returns success.
    
    Add a check for the return value and propagate the error accordingly.
    
    Signed-off-by: Wenyuan Li <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Fixes: 57e83fb9b746 ("can: hi311x: Add Holt HI-311x CAN driver")
    [mkl: adjust subject, commit message and jump label]
    Signed-off-by: Marc Kleine-Budde <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ceph: add a bunch of missing ceph_path_info initializers [+ + +]

Author: Max Kellermann <[email protected]>
Date:   Tue Feb 24 14:10:29 2026 +0100

    ceph: add a bunch of missing ceph_path_info initializers
    
    commit 43323a5934b660afae687e8e4e95ac328615a5c4 upstream.
    
    ceph_mdsc_build_path() must be called with a zero-initialized
    ceph_path_info parameter, or else the following
    ceph_mdsc_free_path_info() may crash.
    
    Example crash (on Linux 6.18.12):
    
      virt_to_cache: Object is not a Slab page!
      WARNING: CPU: 184 PID: 2871736 at mm/slub.c:6732 kmem_cache_free+0x316/0x400
      [...]
      Call Trace:
       [...]
       ceph_open+0x13d/0x3e0
       do_dentry_open+0x134/0x480
       vfs_open+0x2a/0xe0
       path_openat+0x9a3/0x1160
      [...]
      cache_from_obj: Wrong slab cache. names_cache but object is from ceph_inode_info
      WARNING: CPU: 184 PID: 2871736 at mm/slub.c:6746 kmem_cache_free+0x2dd/0x400
      [...]
      kernel BUG at mm/slub.c:634!
      Oops: invalid opcode: 0000 [#1] SMP NOPTI
      RIP: 0010:__slab_free+0x1a4/0x350
    
    Some of the ceph_mdsc_build_path() callers had initializers, but
    others had not, even though they were all added by commit 15f519e9f883
    ("ceph: fix race condition validating r_parent before applying state").
    The ones without initializer are suspectible to random crashes.  (I can
    imagine it could even be possible to exploit this bug to elevate
    privileges.)
    
    Unfortunately, these Ceph functions are undocumented and its semantics
    can only be derived from the code.  I see that ceph_mdsc_build_path()
    initializes the structure only on success, but not on error.
    
    Calling ceph_mdsc_free_path_info() after a failed
    ceph_mdsc_build_path() call does not even make sense, but that's what
    all callers do, and for it to be safe, the structure must be
    zero-initialized.  The least intrusive approach to fix this is
    therefore to add initializers everywhere.
    
    Cc: [email protected]
    Fixes: 15f519e9f883 ("ceph: fix race condition validating r_parent before applying state")
    Signed-off-by: Max Kellermann <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Ilya Dryomov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ceph: fix i_nlink underrun during async unlink [+ + +]

Author: Max Kellermann <[email protected]>
Date:   Fri Sep 5 23:15:30 2025 +0200

    ceph: fix i_nlink underrun during async unlink
    
    commit ce0123cbb4a40a2f1bbb815f292b26e96088639f upstream.
    
    During async unlink, we drop the `i_nlink` counter before we receive
    the completion (that will eventually update the `i_nlink`) because "we
    assume that the unlink will succeed".  That is not a bad idea, but it
    races against deletions by other clients (or against the completion of
    our own unlink) and can lead to an underrun which emits a WARNING like
    this one:
    
     WARNING: CPU: 85 PID: 25093 at fs/inode.c:407 drop_nlink+0x50/0x68
     Modules linked in:
     CPU: 85 UID: 3221252029 PID: 25093 Comm: php-cgi8.1 Not tainted 6.14.11-cm4all1-ampere #655
     Hardware name: Supermicro ARS-110M-NR/R12SPD-A, BIOS 1.1b 10/17/2023
     pstate: 60400009 (nZCv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
     pc : drop_nlink+0x50/0x68
     lr : ceph_unlink+0x6c4/0x720
     sp : ffff80012173bc90
     x29: ffff80012173bc90 x28: ffff086d0a45aaf8 x27: ffff0871d0eb5680
     x26: ffff087f2a64a718 x25: 0000020000000180 x24: 0000000061c88647
     x23: 0000000000000002 x22: ffff07ff9236d800 x21: 0000000000001203
     x20: ffff07ff9237b000 x19: ffff088b8296afc0 x18: 00000000f3c93365
     x17: 0000000000070000 x16: ffff08faffcbdfe8 x15: ffff08faffcbdfec
     x14: 0000000000000000 x13: 45445f65645f3037 x12: 34385f6369706f74
     x11: 0000a2653104bb20 x10: ffffd85f26d73290 x9 : ffffd85f25664f94
     x8 : 00000000000000c0 x7 : 0000000000000000 x6 : 0000000000000002
     x5 : 0000000000000081 x4 : 0000000000000481 x3 : 0000000000000000
     x2 : 0000000000000000 x1 : 0000000000000000 x0 : ffff08727d3f91e8
     Call trace:
      drop_nlink+0x50/0x68 (P)
      vfs_unlink+0xb0/0x2e8
      do_unlinkat+0x204/0x288
      __arm64_sys_unlinkat+0x3c/0x80
      invoke_syscall.constprop.0+0x54/0xe8
      do_el0_svc+0xa4/0xc8
      el0_svc+0x18/0x58
      el0t_64_sync_handler+0x104/0x130
      el0t_64_sync+0x154/0x158
    
    In ceph_unlink(), a call to ceph_mdsc_submit_request() submits the
    CEPH_MDS_OP_UNLINK to the MDS, but does not wait for completion.
    
    Meanwhile, between this call and the following drop_nlink() call, a
    worker thread may process a CEPH_CAP_OP_IMPORT, CEPH_CAP_OP_GRANT or
    just a CEPH_MSG_CLIENT_REPLY (the latter of which could be our own
    completion).  These will lead to a set_nlink() call, updating the
    `i_nlink` counter to the value received from the MDS.  If that new
    `i_nlink` value happens to be zero, it is illegal to decrement it
    further.  But that is exactly what ceph_unlink() will do then.
    
    The WARNING can be reproduced this way:
    
    1. Force async unlink; only the async code path is affected.  Having
       no real clue about Ceph internals, I was unable to find out why the
       MDS wouldn't give me the "Fxr" capabilities, so I patched
       get_caps_for_async_unlink() to always succeed.
    
       (Note that the WARNING dump above was found on an unpatched kernel,
       without this kludge - this is not a theoretical bug.)
    
    2. Add a sleep call after ceph_mdsc_submit_request() so the unlink
       completion gets handled by a worker thread before drop_nlink() is
       called.  This guarantees that the `i_nlink` is already zero before
       drop_nlink() runs.
    
    The solution is to skip the counter decrement when it is already zero,
    but doing so without a lock is still racy (TOCTOU).  Since
    ceph_fill_inode() and handle_cap_grant() both hold the
    `ceph_inode_info.i_ceph_lock` spinlock while set_nlink() runs, this
    seems like the proper lock to protect the `i_nlink` updates.
    
    I found prior art in NFS and SMB (using `inode.i_lock`) and AFS (using
    `afs_vnode.cb_lock`).  All three have the zero check as well.
    
    Cc: [email protected]
    Fixes: 2ccb45462aea ("ceph: perform asynchronous unlink if we have sufficient caps")
    Signed-off-by: Max Kellermann <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Ilya Dryomov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ceph: fix memory leaks in ceph_mdsc_build_path() [+ + +]

Author: Max Kellermann <[email protected]>
Date:   Tue Feb 24 14:26:57 2026 +0100

    ceph: fix memory leaks in ceph_mdsc_build_path()
    
    commit 040d159a45ded7f33201421a81df0aa2a86e5a0b upstream.
    
    Add __putname() calls to error code paths that did not free the "path"
    pointer obtained by __getname().  If ownership of this pointer is not
    passed to the caller via path_info.path, the function must free it
    before returning.
    
    Cc: [email protected]
    Fixes: 3fd945a79e14 ("ceph: encode encrypted name in ceph_mdsc_build_path and dentry release")
    Fixes: 550f7ca98ee0 ("ceph: give up on paths longer than PATH_MAX")
    Signed-off-by: Max Kellermann <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Ilya Dryomov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cgroup: fix race between task migration and iteration [+ + +]

Author: Qingye Zhao <[email protected]>
Date:   Wed Feb 11 09:24:04 2026 +0000

    cgroup: fix race between task migration and iteration
    
    commit 5ee01f1a7343d6a3547b6802ca2d4cdce0edacb1 upstream.
    
    When a task is migrated out of a css_set, cgroup_migrate_add_task()
    first moves it from cset->tasks to cset->mg_tasks via:
    
        list_move_tail(&task->cg_list, &cset->mg_tasks);
    
    If a css_task_iter currently has it->task_pos pointing to this task,
    css_set_move_task() calls css_task_iter_skip() to keep the iterator
    valid. However, since the task has already been moved to ->mg_tasks,
    the iterator is advanced relative to the mg_tasks list instead of the
    original tasks list. As a result, remaining tasks on cset->tasks, as
    well as tasks queued on cset->mg_tasks, can be skipped by iteration.
    
    Fix this by calling css_set_skip_task_iters() before unlinking
    task->cg_list from cset->tasks. This advances all active iterators to
    the next task on cset->tasks, so iteration continues correctly even
    when a task is concurrently being migrated.
    
    This race is hard to hit in practice without instrumentation, but it
    can be reproduced by artificially slowing down cgroup_procs_show().
    For example, on an Android device a temporary
    /sys/kernel/cgroup/cgroup_test knob can be added to inject a delay
    into cgroup_procs_show(), and then:
    
      1) Spawn three long-running tasks (PIDs 101, 102, 103).
      2) Create a test cgroup and move the tasks into it.
      3) Enable a large delay via /sys/kernel/cgroup/cgroup_test.
      4) In one shell, read cgroup.procs from the test cgroup.
      5) Within the delay window, in another shell migrate PID 102 by
         writing it to a different cgroup.procs file.
    
    Under this setup, cgroup.procs can intermittently show only PID 101
    while skipping PID 103. Once the migration completes, reading the
    file again shows all tasks as expected.
    
    Note that this change does not allow removing the existing
    css_set_skip_task_iters() call in css_set_move_task(). The new call
    in cgroup_migrate_add_task() only handles iterators that are racing
    with migration while the task is still on cset->tasks. Iterators may
    also start after the task has been moved to cset->mg_tasks. If we
    dropped css_set_skip_task_iters() from css_set_move_task(), such
    iterators could keep task_pos pointing to a migrating task, causing
    css_task_iter_advance() to malfunction on the destination css_set,
    up to and including crashes or infinite loops.
    
    The race window between migration and iteration is very small, and
    css_task_iter is not on a hot path. In the worst case, when an
    iterator is positioned on the first thread of the migrating process,
    cgroup_migrate_add_task() may have to skip multiple tasks via
    css_set_skip_task_iters(). However, this only happens when migration
    and iteration actually race, so the performance impact is negligible
    compared to the correctness fix provided here.
    
    Fixes: b636fd38dc40 ("cgroup: Implement css_task_iter_skip()")
    Cc: [email protected] # v5.2+
    Signed-off-by: Qingye Zhao <[email protected]>
    Reviewed-by: Michal Koutný <[email protected]>
    Signed-off-by: Tejun Heo <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cifs: make default value of retrans as zero [+ + +]

Author: Shyam Prasad N <[email protected]>
Date:   Wed Mar 11 10:48:54 2026 +0530

    cifs: make default value of retrans as zero
    
    commit e3beefd3af09f8e460ddaf39063d3d7664d7ab59 upstream.
    
    When retrans mount option was introduced, the default value was set
    as 1. However, in the light of some bugs that this has exposed recently
    we should change it to 0 and retain the old behaviour before this option
    was introduced.
    
    Cc: <[email protected]>
    Reviewed-by: Bharath SM <[email protected]>
    Signed-off-by: Shyam Prasad N <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cifs: open files should not hold ref on superblock [+ + +]

Author: Shyam Prasad N <[email protected]>
Date:   Wed Mar 18 17:31:29 2026 -0400

    cifs: open files should not hold ref on superblock
    
    [ Upstream commit 340cea84f691c5206561bb2e0147158fe02070be ]
    
    Today whenever we deal with a file, in addition to holding
    a reference on the dentry, we also get a reference on the
    superblock. This happens in two cases:
    1. when a new cinode is allocated
    2. when an oplock break is being processed
    
    The reasoning for holding the superblock ref was to make sure
    that when umount happens, if there are users of inodes and
    dentries, it does not try to clean them up and wait for the
    last ref to superblock to be dropped by last of such users.
    
    But the side effect of doing that is that umount silently drops
    a ref on the superblock and we could have deferred closes and
    lease breaks still holding these refs.
    
    Ideally, we should ensure that all of these users of inodes and
    dentries are cleaned up at the time of umount, which is what this
    code is doing.
    
    This code change allows these code paths to use a ref on the
    dentry (and hence the inode). That way, umount is
    ensured to clean up SMB client resources when it's the last
    ref on the superblock (For ex: when same objects are shared).
    
    The code change also moves the call to close all the files in
    deferred close list to the umount code path. It also waits for
    oplock_break workers to be flushed before calling
    kill_anon_super (which eventually frees up those objects).
    
    Fixes: 24261fc23db9 ("cifs: delay super block destruction until all cifsFileInfo objects are gone")
    Fixes: 705c79101ccf ("smb: client: fix use-after-free in cifs_oplock_break")
    Cc: <[email protected]>
    Signed-off-by: Shyam Prasad N <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    [ replaced kmalloc_obj() with kmalloc(sizeof(...)) ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cleanup: Provide retain_and_null_ptr() [+ + +]

Author: Thomas Gleixner <[email protected]>
Date:   Mon Mar 16 19:14:30 2026 -0400

    cleanup: Provide retain_and_null_ptr()
    
    [ Upstream commit 092d00ead733563f6d278295e0b5c5f97558b726 ]
    
    In cases where an allocation is consumed by another function, the
    allocation needs to be retained on success or freed on failure. The code
    pattern is usually:
    
            struct foo *f = kzalloc(sizeof(*f), GFP_KERNEL);
            struct bar *b;
    
            ,,,
            // Initialize f
            ...
            if (ret)
                    goto free;
            ...
            bar = bar_create(f);
            if (!bar) {
                    ret = -ENOMEM;
                    goto free;
            }
            ...
            return 0;
    free:
            kfree(f);
            return ret;
    
    This prevents using __free(kfree) on @f because there is no canonical way
    to tell the cleanup code that the allocation should not be freed.
    
    Abusing no_free_ptr() by force ignoring the return value is not really a
    sensible option either.
    
    Provide an explicit macro retain_and_null_ptr(), which NULLs the cleanup
    pointer. That makes it easy to analyze and reason about.
    
    Signed-off-by: Thomas Gleixner <[email protected]>
    Reviewed-by: Jonathan Cameron <[email protected]>
    Reviewed-by: James Bottomley <[email protected]>
    Link: https://lore.kernel.org/all/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

clsact: Fix use-after-free in init/destroy rollback asymmetry [+ + +]

Author: Daniel Borkmann <[email protected]>
Date:   Fri Mar 13 07:55:31 2026 +0100

    clsact: Fix use-after-free in init/destroy rollback asymmetry
    
    [ Upstream commit a0671125d4f55e1e98d9bde8a0b671941987e208 ]
    
    Fix a use-after-free in the clsact qdisc upon init/destroy rollback asymmetry.
    The latter is achieved by first fully initializing a clsact instance, and
    then in a second step having a replacement failure for the new clsact qdisc
    instance. clsact_init() initializes ingress first and then takes care of the
    egress part. This can fail midway, for example, via tcf_block_get_ext(). Upon
    failure, the kernel will trigger the clsact_destroy() callback.
    
    Commit 1cb6f0bae504 ("bpf: Fix too early release of tcx_entry") details the
    way how the transition is happening. If tcf_block_get_ext on the q->ingress_block
    ends up failing, we took the tcx_miniq_inc reference count on the ingress
    side, but not yet on the egress side. clsact_destroy() tests whether the
    {ingress,egress}_entry was non-NULL. However, even in midway failure on the
    replacement, both are in fact non-NULL with a valid egress_entry from the
    previous clsact instance.
    
    What we really need to test for is whether the qdisc instance-specific ingress
    or egress side previously got initialized. This adds a small helper for checking
    the miniq initialization called mini_qdisc_pair_inited, and utilizes that upon
    clsact_destroy() in order to fix the use-after-free scenario. Convert the
    ingress_destroy() side as well so both are consistent to each other.
    
    Fixes: 1cb6f0bae504 ("bpf: Fix too early release of tcx_entry")
    Reported-by: Keenan Dong <[email protected]>
    Signed-off-by: Daniel Borkmann <[email protected]>
    Cc: Martin KaFai Lau <[email protected]>
    Acked-by: Martin KaFai Lau <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

crypto: atmel-sha204a - Fix OOM ->tfm_count leak [+ + +]

Author: Thorsten Blum <[email protected]>
Date:   Wed Mar 18 20:59:41 2026 -0400

    crypto: atmel-sha204a - Fix OOM ->tfm_count leak
    
    [ Upstream commit d240b079a37e90af03fd7dfec94930eb6c83936e ]
    
    If memory allocation fails, decrement ->tfm_count to avoid blocking
    future reads.
    
    Cc: [email protected]
    Fixes: da001fb651b0 ("crypto: atmel-i2c - add support for SHA204A random number generator")
    Signed-off-by: Thorsten Blum <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    [ adapted kmalloc_obj() macro to kmalloc(sizeof()) ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

device property: Allow secondary lookup in fwnode_get_next_child_node() [+ + +]

Author: Andy Shevchenko <[email protected]>
Date:   Tue Feb 10 14:58:22 2026 +0100

    device property: Allow secondary lookup in fwnode_get_next_child_node()
    
    commit 2692c614f8f05929d692b3dbfd3faef1f00fbaf0 upstream.
    
    When device_get_child_node_count() got split to the fwnode and device
    respective APIs, the fwnode didn't inherit the ability to traverse over
    the secondary fwnode. Hence any user, that switches from device to fwnode
    API misses this feature. In particular, this was revealed by the commit
    1490cbb9dbfd ("device property: Split fwnode_get_child_node_count()")
    that effectively broke the GPIO enumeration on Intel Galileo boards.
    Fix this by moving the secondary lookup from device to fwnode API.
    
    Note, in general no device_*() API should go into the depth of the fwnode
    implementation.
    
    Fixes: 114dbb4fa7c4 ("drivers property: When no children in primary, try secondary")
    Cc: [email protected]
    Signed-off-by: Andy Shevchenko <[email protected]>
    Reviewed-by: Rafael J. Wysocki (Intel) <[email protected]>
    Reviewed-by: Sakari Ailus <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Danilo Krummrich <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dm-verity: disable recursive forward error correction [+ + +]

Author: Mikulas Patocka <[email protected]>
Date:   Thu Feb 26 12:35:00 2026 +0800

    dm-verity: disable recursive forward error correction
    
    [ Upstream commit d9f3e47d3fae0c101d9094bc956ed24e7a0ee801 ]
    
    There are two problems with the recursive correction:
    
    1. It may cause denial-of-service. In fec_read_bufs, there is a loop that
    has 253 iterations. For each iteration, we may call verity_hash_for_block
    recursively. There is a limit of 4 nested recursions - that means that
    there may be at most 253^4 (4 billion) iterations. Red Hat QE team
    actually created an image that pushes dm-verity to this limit - and this
    image just makes the udev-worker process get stuck in the 'D' state.
    
    2. It doesn't work. In fec_read_bufs we store data into the variable
    "fio->bufs", but fio bufs is shared between recursive invocations, if
    "verity_hash_for_block" invoked correction recursively, it would
    overwrite partially filled fio->bufs.
    
    Signed-off-by: Mikulas Patocka <[email protected]>
    Reported-by: Guangwu Zhang <[email protected]>
    Reviewed-by: Sami Tolvanen <[email protected]>
    Reviewed-by: Eric Biggers <[email protected]>
    [ The context change is due to the commit bdf253d580d7
    ("dm-verity: remove support for asynchronous hashes")
    in v6.18 which is irrelevant to the logic of this patch. ]
    Signed-off-by: Rahul Sharma <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dmaengine: mmp_pdma: Fix race condition in mmp_pdma_residue() [+ + +]

Author: Guodong Xu <[email protected]>
Date:   Sat Feb 28 11:02:47 2026 +0800

    dmaengine: mmp_pdma: Fix race condition in mmp_pdma_residue()
    
    [ Upstream commit a143545855bc2c6e1330f6f57ae375ac44af00a7 ]
    
    Add proper locking in mmp_pdma_residue() to prevent use-after-free when
    accessing descriptor list and descriptor contents.
    
    The race occurs when multiple threads call tx_status() while the tasklet
    on another CPU is freeing completed descriptors:
    
    CPU 0                              CPU 1
    -----                              -----
    mmp_pdma_tx_status()
    mmp_pdma_residue()
      -> NO LOCK held
         list_for_each_entry(sw, ..)
                                       DMA interrupt
                                       dma_do_tasklet()
                                         -> spin_lock(&desc_lock)
                                            list_move(sw->node, ...)
                                            spin_unlock(&desc_lock)
      |                                     dma_pool_free(sw) <- FREED!
      -> access sw->desc <- UAF!
    
    This issue can be reproduced when running dmatest on the same channel with
    multiple threads (threads_per_chan > 1).
    
    Fix by protecting the chain_running list iteration and descriptor access
    with the chan->desc_lock spinlock.
    
    Signed-off-by: Juan Li <[email protected]>
    Signed-off-by: Guodong Xu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Vinod Koul <[email protected]>
    [ Minor context conflict resolved. ]
    Signed-off-by: Wenshan Lan <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/display: Add pixel_clock to amd_pp_display_configuration [+ + +]

Author: Timur Kristóf <[email protected]>
Date:   Tue Sep 9 16:17:50 2025 +0200

    drm/amd/display: Add pixel_clock to amd_pp_display_configuration
    
    commit b515dcb0dc4e85d8254f5459cfb32fce88dacbfb upstream.
    
    This commit adds the pixel_clock field to the display config
    struct so that power management (DPM) can use it.
    
    We currently don't have a proper bandwidth calculation on old
    GPUs with DCE 6-10 because dce_calcs only supports DCE 11+.
    So the power management (DPM) on these GPUs may need to make
    ad-hoc decisions for display based on the pixel clock.
    
    Also rename sym_clock to pixel_clock in dm_pp_single_disp_config
    to avoid confusion with other code where the sym_clock refers to
    the DisplayPort symbol clock.
    
    Signed-off-by: Timur Kristóf <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Rosen Penev <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/display: Fallback to boot snapshot for dispclk [+ + +]

Author: Dillon Varone <[email protected]>
Date:   Wed Feb 18 14:34:28 2026 -0500

    drm/amd/display: Fallback to boot snapshot for dispclk
    
    commit 30d937f63bd19bbcaafa4b892eb251f8bbbf04ef upstream.
    
    [WHY & HOW]
    If the dentist is unavailable, fallback to reading CLKIP via the boot
    snapshot to get the current dispclk.
    
    Reviewed-by: Nicholas Kazlauskas <[email protected]>
    Signed-off-by: Dillon Varone <[email protected]>
    Signed-off-by: Alex Hung <[email protected]>
    Cc: Mario Limonciello <[email protected]>
    Cc: Alex Deucher <[email protected]>
    Tested-by: Dan Wheeler <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 2ab77600d1e55a042c02437326d3c7563e853c6c)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/display: Fix DisplayID not-found handling in parse_edid_displayid_vrr() [+ + +]

Author: Srinivasan Shanmugam <[email protected]>
Date:   Sun Mar 15 18:30:26 2026 +0530

    drm/amd/display: Fix DisplayID not-found handling in parse_edid_displayid_vrr()
    
    [ Upstream commit 2323b019651ad81c20a0f7f817c63392b3110652 ]
    
    parse_edid_displayid_vrr() searches the EDID extension blocks for a
    DisplayID extension before parsing the dynamic video timing range.
    
    The code previously checked whether edid_ext was NULL after the search
    loop. However, edid_ext is assigned during each iteration of the loop,
    so it will never be NULL once the loop has executed. If no DisplayID
    extension is found, edid_ext ends up pointing to the last extension
    block, and the NULL check does not correctly detect the failure case.
    
    Instead, check whether the loop completed without finding a matching
    DisplayID block by testing "i == edid->extensions". This ensures the
    function exits early when no DisplayID extension is present and avoids
    parsing an unrelated EDID extension block.
    
    Also simplify the EDID validation check using "!edid ||
    !edid->extensions".
    
    Fixes the below:
    drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:13079 parse_edid_displayid_vrr() warn: variable dereferenced before check 'edid_ext' (see line 13075)
    
    Fixes: a638b837d0e6 ("drm/amd/display: Fix refresh rate range for some panel")
    Cc: Roman Li <[email protected]>
    Cc: Alex Hung <[email protected]>
    Cc: Jerry Zuo <[email protected]>
    Cc: Sun peng Li <[email protected]>
    Cc: Tom Chung <[email protected]>
    Cc: Dan Carpenter <[email protected]>
    Cc: Aurabindo Pillai <[email protected]>
    Signed-off-by: Srinivasan Shanmugam <[email protected]>
    Reviewed-by: Tom Chung <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 91c7e6342e98c846b259c57273436fdea4c043f2)
    Signed-off-by: Sasha Levin <[email protected]>

drm/amd/display: Use GFP_ATOMIC in dc_create_stream_for_sink [+ + +]

Author: Natalie Vock <[email protected]>
Date:   Mon Mar 9 10:03:17 2026 -0400

    drm/amd/display: Use GFP_ATOMIC in dc_create_stream_for_sink
    
    [ Upstream commit 28dfe4317541e57fe52f9a290394cd29c348228b ]
    
    This can be called while preemption is disabled, for example by
    dcn32_internal_validate_bw which is called with the FPU active.
    
    Fixes "BUG: scheduling while atomic" messages I encounter on my Navi31
    machine.
    
    Signed-off-by: Natalie Vock <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit b42dae2ebc5c84a68de63ec4ffdfec49362d53f1)
    Cc: [email protected]
    [ Context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/display: Wrap dcn32_override_min_req_memclk() in DC_FP_{START, END} [+ + +]

Author: Xi Ruoyao <[email protected]>
Date:   Fri Mar 6 14:28:03 2026 +0800

    drm/amd/display: Wrap dcn32_override_min_req_memclk() in DC_FP_{START, END}
    
    commit ebe82c6e75cfc547154d0fd843b0dd6cca3d548f upstream.
    
    [Why]
    The dcn32_override_min_req_memclk function is in dcn32_fpu.c, which is
    compiled with CC_FLAGS_FPU into FP instructions.  So when we call it we
    must use DC_FP_{START,END} to save and restore the FP context, and
    prepare the FP unit on architectures like LoongArch where the FP unit
    isn't always on.
    
    Reported-by: LiarOnce <[email protected]>
    Fixes: ee7be8f3de1c ("drm/amd/display: Limit DCN32 8 channel or less parts to DPM1 for FPO")
    Signed-off-by: Xi Ruoyao <[email protected]>
    Reviewed-by: Alex Hung <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 25bb1d54ba3983c064361033a8ec15474fece37e)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/pm: add missing od setting PP_OD_FEATURE_ZERO_FAN_BIT for smu v14 [+ + +]

Author: Yang Wang <[email protected]>
Date:   Tue Mar 3 21:14:10 2026 -0500

    drm/amd/pm: add missing od setting PP_OD_FEATURE_ZERO_FAN_BIT for smu v14
    
    [ Upstream commit 9d4837a26149355ffe3a1f80de80531eafdd3353 ]
    
    add missing od setting PP_OD_FEATURE_ZERO_FAN_BIT for smu v14.0.2/14.0.3
    
    Fixes: 9710b84e2a6a ("drm/amd/pm: add overdrive support on smu v14.0.2/3")
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/5018
    Signed-off-by: Yang Wang <[email protected]>
    Acked-by: Alex Deucher <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 1b5cf07d80bb16d1593579ccdb23f08ea4262c14)
    Signed-off-by: Sasha Levin <[email protected]>

drm/amd/pm: remove invalid gpu_metrics.energy_accumulator on smu v13.0.x [+ + +]

Author: Yang Wang <[email protected]>
Date:   Wed Feb 25 22:51:06 2026 -0500

    drm/amd/pm: remove invalid gpu_metrics.energy_accumulator on smu v13.0.x
    
    commit 68785c5e79e0fc1eacf63026fbba32be3867f410 upstream.
    
    v1:
    The metrics->EnergyAccumulator field has been deprecated on newer pmfw.
    
    v2:
    add smu 13.0.0/13.0.7/13.0.10 support.
    
    Signed-off-by: Yang Wang <[email protected]>
    Acked-by: Alex Deucher <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 8de9edb35976fa56565dc8fbb5d1310e8e10187c)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/pm: Use pm_display_cfg in legacy DPM (v2) [+ + +]

Author: Timur Kristóf <[email protected]>
Date:   Tue Sep 9 16:17:51 2025 +0200

    drm/amd/pm: Use pm_display_cfg in legacy DPM (v2)
    
    commit 9d73b107a61b73e7101d4b728ddac3d2c77db111 upstream.
    
    This commit is necessary for DC to function well with chips
    that use the legacy power management code, ie. SI and KV.
    Communicate display information from DC to the legacy PM code.
    
    Currently DC uses pm_display_cfg to communicate power management
    requirements from the display code to the DPM code.
    However, the legacy (non-DC) code path used different fields
    and therefore could not take into account anything from DC.
    
    Change the legacy display code to fill the same pm_display_cfg
    struct as DC and use the same in the legacy DPM code.
    
    To ease review and reduce churn, this commit does not yet
    delete the now unneeded code, that is done in the next commit.
    
    v2:
    Rebase.
    Fix single_display in amdgpu_dpm_pick_power_state.
    
    Signed-off-by: Timur Kristóf <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Rosen Penev <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd: Disable MES LR compute W/A [+ + +]

Author: Mario Limonciello <[email protected]>
Date:   Wed Feb 25 10:51:16 2026 -0600

    drm/amd: Disable MES LR compute W/A
    
    commit 6b0d812971370c64b837a2db4275410f478272fe upstream.
    
    A workaround was introduced in commit 1fb710793ce2 ("drm/amdgpu: Enable
    MES lr_compute_wa by default") to help with some hangs observed in gfx1151.
    
    This WA didn't fully fix the issue.  It was actually fixed by adjusting
    the VGPR size to the correct value that matched the hardware in commit
    b42f3bf9536c ("drm/amdkfd: bump minimum vgpr size for gfx1151").
    
    There are reports of instability on other products with newer GC microcode
    versions, and I believe they're caused by this workaround. As we don't
    need the workaround any more, remove it.
    
    Fixes: b42f3bf9536c ("drm/amdkfd: bump minimum vgpr size for gfx1151")
    Acked-by: Alex Deucher <[email protected]>
    Signed-off-by: Mario Limonciello <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 9973e64bd6ee7642860a6f3b6958cbf14e89cabd)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd: fix dcn 2.01 check [+ + +]

Author: Andy Nguyen <[email protected]>
Date:   Sun Mar 15 17:51:47 2026 +0100

    drm/amd: fix dcn 2.01 check
    
    [ Upstream commit 39f44f54afa58661ecae9c27e15f5dbce2372892 ]
    
    The ASICREV_IS_BEIGE_GOBY_P check always took precedence, because it includes all chip revisions upto NV_UNKNOWN.
    
    Fixes: 54b822b3eac3 ("drm/amd/display: Use dce_version instead of chip_id")
    Signed-off-by: Andy Nguyen <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 9c7be0efa6f0daa949a5f3e3fdf9ea090b0713cb)
    Signed-off-by: Sasha Levin <[email protected]>

drm/amd: Set num IP blocks to 0 if discovery fails [+ + +]

Author: Mario Limonciello <[email protected]>
Date:   Tue Mar 10 11:58:22 2026 -0500

    drm/amd: Set num IP blocks to 0 if discovery fails
    
    commit 3646ff28780b4c52c5b5081443199e7a430110e5 upstream.
    
    If discovery has failed for any reason (such as no support for a block)
    then there is no need to unwind all the IP blocks in fini. In this
    condition there can actually be failures during the unwind too.
    
    Reset num_ip_blocks to zero during failure path and skip the unnecessary
    cleanup path.
    
    Suggested-by: Lijo Lazar <[email protected]>
    Reviewed-by: Lijo Lazar <[email protected]>
    Signed-off-by: Mario Limonciello <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit fae5984296b981c8cc3acca35b701c1f332a6cd8)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/gmc9.0: add bounds checking for cid [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Mon Mar 16 15:51:08 2026 -0400

    drm/amdgpu/gmc9.0: add bounds checking for cid
    
    commit f39e1270277f4b06db0b2c6ec9405b6dd766fb13 upstream.
    
    The value should never exceed the array size as those
    are the only values the hardware is expected to return,
    but add checks anyway.
    
    Cc: Benjamin Cheng <[email protected]>
    Reviewed-by: Benjamin Cheng <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit e14d468304832bcc4a082d95849bc0a41b18ddea)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/mmhub2.0: add bounds checking for cid [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Wed Mar 4 17:22:43 2026 -0500

    drm/amdgpu/mmhub2.0: add bounds checking for cid
    
    commit 0b26edac4ac5535df1f63e6e8ab44c24fe1acad7 upstream.
    
    The value should never exceed the array size as those
    are the only values the hardware is expected to return,
    but add checks anyway.
    
    Reviewed-by: Benjamin Cheng <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit e064cef4b53552602bb6ac90399c18f662f3cacd)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/mmhub2.3: add bounds checking for cid [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Wed Mar 4 17:24:10 2026 -0500

    drm/amdgpu/mmhub2.3: add bounds checking for cid
    
    commit a54403a534972af5d9ba5aaa3bb6ead612500ec6 upstream.
    
    The value should never exceed the array size as those
    are the only values the hardware is expected to return,
    but add checks anyway.
    
    Reviewed-by: Benjamin Cheng <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 89cd90375c19fb45138990b70e9f4ba4806f05c4)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/mmhub3.0.1: add bounds checking for cid [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Wed Mar 4 17:24:35 2026 -0500

    drm/amdgpu/mmhub3.0.1: add bounds checking for cid
    
    commit 5d4e88bcfef29569a1db224ef15e28c603666c6d upstream.
    
    The value should never exceed the array size as those
    are the only values the hardware is expected to return,
    but add checks anyway.
    
    Reviewed-by: Benjamin Cheng <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 5f76083183363c4528a4aaa593f5d38c28fe7d7b)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/mmhub3.0.2: add bounds checking for cid [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Wed Mar 4 17:25:09 2026 -0500

    drm/amdgpu/mmhub3.0.2: add bounds checking for cid
    
    commit e5e6d67b1ce9764e67aef2d0eef9911af53ad99a upstream.
    
    The value should never exceed the array size as those
    are the only values the hardware is expected to return,
    but add checks anyway.
    
    Reviewed-by: Benjamin Cheng <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 1441f52c7f6ae6553664aa9e3e4562f6fc2fe8ea)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/mmhub3.0: add bounds checking for cid [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Wed Mar 4 17:25:30 2026 -0500

    drm/amdgpu/mmhub3.0: add bounds checking for cid
    
    commit cdb82ecbeccb55fae75a3c956b605f7801a30db1 upstream.
    
    The value should never exceed the array size as those
    are the only values the hardware is expected to return,
    but add checks anyway.
    
    Reviewed-by: Benjamin Cheng <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit f14f27bbe2a3ed7af32d5f6eaf3f417139f45253)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/mmhub4.1.0: add bounds checking for cid [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Wed Mar 4 17:25:56 2026 -0500

    drm/amdgpu/mmhub4.1.0: add bounds checking for cid
    
    commit 3cdd405831d8cc50a5eae086403402697bb98a4a upstream.
    
    The value should never exceed the array size as those
    are the only values the hardware is expected to return,
    but add checks anyway.
    
    Reviewed-by: Benjamin Cheng <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 04f063d85090f5dd0c671010ce88ee49d9dcc8ed)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/vcn5: Add SMU dpm interface type [+ + +]

Author: sguttula <[email protected]>
Date:   Sat Feb 21 10:03:32 2026 +0530

    drm/amdgpu/vcn5: Add SMU dpm interface type
    
    [ Upstream commit a5fe1a54513196e4bc8f9170006057dc31e7155e ]
    
    This will set AMDGPU_VCN_SMU_DPM_INTERFACE_* smu_type
    based on soc type and fixing ring timeout issue seen
    for DPM enabled case.
    
    Signed-off-by: sguttula <[email protected]>
    Reviewed-by: Pratik Vishwakarma <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit f0f23c315b38c55e8ce9484cf59b65811f350630)
    Signed-off-by: Sasha Levin <[email protected]>

drm/amdgpu: Add basic validation for RAS header [+ + +]

Author: Lijo Lazar <[email protected]>
Date:   Wed Mar 26 13:28:38 2025 +0530

    drm/amdgpu: Add basic validation for RAS header
    
    commit 5df0d6addb7e9b6f71f7162d1253762a5be9138e upstream.
    
    If RAS header read from EEPROM is corrupted, it could result in trying
    to allocate huge memory for reading the records. Add some validation to
    header fields.
    
    Signed-off-by: Lijo Lazar <[email protected]>
    Reviewed-by: Hawking Zhang <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    [ RAS_TABLE_VER_V3 is not supported in v6.6.y. ]
    Signed-off-by: Alva Lan <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu: apply state adjust rules to some additional HAINAN vairants [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Mon Sep 25 10:44:07 2023 -0400

    drm/amdgpu: apply state adjust rules to some additional HAINAN vairants
    
    commit 9787f7da186ee8143b7b6d914cfa0b6e7fee2648 upstream.
    
    They need a similar workaround.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/1839
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 0de31d92a173d3d94f28051b0b80a6c98913aed4)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu: Fix use-after-free race in VM acquire [+ + +]

Author: Alysa Liu <[email protected]>
Date:   Thu Feb 5 11:21:45 2026 -0500

    drm/amdgpu: Fix use-after-free race in VM acquire
    
    commit 2c1030f2e84885cc58bffef6af67d5b9d2e7098f upstream.
    
    Replace non-atomic vm->process_info assignment with cmpxchg()
    to prevent race when parent/child processes sharing a drm_file
    both try to acquire the same VM after fork().
    
    Reviewed-by: Harish Kasiviswanathan <[email protected]>
    Signed-off-by: Alysa Liu <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit c7c573275ec20db05be769288a3e3bb2250ec618)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdkfd: Unreserve bo if queue update failed [+ + +]

Author: Philip Yang <[email protected]>
Date:   Tue Dec 9 15:13:23 2025 -0500

    drm/amdkfd: Unreserve bo if queue update failed
    
    [ Upstream commit 2ce75a0b7e1bfddbcb9bc8aeb2e5e7fa99971acf ]
    
    Error handling path should unreserve bo then return failed.
    
    Fixes: 305cd109b761 ("drm/amdkfd: Validate user queue update")
    Signed-off-by: Philip Yang <[email protected]>
    Reviewed-by: Alex Sierra <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit c24afed7de9ecce341825d8ab55a43a254348b33)
    Signed-off-by: Sasha Levin <[email protected]>

drm/bridge: samsung-dsim: Fix memory leak in error path [+ + +]

Author: Osama Abdelkader <[email protected]>
Date:   Mon Feb 9 19:41:14 2026 +0100

    drm/bridge: samsung-dsim: Fix memory leak in error path
    
    commit 803ec1faf7c1823e6e3b1f2aaa81be18528c9436 upstream.
    
    In samsung_dsim_host_attach(), drm_bridge_add() is called to add the
    bridge. However, if samsung_dsim_register_te_irq() or
    pdata->host_ops->attach() fails afterwards, the function returns
    without removing the bridge, causing a memory leak.
    
    Fix this by adding proper error handling with goto labels to ensure
    drm_bridge_remove() is called in all error paths. Also ensure that
    samsung_dsim_unregister_te_irq() is called if the attach operation
    fails after the TE IRQ has been registered.
    
    samsung_dsim_unregister_te_irq() function is moved without changes
    to be before samsung_dsim_host_attach() to avoid forward declaration.
    
    Fixes: e7447128ca4a ("drm: bridge: Generalize Exynos-DSI driver into a Samsung DSIM bridge")
    Cc: [email protected]
    Signed-off-by: Osama Abdelkader <[email protected]>
    Reviewed-by: Luca Ceresoli <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Luca Ceresoli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/bridge: ti-sn65dsi83: fix CHA_DSI_CLK_RANGE rounding [+ + +]

Author: Luca Ceresoli <[email protected]>
Date:   Thu Feb 26 17:16:44 2026 +0100

    drm/bridge: ti-sn65dsi83: fix CHA_DSI_CLK_RANGE rounding
    
    commit 2f22702dc0fee06a240404e0f7ead5b789b253d8 upstream.
    
    The DSI frequency must be in the range:
    
      (CHA_DSI_CLK_RANGE * 5 MHz) <= DSI freq < ((CHA_DSI_CLK_RANGE + 1) * 5 MHz)
    
    So the register value should point to the lower range value, but
    DIV_ROUND_UP() rounds the division to the higher range value, resulting in
    an excess of 1 (unless the frequency is an exact multiple of 5 MHz).
    
    For example for a 437100000 MHz clock CHA_DSI_CLK_RANGE should be 87 (0x57):
    
      (87 * 5 = 435) <= 437.1 < (88 * 5 = 440)
    
    but current code returns 88 (0x58).
    
    Fix the computation by removing the DIV_ROUND_UP().
    
    Fixes: ceb515ba29ba ("drm/bridge: ti-sn65dsi83: Add TI SN65DSI83 and SN65DSI84 driver")
    Cc: [email protected]
    Reviewed-by: Marek Vasut <[email protected]>
    Link: https://patch.msgid.link/20260226-ti-sn65dsi83-dual-lvds-fixes-and-test-pattern-v1-1-2e15f5a9a6a0@bootlin.com
    Signed-off-by: Luca Ceresoli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/bridge: ti-sn65dsi83: halve horizontal syncs for dual LVDS output [+ + +]

Author: Luca Ceresoli <[email protected]>
Date:   Wed Mar 18 12:02:18 2026 -0400

    drm/bridge: ti-sn65dsi83: halve horizontal syncs for dual LVDS output
    
    [ Upstream commit d0d727746944096a6681dc6adb5f123fc5aa018d ]
    
    Dual LVDS output (available on the SN65DSI84) requires HSYNC_PULSE_WIDTH
    and HORIZONTAL_BACK_PORCH to be divided by two with respect to the values
    used for single LVDS output.
    
    While not clearly stated in the datasheet, this is needed according to the
    DSI Tuner [0] output. It also makes sense intuitively because in dual LVDS
    output two pixels at a time are output and so the output clock is half of
    the pixel clock.
    
    Some dual-LVDS panels refuse to show any picture without this fix.
    
    Divide by two HORIZONTAL_FRONT_PORCH too, even though this register is used
    only for test pattern generation which is not currently implemented by this
    driver.
    
    [0] https://www.ti.com/tool/DSI-TUNER
    
    Fixes: ceb515ba29ba ("drm/bridge: ti-sn65dsi83: Add TI SN65DSI83 and SN65DSI84 driver")
    Cc: [email protected]
    Reviewed-by: Marek Vasut <[email protected]>
    Link: https://patch.msgid.link/20260226-ti-sn65dsi83-dual-lvds-fixes-and-test-pattern-v1-2-2e15f5a9a6a0@bootlin.com
    Signed-off-by: Luca Ceresoli <[email protected]>
    [ adapted variable declaration placement ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/bridge: ti-sn65dsi86: Add support for DisplayPort mode with HPD [+ + +]

Author: John Ripple <[email protected]>
Date:   Mon Sep 15 11:45:43 2025 -0600

    drm/bridge: ti-sn65dsi86: Add support for DisplayPort mode with HPD
    
    commit 9133bc3f0564890218cbba6cc7e81ebc0841a6f1 upstream.
    
    Add support for DisplayPort to the bridge, which entails the following:
    - Get and use an interrupt for HPD;
    - Properly clear all status bits in the interrupt handler;
    
    Signed-off-by: John Ripple <[email protected]>
    Reviewed-by: Douglas Anderson <[email protected]>
    Signed-off-by: Douglas Anderson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/bridge: ti-sn65dsi86: Enable HPD polling if IRQ is not used [+ + +]

Author: Franz Schnyder <[email protected]>
Date:   Fri Feb 6 13:37:36 2026 +0100

    drm/bridge: ti-sn65dsi86: Enable HPD polling if IRQ is not used
    
    commit 0b87d51690dd5131cbe9fbd23746b037aab89815 upstream.
    
    Fallback to polling to detect hotplug events on systems without
    interrupts.
    
    On systems where the interrupt line of the bridge is not connected,
    the bridge cannot notify hotplug events. Only add the
    DRM_BRIDGE_OP_HPD flag if an interrupt has been registered
    otherwise remain in polling mode.
    
    Fixes: 55e8ff842051 ("drm/bridge: ti-sn65dsi86: Add HPD for DisplayPort connector type")
    Cc: [email protected] # 6.16: 9133bc3f0564: drm/bridge: ti-sn65dsi86: Add
    Signed-off-by: Franz Schnyder <[email protected]>
    Reviewed-by: Douglas Anderson <[email protected]>
    [dianders: Adjusted Fixes/stable line based on discussion]
    Signed-off-by: Douglas Anderson <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915/alpm: ALPM disable fixes [+ + +]

Author: Jouni Högander <[email protected]>
Date:   Thu Feb 12 08:27:31 2026 +0200

    drm/i915/alpm: ALPM disable fixes
    
    commit eb4a7139e97374f42b7242cc754e77f1623fbcd5 upstream.
    
    PORT_ALPM_CTL is supposed to be written only before link training. Remove
    writing it from ALPM disable.
    
    Also clearing ALPM_CTL_ALPM_AUX_LESS_ENABLE and is not about disabling ALPM
    but switching to AUX-Wake ALPM. Stop touching this bit on ALPM disable.
    
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/7153
    Fixes: 1ccbf135862b ("drm/i915/psr: Enable ALPM on source side for eDP Panel replay")
    Cc: Animesh Manna <[email protected]>
    Cc: Jani Nikula <[email protected]>
    Cc: <[email protected]> # v6.10+
    Signed-off-by: Jouni Högander <[email protected]>
    Reviewed-by: Michał Grzelak <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 008304c9ae75c772d3460040de56e12112cdf5e6)
    Signed-off-by: Joonas Lahtinen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915/dsc: Add helper for writing DSC Selective Update ET parameters [+ + +]

Author: Jouni Högander <[email protected]>
Date:   Wed Mar 4 13:30:10 2026 +0200

    drm/i915/dsc: Add helper for writing DSC Selective Update ET parameters
    
    commit bb5f1cd10101c2567bff4d0e760b74aee7c42f44 upstream.
    
    There are slice row per frame and pic height configuration in DSC Selective
    Update Parameter Set 1 register. Add helper for configuring these.
    
    v2:
      - Add WARN_ON_ONCE if vdsc instances per pipe > 2
      - instead of checking vdsc instances per pipe being > 1 check == 2
    
    Bspec: 71709
    Signed-off-by: Jouni Högander <[email protected]>
    Reviewed-by: Ankit Nautiyal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit c8698d61aeb3f70fe33761ee9d3d0e131b5bc2eb)
    Signed-off-by: Tvrtko Ursulin <[email protected]>
    [tursulin: fixup forward declaration conflict]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915/dsc: Add Selective Update register definitions [+ + +]

Author: Jouni Högander <[email protected]>
Date:   Wed Mar 4 13:30:09 2026 +0200

    drm/i915/dsc: Add Selective Update register definitions
    
    commit c2c79c6d5b939ae8a42ddb884f576bddae685672 upstream.
    
    Add definitions for DSC_SU_PARAMETER_SET_0_DSC0 and
    DSC_SU_PARAMETER_SET_0_DSC1 registers. These are for Selective Update Early
    Transport configuration.
    
    Bspec: 71709
    Signed-off-by: Jouni Högander <[email protected]>
    Reviewed-by: Ankit Nautiyal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 24f96d903daf3dcf8fafe84d3d22b80ef47ba493)
    Signed-off-by: Tvrtko Ursulin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915/gt: Check set_default_submission() before deferencing [+ + +]

Author: Rahul Bukte <[email protected]>
Date:   Tue Feb 3 10:18:39 2026 +0530

    drm/i915/gt: Check set_default_submission() before deferencing
    
    [ Upstream commit 0162ab3220bac870e43e229e6e3024d1a21c3f26 ]
    
    When the i915 driver firmware binaries are not present, the
    set_default_submission pointer is not set. This pointer is
    dereferenced during suspend anyways.
    
    Add a check to make sure it is set before dereferencing.
    
    [   23.289926] PM: suspend entry (deep)
    [   23.293558] Filesystems sync: 0.000 seconds
    [   23.298010] Freezing user space processes
    [   23.302771] Freezing user space processes completed (elapsed 0.000 seconds)
    [   23.309766] OOM killer disabled.
    [   23.313027] Freezing remaining freezable tasks
    [   23.318540] Freezing remaining freezable tasks completed (elapsed 0.001 seconds)
    [   23.342038] serial 00:05: disabled
    [   23.345719] serial 00:02: disabled
    [   23.349342] serial 00:01: disabled
    [   23.353782] sd 0:0:0:0: [sda] Synchronizing SCSI cache
    [   23.358993] sd 1:0:0:0: [sdb] Synchronizing SCSI cache
    [   23.361635] ata1.00: Entering standby power mode
    [   23.368863] ata2.00: Entering standby power mode
    [   23.445187] BUG: kernel NULL pointer dereference, address: 0000000000000000
    [   23.452194] #PF: supervisor instruction fetch in kernel mode
    [   23.457896] #PF: error_code(0x0010) - not-present page
    [   23.463065] PGD 0 P4D 0
    [   23.465640] Oops: Oops: 0010 [#1] SMP NOPTI
    [   23.469869] CPU: 8 UID: 0 PID: 211 Comm: kworker/u48:18 Tainted: G S      W           6.19.0-rc4-00020-gf0b9d8eb98df #10 PREEMPT(voluntary)
    [   23.482512] Tainted: [S]=CPU_OUT_OF_SPEC, [W]=WARN
    [   23.496511] Workqueue: async async_run_entry_fn
    [   23.501087] RIP: 0010:0x0
    [   23.503755] Code: Unable to access opcode bytes at 0xffffffffffffffd6.
    [   23.510324] RSP: 0018:ffffb4a60065fca8 EFLAGS: 00010246
    [   23.515592] RAX: 0000000000000000 RBX: ffff9f428290e000 RCX: 000000000000000f
    [   23.522765] RDX: 0000000000000000 RSI: 0000000000000282 RDI: ffff9f428290e000
    [   23.529937] RBP: ffff9f4282907070 R08: ffff9f4281130428 R09: 00000000ffffffff
    [   23.537111] R10: 0000000000000000 R11: 0000000000000001 R12: ffff9f42829070f8
    [   23.544284] R13: ffff9f4282906028 R14: ffff9f4282900000 R15: ffff9f4282906b68
    [   23.551457] FS:  0000000000000000(0000) GS:ffff9f466b2cf000(0000) knlGS:0000000000000000
    [   23.559588] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [   23.565365] CR2: ffffffffffffffd6 CR3: 000000031c230001 CR4: 0000000000f70ef0
    [   23.572539] PKRU: 55555554
    [   23.575281] Call Trace:
    [   23.577770]  <TASK>
    [   23.579905]  intel_engines_reset_default_submission+0x42/0x60
    [   23.585695]  __intel_gt_unset_wedged+0x191/0x200
    [   23.590360]  intel_gt_unset_wedged+0x20/0x40
    [   23.594675]  gt_sanitize+0x15e/0x170
    [   23.598290]  i915_gem_suspend_late+0x6b/0x180
    [   23.602692]  i915_drm_suspend_late+0x35/0xf0
    [   23.607008]  ? __pfx_pci_pm_suspend_late+0x10/0x10
    [   23.611843]  dpm_run_callback+0x78/0x1c0
    [   23.615817]  device_suspend_late+0xde/0x2e0
    [   23.620037]  async_suspend_late+0x18/0x30
    [   23.624082]  async_run_entry_fn+0x25/0xa0
    [   23.628129]  process_one_work+0x15b/0x380
    [   23.632182]  worker_thread+0x2a5/0x3c0
    [   23.635973]  ? __pfx_worker_thread+0x10/0x10
    [   23.640279]  kthread+0xf6/0x1f0
    [   23.643464]  ? __pfx_kthread+0x10/0x10
    [   23.647263]  ? __pfx_kthread+0x10/0x10
    [   23.651045]  ret_from_fork+0x131/0x190
    [   23.654837]  ? __pfx_kthread+0x10/0x10
    [   23.658634]  ret_from_fork_asm+0x1a/0x30
    [   23.662597]  </TASK>
    [   23.664826] Modules linked in:
    [   23.667914] CR2: 0000000000000000
    [   23.671271] ------------[ cut here ]------------
    
    Signed-off-by: Rahul Bukte <[email protected]>
    Reviewed-by: Suraj Kandpal <[email protected]>
    Signed-off-by: Suraj Kandpal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit daa199abc3d3d1740c9e3a2c3e9216ae5b447cad)
    Fixes: ff44ad51ebf8 ("drm/i915: Move engine->submit_request selection to a vfunc")
    Signed-off-by: Joonas Lahtinen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/i915/psr: Compute PSR entry_setup_frames into intel_crtc_state [+ + +]

Author: Jouni Högander <[email protected]>
Date:   Thu Mar 12 10:37:10 2026 +0200

    drm/i915/psr: Compute PSR entry_setup_frames into intel_crtc_state
    
    commit 7caac659a837af9fd4cad85be851982b88859484 upstream.
    
    PSR entry_setup_frames is currently computed directly into struct
    intel_dp:intel_psr:entry_setup_frames. This causes a problem if mode change
    gets rejected after PSR compute config: Psr_entry_setup_frames computed for
    this rejected state is in intel_dp:intel_psr:entry_setup_frame. Fix this by
    computing it into intel_crtc_state and copy the value into
    intel_dp:intel_psr:entry_setup_frames on PSR enable.
    
    Fixes: 2b981d57e480 ("drm/i915/display: Support PSR entry VSC packet to be transmitted one frame earlier")
    Cc: Mika Kahola <[email protected]>
    Cc: <[email protected]> # v6.8+
    Signed-off-by: Jouni Högander <[email protected]>
    Reviewed-by: Suraj Kandpal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 8c229b4aa00262c13787982e998c61c0783285e0)
    Signed-off-by: Joonas Lahtinen <[email protected]>
    [ dropped intel_psr_needs_wa_18037818876 hunk and adjusted surrounding context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915/psr: Repeat Selective Update area alignment [+ + +]

Author: Jouni Högander <[email protected]>
Date:   Wed Mar 4 13:30:08 2026 +0200

    drm/i915/psr: Repeat Selective Update area alignment
    
    commit 1be2fca84f520105413d0d89ed04bb0ff742ab16 upstream.
    
    Currently we are aligning Selective Update area to cover cursor fully if
    needed only once. It may happen that cursor is in Selective Update area
    after pipe alignment and after that covering cursor plane only
    partially. Fix this by looping alignment as long as alignment isn't needed
    anymore.
    
    v2:
      - do not unecessarily loop if cursor was already fully covered
      - rename aligned as su_area_changed
    
    Fixes: 1bff93b8bc27 ("drm/i915/psr: Extend SU area to cover cursor fully if needed")
    Cc: <[email protected]> # v6.9+
    Signed-off-by: Jouni Högander <[email protected]>
    Reviewed-by: Ankit Nautiyal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 681e12440d8b110350a5709101169f319e10ccbb)
    Signed-off-by: Tvrtko Ursulin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915/psr: Write DSC parameters on Selective Update in ET mode [+ + +]

Author: Jouni Högander <[email protected]>
Date:   Wed Mar 4 13:30:11 2026 +0200

    drm/i915/psr: Write DSC parameters on Selective Update in ET mode
    
    commit 5923a6e0459fdd3edac4ad5abccb24d777d8f1b6 upstream.
    
    There are slice row per frame and pic height parameters in DSC that needs
    to be configured on every Selective Update in Early Transport mode. Use
    helper provided by DSC code to configure these on Selective Update when in
    Early Transport mode. Also fill crtc_state->psr2_su_area with full frame
    area on full frame update for DSC calculation.
    
    v2: move psr2_su_area under skip_sel_fetch_set_loop label
    
    Bspec: 68927, 71709
    Fixes: 467e4e061c44 ("drm/i915/psr: Enable psr2 early transport as possible")
    Cc: <[email protected]> # v6.9+
    Signed-off-by: Jouni Högander <[email protected]>
    Reviewed-by: Ankit Nautiyal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 3140af2fab505a4cd47d516284529bf1585628be)
    Signed-off-by: Tvrtko Ursulin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915: Fix potential overflow of shmem scatterlist length [+ + +]

Author: Janusz Krzysztofik <[email protected]>
Date:   Tue Feb 24 10:49:06 2026 +0100

    drm/i915: Fix potential overflow of shmem scatterlist length
    
    commit 029ae067431ab9d0fca479bdabe780fa436706ea upstream.
    
    When a scatterlists table of a GEM shmem object of size 4 GB or more is
    populated with pages allocated from a folio, unsigned int .length
    attribute of a scatterlist may get overflowed if total byte length of
    pages allocated to that single scatterlist happens to reach or cross the
    4GB limit.  As a consequence, users of the object may suffer from hitting
    unexpected, premature end of the object's backing pages.
    
    [278.780187] ------------[ cut here ]------------
    [278.780377] WARNING: CPU: 1 PID: 2326 at drivers/gpu/drm/i915/i915_mm.c:55 remap_sg+0x199/0x1d0 [i915]
    ...
    [278.780654] CPU: 1 UID: 0 PID: 2326 Comm: gem_mmap_offset Tainted: G S   U              6.17.0-rc1-CI_DRM_16981-ged823aaa0607+ #1 PREEMPT(voluntary)
    [278.780656] Tainted: [S]=CPU_OUT_OF_SPEC, [U]=USER
    [278.780658] Hardware name: Intel Corporation Meteor Lake Client Platform/MTL-P LP5x T3 RVP, BIOS MTLPFWI1.R00.3471.D91.2401310918 01/31/2024
    [278.780659] RIP: 0010:remap_sg+0x199/0x1d0 [i915]
    ...
    [278.780786] Call Trace:
    [278.780787]  <TASK>
    [278.780788]  ? __apply_to_page_range+0x3e6/0x910
    [278.780795]  ? __pfx_remap_sg+0x10/0x10 [i915]
    [278.780906]  apply_to_page_range+0x14/0x30
    [278.780908]  remap_io_sg+0x14d/0x260 [i915]
    [278.781013]  vm_fault_cpu+0xd2/0x330 [i915]
    [278.781137]  __do_fault+0x3a/0x1b0
    [278.781140]  do_fault+0x322/0x640
    [278.781143]  __handle_mm_fault+0x938/0xfd0
    [278.781150]  handle_mm_fault+0x12c/0x300
    [278.781152]  ? lock_mm_and_find_vma+0x4b/0x760
    [278.781155]  do_user_addr_fault+0x2d6/0x8e0
    [278.781160]  exc_page_fault+0x96/0x2c0
    [278.781165]  asm_exc_page_fault+0x27/0x30
    ...
    
    That issue was apprehended by the author of a change that introduced it,
    and potential risk even annotated with a comment, but then never addressed.
    
    When adding folio pages to a scatterlist table, take care of byte length
    of any single scatterlist not exceeding max_segment.
    
    Fixes: 0b62af28f249b ("i915: convert shmem_sg_free_table() to use a folio_batch")
    Closes: https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/14809
    Cc: Matthew Wilcox (Oracle) <[email protected]>
    Cc: Andrew Morton <[email protected]>
    Cc: [email protected] # v6.5+
    Signed-off-by: Janusz Krzysztofik <[email protected]>
    Reviewed-by: Andi Shyti <[email protected]>
    Signed-off-by: Andi Shyti <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    (cherry picked from commit 06249b4e691a75694c014a61708c007fb5755f60)
    Signed-off-by: Tvrtko Ursulin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/imagination: Fix deadlock in soft reset sequence [+ + +]

Author: Alessio Belle <[email protected]>
Date:   Mon Mar 9 15:23:48 2026 +0000

    drm/imagination: Fix deadlock in soft reset sequence
    
    commit a55c2a5c8d680156495b7b1e2a9f5a3e313ba524 upstream.
    
    The soft reset sequence is currently executed from the threaded IRQ
    handler, hence it cannot call disable_irq() which internally waits
    for IRQ handlers, i.e. itself, to complete.
    
    Use disable_irq_nosync() during a soft reset instead.
    
    Fixes: cc1aeedb98ad ("drm/imagination: Implement firmware infrastructure and META FW support")
    Cc: [email protected]
    Signed-off-by: Alessio Belle <[email protected]>
    Reviewed-by: Matt Coster <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Matt Coster <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/msm/dsi: fix hdisplay calculation when programming dsi registers [+ + +]

Author: Pengyu Luo <[email protected]>
Date:   Sat Feb 14 18:51:28 2026 +0800

    drm/msm/dsi: fix hdisplay calculation when programming dsi registers
    
    [ Upstream commit ac47870fd795549f03d57e0879fc730c79119f4b ]
    
    Recently, the hdisplay calculation is working for 3:1 compressed ratio
    only. If we have a video panel with DSC BPP = 8, and BPC = 10, we still
    use the default bits_per_pclk = 24, then we get the wrong hdisplay. We
    can draw the conclusion by cross-comparing the calculation with the
    calculation in dsi_adjust_pclk_for_compression().
    
    Since CMD mode does not use this, we can remove
    !(msm_host->mode_flags & MIPI_DSI_MODE_VIDEO) safely.
    
    Fixes: efcbd6f9cdeb ("drm/msm/dsi: Enable widebus for DSI")
    Signed-off-by: Pengyu Luo <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Patchwork: https://patchwork.freedesktop.org/patch/704822/
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Dmitry Baryshkov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/msm/dsi: fix pclk rate calculation for bonded dsi [+ + +]

Author: Pengyu Luo <[email protected]>
Date:   Sat Mar 7 00:32:38 2026 +0800

    drm/msm/dsi: fix pclk rate calculation for bonded dsi
    
    [ Upstream commit e4eb11b34d6c84f398d8f08d7cb4d6c38e739dd2 ]
    
    Recently, we round up new_hdisplay once at most, for bonded dsi, we
    may need twice, since they are independent links, we should round up
    each half separately. This also aligns with the hdisplay we program
    later in dsi_timing_setup()
    
    Example:
            full_hdisplay = 1904, dsc_bpp = 8, bpc = 8
            new_full_hdisplay = DIV_ROUND_UP(1904 * 8, 8 * 3) = 635
    
    if we use half display
            new_half_hdisplay = DIV_ROUND_UP(952 * 8, 8 * 3) = 318
            new_full_display = 636
    
    Fixes: 7c9e4a554d4a ("drm/msm/dsi: Reduce pclk rate for compression")
    Signed-off-by: Pengyu Luo <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Patchwork: https://patchwork.freedesktop.org/patch/709716/
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Dmitry Baryshkov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/msm: Fix dma_free_attrs() buffer size [+ + +]

Author: Thomas Fourier <[email protected]>
Date:   Thu Feb 26 10:57:11 2026 +0100

    drm/msm: Fix dma_free_attrs() buffer size
    
    commit e4eb6e4dd6348dd00e19c2275e3fbaed304ca3bd upstream.
    
    The gpummu->table buffer is alloc'd with size TABLE_SIZE + 32 in
    a2xx_gpummu_new() but freed with size TABLE_SIZE in
    a2xx_gpummu_destroy().
    
    Change the free size to match the allocation.
    
    Fixes: c2052a4e5c99 ("drm/msm: implement a2xx mmu")
    Cc: <[email protected]>
    Signed-off-by: Thomas Fourier <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Patchwork: https://patchwork.freedesktop.org/patch/707340/
    Message-ID: <[email protected]>
    Signed-off-by: Rob Clark <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/radeon: apply state adjust rules to some additional HAINAN vairants [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Mon Sep 25 10:44:06 2023 -0400

    drm/radeon: apply state adjust rules to some additional HAINAN vairants
    
    commit 86650ee2241ff84207eaa298ab318533f3c21a38 upstream.
    
    They need a similar workaround.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/1839
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 87327658c848f56eac166cb382b57b83bf06c5ac)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/sitronix/st7586: fix bad pixel data due to byte swap [+ + +]

Author: David Lechner <[email protected]>
Date:   Sat Feb 28 22:30:30 2026 -0600

    drm/sitronix/st7586: fix bad pixel data due to byte swap
    
    [ Upstream commit 46d8a07b4ae262e2fec6ce2aa454e06243661265 ]
    
    Correctly set dbi->write_memory_bpw for the ST7586 driver. This driver
    is for a monochrome display that has an unusual data format, so the
    default value set in mipi_dbi_spi_init() is not correct simply because
    this controller is non-standard.
    
    Previously, we were using dbi->swap_bytes to make the same sort of
    workaround, but it was removed in the same commit that added
    dbi->write_memory_bpw, so we need to use the latter now to have the
    correct behavior.
    
    This fixes every 3 columns of pixels being swapped on the display. There
    are 3 pixels per byte, so the byte swap caused this effect.
    
    Fixes: df3fb27a74a4 ("drm/mipi-dbi: Make bits per word configurable for pixel transfers")
    Acked-by: Thomas Zimmermann <[email protected]>
    Reviewed-by: Javier Martinez Canillas <[email protected]>
    Signed-off-by: David Lechner <[email protected]>
    Link: https://patch.msgid.link/20260228-drm-mipi-dbi-fix-st7586-byte-swap-v1-1-e78f6c24cd28@baylibre.com
    Signed-off-by: Sasha Levin <[email protected]>

drm/xe/oa: Allow reading after disabling OA stream [+ + +]

Author: Ashutosh Dixit <[email protected]>
Date:   Thu Mar 12 22:36:30 2026 -0700

    drm/xe/oa: Allow reading after disabling OA stream
    
    commit 9be6fd9fbd2032b683e51374497768af9aaa228a upstream.
    
    Some OA data might be present in the OA buffer when OA stream is
    disabled. Allow UMD's to retrieve this data, so that all data till the
    point when OA stream is disabled can be retrieved.
    
    v2: Update tail pointer after disable (Umesh)
    
    Fixes: efb315d0a013 ("drm/xe/oa/uapi: Read file_operation")
    Cc: [email protected]
    Signed-off-by: Ashutosh Dixit <[email protected]>
    Reviewed-by: Umesh Nerlige Ramappa<[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 4ff57c5e8dbba23b5457be12f9709d5c016da16e)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe/sync: Cleanup partially initialized sync on parse failure [+ + +]

Author: Shuicheng Lin <[email protected]>
Date:   Thu Feb 19 23:35:18 2026 +0000

    drm/xe/sync: Cleanup partially initialized sync on parse failure
    
    commit 1bfd7575092420ba5a0b944953c95b74a5646ff8 upstream.
    
    xe_sync_entry_parse() can allocate references (syncobj, fence, chain fence,
    or user fence) before hitting a later failure path. Several of those paths
    returned directly, leaving partially initialized state and leaking refs.
    
    Route these error paths through a common free_sync label and call
    xe_sync_entry_cleanup(sync) before returning the error.
    
    Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
    Cc: Matthew Brost <[email protected]>
    Signed-off-by: Shuicheng Lin <[email protected]>
    Reviewed-by: Matthew Brost <[email protected]>
    Signed-off-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit f939bdd9207a5d1fc55cced5459858480686ce22)
    Cc: [email protected]
    Signed-off-by: Rodrigo Vivi <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe: Open-code GGTT MMIO access protection [+ + +]

Author: Matthew Brost <[email protected]>
Date:   Tue Mar 10 18:50:39 2026 -0400

    drm/xe: Open-code GGTT MMIO access protection
    
    commit 01f2557aa684e514005541e71a3d01f4cd45c170 upstream.
    
    GGTT MMIO access is currently protected by hotplug (drm_dev_enter),
    which works correctly when the driver loads successfully and is later
    unbound or unloaded. However, if driver load fails, this protection is
    insufficient because drm_dev_unplug() is never called.
    
    Additionally, devm release functions cannot guarantee that all BOs with
    GGTT mappings are destroyed before the GGTT MMIO region is removed, as
    some BOs may be freed asynchronously by worker threads.
    
    To address this, introduce an open-coded flag, protected by the GGTT
    lock, that guards GGTT MMIO access. The flag is cleared during the
    dev_fini_ggtt devm release function to ensure MMIO access is disabled
    once teardown begins.
    
    Cc: [email protected]
    Fixes: 919bb54e989c ("drm/xe: Fix missing runtime outer protection for ggtt_remove_node")
    Reviewed-by: Zhanjun Dong <[email protected]>
    Signed-off-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 4f3a998a173b4325c2efd90bdadc6ccd3ad9a431)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm: Fix use-after-free on framebuffers and property blobs when calling drm_dev_unplug [+ + +]

Author: Maarten Lankhorst <[email protected]>
Date:   Fri Mar 13 16:17:27 2026 +0100

    drm: Fix use-after-free on framebuffers and property blobs when calling drm_dev_unplug
    
    commit 6bee098b91417654703e17eb5c1822c6dfd0c01d upstream.
    
    When trying to do a rather aggressive test of igt's "xe_module_load
    --r reload" with a full desktop environment and game running I noticed
    a few OOPSes when dereferencing freed pointers, related to
    framebuffers and property blobs after the compositor exits.
    
    Solve this by guarding the freeing in drm_file with drm_dev_enter/exit,
    and immediately put the references from struct drm_file objects during
    drm_dev_unplug().
    
    Related warnings for framebuffers on the subtest:
    [  739.713076] ------------[ cut here ]------------
                   WARN_ON(!list_empty(&dev->mode_config.fb_list))
    [  739.713079] WARNING: drivers/gpu/drm/drm_mode_config.c:584 at drm_mode_config_cleanup+0x30b/0x320 [drm], CPU#12: xe_module_load/13145
    ....
    [  739.713328] Call Trace:
    [  739.713330]  <TASK>
    [  739.713335]  ? intel_pmdemand_destroy_state+0x11/0x20 [xe]
    [  739.713574]  ? intel_atomic_global_obj_cleanup+0xe4/0x1a0 [xe]
    [  739.713794]  intel_display_driver_remove_noirq+0x51/0xb0 [xe]
    [  739.714041]  xe_display_fini_early+0x33/0x50 [xe]
    [  739.714284]  devm_action_release+0xf/0x20
    [  739.714294]  devres_release_all+0xad/0xf0
    [  739.714301]  device_unbind_cleanup+0x12/0xa0
    [  739.714305]  device_release_driver_internal+0x1b7/0x210
    [  739.714311]  device_driver_detach+0x14/0x20
    [  739.714315]  unbind_store+0xa6/0xb0
    [  739.714319]  drv_attr_store+0x21/0x30
    [  739.714322]  sysfs_kf_write+0x48/0x60
    [  739.714328]  kernfs_fop_write_iter+0x16b/0x240
    [  739.714333]  vfs_write+0x266/0x520
    [  739.714341]  ksys_write+0x72/0xe0
    [  739.714345]  __x64_sys_write+0x19/0x20
    [  739.714347]  x64_sys_call+0xa15/0xa30
    [  739.714355]  do_syscall_64+0xd8/0xab0
    [  739.714361]  entry_SYSCALL_64_after_hwframe+0x4b/0x53
    
    and
    
    [  739.714459] ------------[ cut here ]------------
    [  739.714461] xe 0000:67:00.0: [drm] drm_WARN_ON(!list_empty(&fb->filp_head))
    [  739.714464] WARNING: drivers/gpu/drm/drm_framebuffer.c:833 at drm_framebuffer_free+0x6c/0x90 [drm], CPU#12: xe_module_load/13145
    [  739.714715] RIP: 0010:drm_framebuffer_free+0x7a/0x90 [drm]
    ...
    [  739.714869] Call Trace:
    [  739.714871]  <TASK>
    [  739.714876]  drm_mode_config_cleanup+0x26a/0x320 [drm]
    [  739.714998]  ? __drm_printfn_seq_file+0x20/0x20 [drm]
    [  739.715115]  ? drm_mode_config_cleanup+0x207/0x320 [drm]
    [  739.715235]  intel_display_driver_remove_noirq+0x51/0xb0 [xe]
    [  739.715576]  xe_display_fini_early+0x33/0x50 [xe]
    [  739.715821]  devm_action_release+0xf/0x20
    [  739.715828]  devres_release_all+0xad/0xf0
    [  739.715843]  device_unbind_cleanup+0x12/0xa0
    [  739.715850]  device_release_driver_internal+0x1b7/0x210
    [  739.715856]  device_driver_detach+0x14/0x20
    [  739.715860]  unbind_store+0xa6/0xb0
    [  739.715865]  drv_attr_store+0x21/0x30
    [  739.715868]  sysfs_kf_write+0x48/0x60
    [  739.715873]  kernfs_fop_write_iter+0x16b/0x240
    [  739.715878]  vfs_write+0x266/0x520
    [  739.715886]  ksys_write+0x72/0xe0
    [  739.715890]  __x64_sys_write+0x19/0x20
    [  739.715893]  x64_sys_call+0xa15/0xa30
    [  739.715900]  do_syscall_64+0xd8/0xab0
    [  739.715905]  entry_SYSCALL_64_after_hwframe+0x4b/0x53
    
    and then finally file close blows up:
    
    [  743.186530] Oops: general protection fault, probably for non-canonical address 0xdead000000000122: 0000 [#1] SMP
    [  743.186535] CPU: 3 UID: 1000 PID: 3453 Comm: kwin_wayland Tainted: G        W           7.0.0-rc1-valkyria+ #110 PREEMPT_{RT,(lazy)}
    [  743.186537] Tainted: [W]=WARN
    [  743.186538] Hardware name: Gigabyte Technology Co., Ltd. X299 AORUS Gaming 3/X299 AORUS Gaming 3-CF, BIOS F8n 12/06/2021
    [  743.186539] RIP: 0010:drm_framebuffer_cleanup+0x55/0xc0 [drm]
    [  743.186588] Code: d8 72 73 0f b6 42 05 ff c3 39 c3 72 e8 49 8d bd 50 07 00 00 31 f6 e8 3a 80 d3 e1 49 8b 44 24 10 49 8d 7c 24 08 49 8b 54 24 08 <48> 3b 38 0f 85 95 7f 02 00 48 3b 7a 08 0f 85 8b 7f 02 00 48 89 42
    [  743.186589] RSP: 0018:ffffc900085e3cf8 EFLAGS: 00010202
    [  743.186591] RAX: dead000000000122 RBX: 0000000000000001 RCX: ffffffff8217ed03
    [  743.186592] RDX: dead000000000100 RSI: 0000000000000000 RDI: ffff88814675ba08
    [  743.186593] RBP: ffffc900085e3d10 R08: 0000000000000000 R09: 0000000000000000
    [  743.186593] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88814675ba00
    [  743.186594] R13: ffff88810d778000 R14: ffff888119f6dca0 R15: ffff88810c660bb0
    [  743.186595] FS:  00007ff377d21280(0000) GS:ffff888cec3f8000(0000) knlGS:0000000000000000
    [  743.186596] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [  743.186596] CR2: 000055690b55e000 CR3: 0000000113586003 CR4: 00000000003706f0
    [  743.186597] Call Trace:
    [  743.186598]  <TASK>
    [  743.186603]  intel_user_framebuffer_destroy+0x12/0x90 [xe]
    [  743.186722]  drm_framebuffer_free+0x3a/0x90 [drm]
    [  743.186750]  ? trace_hardirqs_on+0x5f/0x120
    [  743.186754]  drm_mode_object_put+0x51/0x70 [drm]
    [  743.186786]  drm_fb_release+0x105/0x190 [drm]
    [  743.186812]  ? rt_mutex_slowunlock+0x3aa/0x410
    [  743.186817]  ? rt_spin_lock+0xea/0x1b0
    [  743.186819]  drm_file_free+0x1e0/0x2c0 [drm]
    [  743.186843]  drm_release_noglobal+0x91/0xf0 [drm]
    [  743.186865]  __fput+0x100/0x2e0
    [  743.186869]  fput_close_sync+0x40/0xa0
    [  743.186870]  __x64_sys_close+0x3e/0x80
    [  743.186873]  x64_sys_call+0xa07/0xa30
    [  743.186879]  do_syscall_64+0xd8/0xab0
    [  743.186881]  entry_SYSCALL_64_after_hwframe+0x4b/0x53
    [  743.186882] RIP: 0033:0x7ff37e567732
    [  743.186884] Code: 08 0f 85 a1 38 ff ff 49 89 fb 48 89 f0 48 89 d7 48 89 ce 4c 89 c2 4d 89 ca 4c 8b 44 24 08 4c 8b 4c 24 10 4c 89 5c 24 08 0f 05 <c3> 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 f3 0f 1e fa 55 bf 01 00
    [  743.186885] RSP: 002b:00007ffc818169a8 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
    [  743.186886] RAX: ffffffffffffffda RBX: 00007ffc81816a30 RCX: 00007ff37e567732
    [  743.186887] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000012
    [  743.186888] RBP: 00007ffc818169d0 R08: 0000000000000000 R09: 0000000000000000
    [  743.186889] R10: 0000000000000000 R11: 0000000000000246 R12: 000055d60a7996e0
    [  743.186889] R13: 00007ffc81816a90 R14: 00007ffc81816a90 R15: 000055d60a782a30
    [  743.186892]  </TASK>
    [  743.186893] Modules linked in: rfcomm snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_addrtype nft_compat x_tables nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables overlay cfg80211 bnep mtd_intel_dg snd_hda_codec_intelhdmi mtd snd_hda_codec_hdmi nls_utf8 mxm_wmi intel_wmi_thunderbolt gigabyte_wmi wmi_bmof xe drm_gpuvm drm_gpusvm_helper i2c_algo_bit drm_buddy drm_ttm_helper ttm video drm_suballoc_helper gpu_sched drm_client_lib drm_exec drm_display_helper cec drm_kunit_helpers drm_kms_helper kunit x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_codec_alc882 snd_hda_codec_realtek_lib snd_hda_codec_generic snd_hda_intel snd_soc_avs snd_soc_hda_codec snd_hda_ext_core snd_hda_codec snd_hwdep snd_hda_core snd_intel_dspcfg snd_soc_core snd_compress ac97_bus snd_pcm snd_seq snd_seq_device snd_timer i2c_i801 i2c_mux snd i2c_smbus btusb btrtl btbcm btmtk btintel bluetooth ecdh_generic rfkill ecc mei_me mei ioatdma dca wmi nfsd drm i2c_dev fuse nfnetlink
    [  743.186938] ---[ end trace 0000000000000000 ]---
    
    And for property blobs:
    
    void drm_mode_config_cleanup(struct drm_device *dev)
    {
    ...
            list_for_each_entry_safe(blob, bt, &dev->mode_config.property_blob_list,
                                     head_global) {
                    drm_property_blob_put(blob);
            }
    
    Resulting in:
    
    [  371.072940] BUG: unable to handle page fault for address: 000001ffffffffff
    [  371.072944] #PF: supervisor read access in kernel mode
    [  371.072945] #PF: error_code(0x0000) - not-present page
    [  371.072947] PGD 0 P4D 0
    [  371.072950] Oops: Oops: 0000 [#1] SMP
    [  371.072953] CPU: 0 UID: 1000 PID: 3693 Comm: kwin_wayland Not tainted 7.0.0-rc1-valkyria+ #111 PREEMPT_{RT,(lazy)}
    [  371.072956] Hardware name: Gigabyte Technology Co., Ltd. X299 AORUS Gaming 3/X299 AORUS Gaming 3-CF, BIOS F8n 12/06/2021
    [  371.072957] RIP: 0010:drm_property_destroy_user_blobs+0x3b/0x90 [drm]
    [  371.073019] Code: 00 00 48 83 ec 10 48 8b 86 30 01 00 00 48 39 c3 74 59 48 89 c2 48 8d 48 c8 48 8b 00 4c 8d 60 c8 eb 04 4c 8d 60 c8 48 8b 71 40 <48> 39 16 0f 85 39 32 01 00 48 3b 50 08 0f 85 2f 32 01 00 48 89 70
    [  371.073021] RSP: 0018:ffffc90006a73de8 EFLAGS: 00010293
    [  371.073022] RAX: 000001ffffffffff RBX: ffff888118a1a930 RCX: ffff8881b92355c0
    [  371.073024] RDX: ffff8881b92355f8 RSI: 000001ffffffffff RDI: ffff888118be4000
    [  371.073025] RBP: ffffc90006a73e08 R08: ffff8881009b7300 R09: ffff888cecc5b000
    [  371.073026] R10: ffffc90006a73e90 R11: 0000000000000002 R12: 000001ffffffffc7
    [  371.073027] R13: ffff888118a1a980 R14: ffff88810b366d20 R15: ffff888118a1a970
    [  371.073028] FS:  00007f1faccbb280(0000) GS:ffff888cec2db000(0000) knlGS:0000000000000000
    [  371.073029] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [  371.073030] CR2: 000001ffffffffff CR3: 000000010655c001 CR4: 00000000003706f0
    [  371.073031] Call Trace:
    [  371.073033]  <TASK>
    [  371.073036]  drm_file_free+0x1df/0x2a0 [drm]
    [  371.073077]  drm_release_noglobal+0x7a/0xe0 [drm]
    [  371.073113]  __fput+0xe2/0x2b0
    [  371.073118]  fput_close_sync+0x40/0xa0
    [  371.073119]  __x64_sys_close+0x3e/0x80
    [  371.073122]  x64_sys_call+0xa07/0xa30
    [  371.073126]  do_syscall_64+0xc0/0x840
    [  371.073130]  entry_SYSCALL_64_after_hwframe+0x4b/0x53
    [  371.073132] RIP: 0033:0x7f1fb3501732
    [  371.073133] Code: 08 0f 85 a1 38 ff ff 49 89 fb 48 89 f0 48 89 d7 48 89 ce 4c 89 c2 4d 89 ca 4c 8b 44 24 08 4c 8b 4c 24 10 4c 89 5c 24 08 0f 05 <c3> 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 f3 0f 1e fa 55 bf 01 00
    [  371.073135] RSP: 002b:00007ffe8e6f0278 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
    [  371.073136] RAX: ffffffffffffffda RBX: 00007ffe8e6f0300 RCX: 00007f1fb3501732
    [  371.073137] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000012
    [  371.073138] RBP: 00007ffe8e6f02a0 R08: 0000000000000000 R09: 0000000000000000
    [  371.073139] R10: 0000000000000000 R11: 0000000000000246 R12: 00005585ba46eea0
    [  371.073140] R13: 00007ffe8e6f0360 R14: 00007ffe8e6f0360 R15: 00005585ba458a30
    [  371.073143]  </TASK>
    [  371.073144] Modules linked in: rfcomm snd_hrtimer xt_addrtype xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat x_tables nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables overlay cfg80211 bnep snd_hda_codec_intelhdmi snd_hda_codec_hdmi mtd_intel_dg mtd nls_utf8 wmi_bmof mxm_wmi gigabyte_wmi intel_wmi_thunderbolt xe drm_gpuvm drm_gpusvm_helper i2c_algo_bit drm_buddy drm_ttm_helper ttm video drm_suballoc_helper gpu_sched drm_client_lib drm_exec drm_display_helper cec drm_kunit_helpers drm_kms_helper kunit x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_codec_alc882 snd_hda_codec_realtek_lib snd_hda_codec_generic snd_hda_intel snd_soc_avs snd_soc_hda_codec snd_hda_ext_core snd_hda_codec snd_hwdep snd_hda_core snd_intel_dspcfg snd_soc_core snd_compress ac97_bus snd_pcm snd_seq snd_seq_device snd_timer i2c_i801 btusb i2c_mux i2c_smbus btrtl snd btbcm btmtk btintel bluetooth ecdh_generic rfkill ecc mei_me mei ioatdma dca wmi nfsd drm i2c_dev fuse nfnetlink
    [  371.073198] CR2: 000001ffffffffff
    [  371.073199] ---[ end trace 0000000000000000 ]---
    
    Add a guard around file close, and ensure the warnings from drm_mode_config
    do not trigger. Fix those by allowing an open reference to the file descriptor
    and cleaning up the file linked list entry in drm_mode_config_cleanup().
    
    Cc: <[email protected]> # v4.18+
    Fixes: bee330f3d672 ("drm: Use srcu to protect drm_device.unplugged")
    Cc: Thomas Hellström <[email protected]>
    Reviewed-by: Thomas Hellström <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Maarten Lankhorst <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dst: fix races in rt6_uncached_list_del() and rt_del_uncached_list() [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Tue Feb 24 14:31:16 2026 +0800

    dst: fix races in rt6_uncached_list_del() and rt_del_uncached_list()
    
    [ Upstream commit 9a6f0c4d5796ab89b5a28a890ce542344d58bd69 ]
    
    syzbot was able to crash the kernel in rt6_uncached_list_flush_dev()
    in an interesting way [1]
    
    Crash happens in list_del_init()/INIT_LIST_HEAD() while writing
    list->prev, while the prior write on list->next went well.
    
    static inline void INIT_LIST_HEAD(struct list_head *list)
    {
            WRITE_ONCE(list->next, list); // This went well
            WRITE_ONCE(list->prev, list); // Crash, @list has been freed.
    }
    
    Issue here is that rt6_uncached_list_del() did not attempt to lock
    ul->lock, as list_empty(&rt->dst.rt_uncached) returned
    true because the WRITE_ONCE(list->next, list) happened on the other CPU.
    
    We might use list_del_init_careful() and list_empty_careful(),
    or make sure rt6_uncached_list_del() always grabs the spinlock
    whenever rt->dst.rt_uncached_list has been set.
    
    A similar fix is neeed for IPv4.
    
    [1]
    
     BUG: KASAN: slab-use-after-free in INIT_LIST_HEAD include/linux/list.h:46 [inline]
     BUG: KASAN: slab-use-after-free in list_del_init include/linux/list.h:296 [inline]
     BUG: KASAN: slab-use-after-free in rt6_uncached_list_flush_dev net/ipv6/route.c:191 [inline]
     BUG: KASAN: slab-use-after-free in rt6_disable_ip+0x633/0x730 net/ipv6/route.c:5020
    Write of size 8 at addr ffff8880294cfa78 by task kworker/u8:14/3450
    
    CPU: 0 UID: 0 PID: 3450 Comm: kworker/u8:14 Tainted: G             L      syzkaller #0 PREEMPT_{RT,(full)}
    Tainted: [L]=SOFTLOCKUP
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 10/25/2025
    Workqueue: netns cleanup_net
    Call Trace:
     <TASK>
      dump_stack_lvl+0xe8/0x150 lib/dump_stack.c:120
      print_address_description mm/kasan/report.c:378 [inline]
      print_report+0xca/0x240 mm/kasan/report.c:482
      kasan_report+0x118/0x150 mm/kasan/report.c:595
      INIT_LIST_HEAD include/linux/list.h:46 [inline]
      list_del_init include/linux/list.h:296 [inline]
      rt6_uncached_list_flush_dev net/ipv6/route.c:191 [inline]
      rt6_disable_ip+0x633/0x730 net/ipv6/route.c:5020
      addrconf_ifdown+0x143/0x18a0 net/ipv6/addrconf.c:3853
     addrconf_notify+0x1bc/0x1050 net/ipv6/addrconf.c:-1
      notifier_call_chain+0x19d/0x3a0 kernel/notifier.c:85
      call_netdevice_notifiers_extack net/core/dev.c:2268 [inline]
      call_netdevice_notifiers net/core/dev.c:2282 [inline]
      netif_close_many+0x29c/0x410 net/core/dev.c:1785
      unregister_netdevice_many_notify+0xb50/0x2330 net/core/dev.c:12353
      ops_exit_rtnl_list net/core/net_namespace.c:187 [inline]
      ops_undo_list+0x3dc/0x990 net/core/net_namespace.c:248
      cleanup_net+0x4de/0x7b0 net/core/net_namespace.c:696
      process_one_work kernel/workqueue.c:3257 [inline]
      process_scheduled_works+0xad1/0x1770 kernel/workqueue.c:3340
      worker_thread+0x8a0/0xda0 kernel/workqueue.c:3421
      kthread+0x711/0x8a0 kernel/kthread.c:463
      ret_from_fork+0x510/0xa50 arch/x86/kernel/process.c:158
      ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
     </TASK>
    
    Allocated by task 803:
      kasan_save_stack mm/kasan/common.c:57 [inline]
      kasan_save_track+0x3e/0x80 mm/kasan/common.c:78
      unpoison_slab_object mm/kasan/common.c:340 [inline]
      __kasan_slab_alloc+0x6c/0x80 mm/kasan/common.c:366
      kasan_slab_alloc include/linux/kasan.h:253 [inline]
      slab_post_alloc_hook mm/slub.c:4953 [inline]
      slab_alloc_node mm/slub.c:5263 [inline]
      kmem_cache_alloc_noprof+0x18d/0x6c0 mm/slub.c:5270
      dst_alloc+0x105/0x170 net/core/dst.c:89
      ip6_dst_alloc net/ipv6/route.c:342 [inline]
      icmp6_dst_alloc+0x75/0x460 net/ipv6/route.c:3333
      mld_sendpack+0x683/0xe60 net/ipv6/mcast.c:1844
      mld_send_cr net/ipv6/mcast.c:2154 [inline]
      mld_ifc_work+0x83e/0xd60 net/ipv6/mcast.c:2693
      process_one_work kernel/workqueue.c:3257 [inline]
      process_scheduled_works+0xad1/0x1770 kernel/workqueue.c:3340
      worker_thread+0x8a0/0xda0 kernel/workqueue.c:3421
      kthread+0x711/0x8a0 kernel/kthread.c:463
      ret_from_fork+0x510/0xa50 arch/x86/kernel/process.c:158
      ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
    
    Freed by task 20:
      kasan_save_stack mm/kasan/common.c:57 [inline]
      kasan_save_track+0x3e/0x80 mm/kasan/common.c:78
      kasan_save_free_info+0x46/0x50 mm/kasan/generic.c:584
      poison_slab_object mm/kasan/common.c:253 [inline]
      __kasan_slab_free+0x5c/0x80 mm/kasan/common.c:285
      kasan_slab_free include/linux/kasan.h:235 [inline]
      slab_free_hook mm/slub.c:2540 [inline]
      slab_free mm/slub.c:6670 [inline]
      kmem_cache_free+0x18f/0x8d0 mm/slub.c:6781
      dst_destroy+0x235/0x350 net/core/dst.c:121
      rcu_do_batch kernel/rcu/tree.c:2605 [inline]
      rcu_core kernel/rcu/tree.c:2857 [inline]
      rcu_cpu_kthread+0xba5/0x1af0 kernel/rcu/tree.c:2945
      smpboot_thread_fn+0x542/0xa60 kernel/smpboot.c:160
      kthread+0x711/0x8a0 kernel/kthread.c:463
      ret_from_fork+0x510/0xa50 arch/x86/kernel/process.c:158
      ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
    
    Last potentially related work creation:
      kasan_save_stack+0x3e/0x60 mm/kasan/common.c:57
      kasan_record_aux_stack+0xbd/0xd0 mm/kasan/generic.c:556
      __call_rcu_common kernel/rcu/tree.c:3119 [inline]
      call_rcu+0xee/0x890 kernel/rcu/tree.c:3239
      refdst_drop include/net/dst.h:266 [inline]
      skb_dst_drop include/net/dst.h:278 [inline]
      skb_release_head_state+0x71/0x360 net/core/skbuff.c:1156
      skb_release_all net/core/skbuff.c:1180 [inline]
      __kfree_skb net/core/skbuff.c:1196 [inline]
      sk_skb_reason_drop+0xe9/0x170 net/core/skbuff.c:1234
      kfree_skb_reason include/linux/skbuff.h:1322 [inline]
      tcf_kfree_skb_list include/net/sch_generic.h:1127 [inline]
      __dev_xmit_skb net/core/dev.c:4260 [inline]
      __dev_queue_xmit+0x26aa/0x3210 net/core/dev.c:4785
      NF_HOOK_COND include/linux/netfilter.h:307 [inline]
      ip6_output+0x340/0x550 net/ipv6/ip6_output.c:247
      NF_HOOK+0x9e/0x380 include/linux/netfilter.h:318
      mld_sendpack+0x8d4/0xe60 net/ipv6/mcast.c:1855
      mld_send_cr net/ipv6/mcast.c:2154 [inline]
      mld_ifc_work+0x83e/0xd60 net/ipv6/mcast.c:2693
      process_one_work kernel/workqueue.c:3257 [inline]
      process_scheduled_works+0xad1/0x1770 kernel/workqueue.c:3340
      worker_thread+0x8a0/0xda0 kernel/workqueue.c:3421
      kthread+0x711/0x8a0 kernel/kthread.c:463
      ret_from_fork+0x510/0xa50 arch/x86/kernel/process.c:158
      ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
    
    The buggy address belongs to the object at ffff8880294cfa00
     which belongs to the cache ip6_dst_cache of size 232
    The buggy address is located 120 bytes inside of
     freed 232-byte region [ffff8880294cfa00, ffff8880294cfae8)
    
    The buggy address belongs to the physical page:
    page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x294cf
    memcg:ffff88803536b781
    flags: 0x80000000000000(node=0|zone=1)
    page_type: f5(slab)
    raw: 0080000000000000 ffff88802ff1c8c0 ffffea0000bf2bc0 dead000000000006
    raw: 0000000000000000 00000000800c000c 00000000f5000000 ffff88803536b781
    page dumped because: kasan: bad access detected
    page_owner tracks the page as allocated
    page last allocated via order 0, migratetype Unmovable, gfp_mask 0x52820(GFP_ATOMIC|__GFP_NOWARN|__GFP_NORETRY|__GFP_COMP), pid 9, tgid 9 (kworker/0:0), ts 91119585830, free_ts 91088628818
      set_page_owner include/linux/page_owner.h:32 [inline]
      post_alloc_hook+0x234/0x290 mm/page_alloc.c:1857
      prep_new_page mm/page_alloc.c:1865 [inline]
      get_page_from_freelist+0x28c0/0x2960 mm/page_alloc.c:3915
      __alloc_frozen_pages_noprof+0x181/0x370 mm/page_alloc.c:5210
      alloc_pages_mpol+0xd1/0x380 mm/mempolicy.c:2486
      alloc_slab_page mm/slub.c:3075 [inline]
      allocate_slab+0x86/0x3b0 mm/slub.c:3248
      new_slab mm/slub.c:3302 [inline]
      ___slab_alloc+0xb10/0x13e0 mm/slub.c:4656
      __slab_alloc+0xc6/0x1f0 mm/slub.c:4779
      __slab_alloc_node mm/slub.c:4855 [inline]
      slab_alloc_node mm/slub.c:5251 [inline]
      kmem_cache_alloc_noprof+0x101/0x6c0 mm/slub.c:5270
      dst_alloc+0x105/0x170 net/core/dst.c:89
      ip6_dst_alloc net/ipv6/route.c:342 [inline]
      icmp6_dst_alloc+0x75/0x460 net/ipv6/route.c:3333
      mld_sendpack+0x683/0xe60 net/ipv6/mcast.c:1844
      mld_send_cr net/ipv6/mcast.c:2154 [inline]
      mld_ifc_work+0x83e/0xd60 net/ipv6/mcast.c:2693
      process_one_work kernel/workqueue.c:3257 [inline]
      process_scheduled_works+0xad1/0x1770 kernel/workqueue.c:3340
      worker_thread+0x8a0/0xda0 kernel/workqueue.c:3421
      kthread+0x711/0x8a0 kernel/kthread.c:463
      ret_from_fork+0x510/0xa50 arch/x86/kernel/process.c:158
    page last free pid 5859 tgid 5859 stack trace:
      reset_page_owner include/linux/page_owner.h:25 [inline]
      free_pages_prepare mm/page_alloc.c:1406 [inline]
      __free_frozen_pages+0xfe1/0x1170 mm/page_alloc.c:2943
      discard_slab mm/slub.c:3346 [inline]
      __put_partials+0x149/0x170 mm/slub.c:3886
      __slab_free+0x2af/0x330 mm/slub.c:5952
      qlink_free mm/kasan/quarantine.c:163 [inline]
      qlist_free_all+0x97/0x100 mm/kasan/quarantine.c:179
      kasan_quarantine_reduce+0x148/0x160 mm/kasan/quarantine.c:286
      __kasan_slab_alloc+0x22/0x80 mm/kasan/common.c:350
      kasan_slab_alloc include/linux/kasan.h:253 [inline]
      slab_post_alloc_hook mm/slub.c:4953 [inline]
      slab_alloc_node mm/slub.c:5263 [inline]
      kmem_cache_alloc_noprof+0x18d/0x6c0 mm/slub.c:5270
      getname_flags+0xb8/0x540 fs/namei.c:146
      getname include/linux/fs.h:2498 [inline]
      do_sys_openat2+0xbc/0x200 fs/open.c:1426
      do_sys_open fs/open.c:1436 [inline]
      __do_sys_openat fs/open.c:1452 [inline]
      __se_sys_openat fs/open.c:1447 [inline]
      __x64_sys_openat+0x138/0x170 fs/open.c:1447
      do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
      do_syscall_64+0xec/0xf80 arch/x86/entry/syscall_64.c:94
    
    Fixes: 8d0b94afdca8 ("ipv6: Keep track of DST_NOCACHE routes in case of iface down/unregister")
    Fixes: 78df76a065ae ("ipv4: take rt_uncached_lock only if needed")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/netdev/[email protected]/T/#u
    Signed-off-by: Eric Dumazet <[email protected]>
    Cc: Martin KaFai Lau <[email protected]>
    Reviewed-by: David Ahern <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Rahul Sharma <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

e1000/e1000e: Fix leak in DMA error cleanup [+ + +]

Author: Matt Vollrath <[email protected]>
Date:   Tue Feb 24 18:28:33 2026 -0500

    e1000/e1000e: Fix leak in DMA error cleanup
    
    [ Upstream commit e94eaef11142b01f77bf8ba4d0b59720b7858109 ]
    
    If an error is encountered while mapping TX buffers, the driver should
    unmap any buffers already mapped for that skb.
    
    Because count is incremented after a successful mapping, it will always
    match the correct number of unmappings needed when dma_error is reached.
    Decrementing count before the while loop in dma_error causes an
    off-by-one error. If any mapping was successful before an unsuccessful
    mapping, exactly one DMA mapping would leak.
    
    In these commits, a faulty while condition caused an infinite loop in
    dma_error:
    Commit 03b1320dfcee ("e1000e: remove use of skb_dma_map from e1000e
    driver")
    Commit 602c0554d7b0 ("e1000: remove use of skb_dma_map from e1000 driver")
    
    Commit c1fa347f20f1 ("e1000/e1000e/igb/igbvf/ixgb/ixgbe: Fix tests of
    unsigned in *_tx_map()") fixed the infinite loop, but introduced the
    off-by-one error.
    
    This issue may still exist in the igbvf driver, but I did not address it
    in this patch.
    
    Fixes: c1fa347f20f1 ("e1000/e1000e/igb/igbvf/ixgb/ixgbe: Fix tests of unsigned in *_tx_map()")
    Assisted-by: Claude:claude-4.6-opus
    Signed-off-by: Matt Vollrath <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

erofs: fix inline data read failure for ztailpacking pclusters [+ + +]

Author: Gao Xiang <[email protected]>
Date:   Wed Mar 11 16:14:29 2026 +0800

    erofs: fix inline data read failure for ztailpacking pclusters
    
    [ Upstream commit c134a40f86efb8d6b5a949ef70e06d5752209be5 ]
    
    Compressed folios for ztailpacking pclusters must be valid before adding
    these pclusters to I/O chains. Otherwise, z_erofs_decompress_pcluster()
    may assume they are already valid and then trigger a NULL pointer
    dereference.
    
    It is somewhat hard to reproduce because the inline data is in the same
    block as the tail of the compressed indexes, which are usually read just
    before. However, it may still happen if a fatal signal arrives while
    read_mapping_folio() is running, as shown below:
    
     erofs: (device dm-1): z_erofs_pcluster_begin: failed to get inline data -4
     Unable to handle kernel NULL pointer dereference at virtual address 0000000000000008
    
     ...
    
     pc : z_erofs_decompress_queue+0x4c8/0xa14
     lr : z_erofs_decompress_queue+0x160/0xa14
     sp : ffffffc08b3eb3a0
     x29: ffffffc08b3eb570 x28: ffffffc08b3eb418 x27: 0000000000001000
     x26: ffffff8086ebdbb8 x25: ffffff8086ebdbb8 x24: 0000000000000001
     x23: 0000000000000008 x22: 00000000fffffffb x21: dead000000000700
     x20: 00000000000015e7 x19: ffffff808babb400 x18: ffffffc089edc098
     x17: 00000000c006287d x16: 00000000c006287d x15: 0000000000000004
     x14: ffffff80ba8f8000 x13: 0000000000000004 x12: 00000006589a77c9
     x11: 0000000000000015 x10: 0000000000000000 x9 : 0000000000000000
     x8 : 0000000000000000 x7 : 0000000000000000 x6 : 000000000000003f
     x5 : 0000000000000040 x4 : ffffffffffffffe0 x3 : 0000000000000020
     x2 : 0000000000000008 x1 : 0000000000000000 x0 : 0000000000000000
     Call trace:
      z_erofs_decompress_queue+0x4c8/0xa14
      z_erofs_runqueue+0x908/0x97c
      z_erofs_read_folio+0x128/0x228
      filemap_read_folio+0x68/0x128
      filemap_get_pages+0x44c/0x8b4
      filemap_read+0x12c/0x5b8
      generic_file_read_iter+0x4c/0x15c
      do_iter_readv_writev+0x188/0x1e0
      vfs_iter_read+0xac/0x1a4
      backing_file_read_iter+0x170/0x34c
      ovl_read_iter+0xf0/0x140
      vfs_read+0x28c/0x344
      ksys_read+0x80/0xf0
      __arm64_sys_read+0x24/0x34
      invoke_syscall+0x60/0x114
      el0_svc_common+0x88/0xe4
      do_el0_svc+0x24/0x30
      el0_svc+0x40/0xa8
      el0t_64_sync_handler+0x70/0xbc
      el0t_64_sync+0x1bc/0x1c0
    
    Fix this by reading the inline data before allocating and adding
    the pclusters to the I/O chains.
    
    Fixes: cecf864d3d76 ("erofs: support inline data decompression")
    Reported-by: Zhiguo Niu <[email protected]>
    Reviewed-and-tested-by: Zhiguo Niu <[email protected]>
    Signed-off-by: Gao Xiang <[email protected]>
    Signed-off-by: Zhiguo Niu <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: compress: change the first parameter of page_array_{alloc,free} to sbi [+ + +]

Author: Zhiguo Niu <[email protected]>
Date:   Fri Mar 6 10:38:45 2026 +0800

    f2fs: compress: change the first parameter of page_array_{alloc,free} to sbi
    
    [ Upstream commit 8e2a9b656474d67c55010f2c003ea2cf889a19ff ]
    
    No logic changes, just cleanup and prepare for fixing the UAF issue
    in f2fs_free_dic.
    
    Signed-off-by: Zhiguo Niu <[email protected]>
    Signed-off-by: Baocong Liu <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Bin Lan <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: compress: fix UAF of f2fs_inode_info in f2fs_free_dic [+ + +]

Author: Zhiguo Niu <[email protected]>
Date:   Fri Mar 6 10:38:46 2026 +0800

    f2fs: compress: fix UAF of f2fs_inode_info in f2fs_free_dic
    
    [ Upstream commit 39868685c2a94a70762bc6d77dc81d781d05bff5 ]
    
    The decompress_io_ctx may be released asynchronously after
    I/O completion. If this file is deleted immediately after read,
    and the kworker of processing post_read_wq has not been executed yet
    due to high workloads, It is possible that the inode(f2fs_inode_info)
    is evicted and freed before it is used f2fs_free_dic.
    
        The UAF case as below:
        Thread A                                      Thread B
        - f2fs_decompress_end_io
         - f2fs_put_dic
          - queue_work
            add free_dic work to post_read_wq
                                                       - do_unlink
                                                        - iput
                                                         - evict
                                                          - call_rcu
        This file is deleted after read.
    
        Thread C                                 kworker to process post_read_wq
        - rcu_do_batch
         - f2fs_free_inode
          - kmem_cache_free
         inode is freed by rcu
                                                 - process_scheduled_works
                                                  - f2fs_late_free_dic
                                                   - f2fs_free_dic
                                                    - f2fs_release_decomp_mem
                                          read (dic->inode)->i_compress_algorithm
    
    This patch store compress_algorithm and sbi in dic to avoid inode UAF.
    
    In addition, the previous solution is deprecated in [1] may cause system hang.
    [1] https://lore.kernel.org/all/[email protected]
    
    Cc: Daeho Jeong <[email protected]>
    Fixes: bff139b49d9f ("f2fs: handle decompress only post processing in softirq")
    Signed-off-by: Zhiguo Niu <[email protected]>
    Signed-off-by: Baocong Liu <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    [ Keep the original f2fs_vmalloc(workspace_size) in v6.12.y instead of
    f2fs_vmalloc(dic->sbi, workspace_size) per commit
    54ca9be0bc58 ("f2fs: introduce FAULT_VMALLOC"). ]
    Signed-off-by: Bin Lan <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix to avoid migrating empty section [+ + +]

Author: Chao Yu <[email protected]>
Date:   Thu Mar 5 16:48:03 2026 +0800

    f2fs: fix to avoid migrating empty section
    
    [ Upstream commit d625a2b08c089397d3a03bff13fa8645e4ec7a01 ]
    
    It reports a bug from device w/ zufs:
    
    F2FS-fs (dm-64): Inconsistent segment (173822) type [1, 0] in SSA and SIT
    F2FS-fs (dm-64): Stopped filesystem due to reason: 4
    
    Thread A                                Thread B
    - f2fs_expand_inode_data
     - f2fs_allocate_pinning_section
      - f2fs_gc_range
       - do_garbage_collect w/ segno #x
                                            - writepage
                                             - f2fs_allocate_data_block
                                              - new_curseg
                                               - allocate segno #x
    
    The root cause is: fallocate on pinning file may race w/ block allocation
    as above, result in do_garbage_collect() from fallocate() may migrate
    segment which is just allocated by a log, the log will update segment type
    in its in-memory structure, however GC will get segment type from on-disk
    SSA block, once segment type changes by log, we can detect such
    inconsistency, then shutdown filesystem.
    
    In this case, on-disk SSA shows type of segno #173822 is 1 (SUM_TYPE_NODE),
    however segno #173822 was just allocated as data type segment, so in-memory
    SIT shows type of segno #173822 is 0 (SUM_TYPE_DATA).
    
    Change as below to fix this issue:
    - check whether current section is empty before gc
    - add sanity checks on do_garbage_collect() to avoid any race case, result
    in migrating segment used by log.
    - btw, it fixes misc issue in printed logs: "SSA and SIT" -> "SIT and SSA".
    
    Fixes: 9703d69d9d15 ("f2fs: support file pinning for zoned devices")
    Cc: Daeho Jeong <[email protected]>
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    [ Use IS_CURSEC instead of is_cursec according to
    commit c1cfc87e49525 ("f2fs: introduce is_cur{seg,sec}()"). ]
    Signed-off-by: Robert Garcia <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fgraph: Fix thresh_return clear per-task notrace [+ + +]

Author: Shengming Hu <[email protected]>
Date:   Wed Mar 18 07:39:12 2026 -0400

    fgraph: Fix thresh_return clear per-task notrace
    
    [ Upstream commit 6ca8379b5d36e22b04e6315c3e49a6083377c862 ]
    
    When tracing_thresh is enabled, function graph tracing uses
    trace_graph_thresh_return() as the return handler. Unlike
    trace_graph_return(), it did not clear the per-task TRACE_GRAPH_NOTRACE
    flag set by the entry handler for set_graph_notrace addresses. This could
    leave the task permanently in "notrace" state and effectively disable
    function graph tracing for that task.
    
    Mirror trace_graph_return()'s per-task notrace handling by clearing
    TRACE_GRAPH_NOTRACE and returning early when set.
    
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Fixes: b84214890a9bc ("function_graph: Move graph notrace bit to shadow stack global var")
    Acked-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Shengming Hu <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

firmware: arm_ffa: Remove vm_id argument in ffa_rxtx_unmap() [+ + +]

Author: Yeoreum Yun <[email protected]>
Date:   Wed Mar 4 12:09:53 2026 +0000

    firmware: arm_ffa: Remove vm_id argument in ffa_rxtx_unmap()
    
    [ Upstream commit a4e8473b775160f3ce978f621cf8dea2c7250433 ]
    
    According to the FF-A specification (DEN0077, v1.1, §13.7), when
    FFA_RXTX_UNMAP is invoked from any instance other than non-secure
    physical, the w1 register must be zero (MBZ). If a non-zero value is
    supplied in this context, the SPMC must return FFA_INVALID_PARAMETER.
    
    The Arm FF-A driver operates exclusively as a guest or non-secure
    physical instance where the partition ID is always zero and is not
    invoked from a hypervisor context where w1 carries a VM ID. In this
    execution model, the partition ID observed by the driver is always zero,
    and passing a VM ID is unnecessary and potentially invalid.
    
    Remove the vm_id parameter from ffa_rxtx_unmap() and ensure that the
    SMC call is issued with w1 implicitly zeroed, as required by the
    specification. This prevents invalid parameter errors and aligns the
    implementation with the defined FF-A ABI behavior.
    
    Fixes: 3bbfe9871005 ("firmware: arm_ffa: Add initial Arm FFA driver support")
    Signed-off-by: Yeoreum Yun <[email protected]>
    Message-Id: <[email protected]>
    Signed-off-by: Sudeep Holla <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

firmware: arm_scpi: Fix device_node reference leak in probe path [+ + +]

Author: Felix Gu <[email protected]>
Date:   Wed Jan 21 21:08:19 2026 +0800

    firmware: arm_scpi: Fix device_node reference leak in probe path
    
    [ Upstream commit 879c001afbac3df94160334fe5117c0c83b2cf48 ]
    
    A device_node reference obtained from the device tree is not released
    on all error paths in the arm_scpi probe path. Specifically, a node
    returned by of_parse_phandle() could be leaked when the probe failed
    after the node was acquired. The probe function returns early and
    the shmem reference is not released.
    
    Use __free(device_node) scope-based cleanup to automatically release
    the reference when the variable goes out of scope.
    
    Fixes: ed7ecb883901 ("firmware: arm_scpi: Add compatibility checks for shmem node")
    Signed-off-by: Felix Gu <[email protected]>
    Message-Id: <[email protected]>
    Signed-off-by: Sudeep Holla <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Linux: Fix CC_HAS_ASM_GOTO_OUTPUT on non-x86 architectures [+ + +]

Author: Linus Torvalds <[email protected]>
Date:   Mon Sep 29 08:54:12 2025 -0700

    Fix CC_HAS_ASM_GOTO_OUTPUT on non-x86 architectures
    
    [ Upstream commit fde0ab43b9a30d08817adc5402b69fec83a61cb8 ]
    
    There's a silly problem with the CC_HAS_ASM_GOTO_OUTPUT test: even with
    a working compiler it will fail on some architectures simply because it
    uses the mnemonic "jmp" for testing the inline asm.
    
    And as reported by Geert, not all architectures use that mnemonic, so
    the test fails spuriously on such platforms (including arm and riscv,
    but also several other architectures).
    
    This issue avoided any obvious test failures because the build still
    works thanks to falling back on the old non-asm-goto code, which just
    generates worse code.
    
    Just use an empty asm statement instead.
    
    Reported-and-tested-by: Geert Uytterhoeven <[email protected]>
    Suggested-by: Peter Zijlstra <[email protected]>
    Fixes: e2ffa15b9baa ("kbuild: Disable CC_HAS_ASM_GOTO_OUTPUT on clang < 17")
    Cc: Nathan Chancellor <[email protected]>
    Cc: Thomas Gleixner <[email protected]>
    Signed-off-by: Linus Torvalds <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fs/tests: exec: Remove bad test vector [+ + +]

Author: Kees Cook <[email protected]>
Date:   Wed Mar 18 11:36:29 2026 -0700

    fs/tests: exec: Remove bad test vector
    
    [ Upstream commit c4192754e836e0ffed95833509b6ada975b74418 ]
    
    Drop an unusable test in the bprm stack limits.
    
    Reported-by: Guenter Roeck <[email protected]>
    Closes: https://lore.kernel.org/all/[email protected]
    Fixes: 60371f43e56b ("exec: Add KUnit test for bprm_stack_limits()")
    Signed-off-by: Kees Cook <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

gve: fix incorrect buffer cleanup in gve_tx_clean_pending_packets for QPL [+ + +]

Author: Ankit Garg <[email protected]>
Date:   Mon Mar 9 08:47:50 2026 -0400

    gve: fix incorrect buffer cleanup in gve_tx_clean_pending_packets for QPL
    
    [ Upstream commit fb868db5f4bccd7a78219313ab2917429f715cea ]
    
    In DQ-QPL mode, gve_tx_clean_pending_packets() incorrectly uses the RDA
    buffer cleanup path. It iterates num_bufs times and attempts to unmap
    entries in the dma array.
    
    This leads to two issues:
    1. The dma array shares storage with tx_qpl_buf_ids (union).
     Interpreting buffer IDs as DMA addresses results in attempting to
     unmap incorrect memory locations.
    2. num_bufs in QPL mode (counting 2K chunks) can significantly exceed
     the size of the dma array, causing out-of-bounds access warnings
    (trace below is how we noticed this issue).
    
    UBSAN: array-index-out-of-bounds in
    drivers/net/ethernet/drivers/net/ethernet/google/gve/gve_tx_dqo.c:178:5 index 18 is out of
    range for type 'dma_addr_t[18]' (aka 'unsigned long long[18]')
    Workqueue: gve gve_service_task [gve]
    Call Trace:
    <TASK>
    dump_stack_lvl+0x33/0xa0
    __ubsan_handle_out_of_bounds+0xdc/0x110
    gve_tx_stop_ring_dqo+0x182/0x200 [gve]
    gve_close+0x1be/0x450 [gve]
    gve_reset+0x99/0x120 [gve]
    gve_service_task+0x61/0x100 [gve]
    process_scheduled_works+0x1e9/0x380
    
    Fix this by properly checking for QPL mode and delegating to
    gve_free_tx_qpl_bufs() to reclaim the buffers.
    
    Cc: [email protected]
    Fixes: a6fb8d5a8b69 ("gve: Tx path for DQO-QPL")
    Signed-off-by: Ankit Garg <[email protected]>
    Reviewed-by: Jordan Rhee <[email protected]>
    Reviewed-by: Harshitha Ramamurthy <[email protected]>
    Signed-off-by: Joshua Washington <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    [ netmem_dma_unmap_page_attrs() => dma_unmap_page() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

HID: bpf: prevent buffer overflow in hid_hw_request [+ + +]

Author: Benjamin Tissoires <[email protected]>
Date:   Fri Mar 13 08:40:25 2026 +0100

    HID: bpf: prevent buffer overflow in hid_hw_request
    
    commit 2b658c1c442ec1cd9eec5ead98d68662c40fe645 upstream.
    
    right now the returned value is considered to be always valid. However,
    when playing with HID-BPF, the return value can be arbitrary big,
    because it's the return value of dispatch_hid_bpf_raw_requests(), which
    calls the struct_ops and we have no guarantees that the value makes
    sense.
    
    Fixes: 8bd0488b5ea5 ("HID: bpf: add HID-BPF hooks for hid_hw_raw_requests")
    Cc: [email protected]
    Acked-by: Jiri Kosina <[email protected]>
    Signed-off-by: Benjamin Tissoires <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hwmon: (max6639) Fix pulses-per-revolution implementation [+ + +]

Author: Guenter Roeck <[email protected]>
Date:   Thu Mar 19 21:31:22 2026 -0700

    hwmon: (max6639) Fix pulses-per-revolution implementation
    
    [ Upstream commit e7bae9a7a5e1251ab414291f4e9304d702bb9221 ]
    
    The valid range for the pulses-per-revolution devicetree property is
    1..4. The current code checks for a range of 1..5. Fix it.
    
    Declare the variable used to retrieve pulses per revolution from
    devicetree as u32 (unsigned) to match the of_property_read_u32() API.
    
    The current code uses a postfix decrement when writing the pulses per
    resolution into the chip. This has no effect since the value is evaluated
    before it is decremented. Fix it by decrementing before evaluating the
    value.
    
    Fixes: 7506ebcd662b ("hwmon: (max6639) : Configure based on DT property")
    Cc: Naresh Solanki <[email protected]>
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hwmon: (pmbus/isl68137) Fix unchecked return value and use sysfs_emit() [+ + +]

Author: Sanman Pradhan <[email protected]>
Date:   Wed Mar 18 19:40:19 2026 +0000

    hwmon: (pmbus/isl68137) Fix unchecked return value and use sysfs_emit()
    
    commit 86259558e422b250aa6aa57163a6d759074573f5 upstream.
    
    isl68137_avs_enable_show_page() uses the return value of
    pmbus_read_byte_data() without checking for errors. If the I2C transaction
    fails, a negative error code is passed through bitwise operations,
    producing incorrect output.
    
    Add an error check to propagate the return value if it is negative.
    Additionally, modernize the callback by replacing sprintf()
    with sysfs_emit().
    
    Fixes: 038a9c3d1e424 ("hwmon: (pmbus/isl68137) Add driver for Intersil ISL68137 PWM Controller")
    Cc: [email protected]
    Signed-off-by: Sanman Pradhan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hwmon: (pmbus/mp2975) Add error check for pmbus_read_word_data() return value [+ + +]

Author: Sanman Pradhan <[email protected]>
Date:   Tue Mar 17 17:37:17 2026 +0000

    hwmon: (pmbus/mp2975) Add error check for pmbus_read_word_data() return value
    
    commit 19d4b9c8a136704d5f2544e7ac550f27918a5004 upstream.
    
    mp2973_read_word_data() XORs the return value of pmbus_read_word_data()
    with PB_STATUS_POWER_GOOD_N without first checking for errors. If the I2C
    transaction fails, a negative error code is XORed with the constant,
    producing a corrupted value that is returned as valid status data instead
    of propagating the error.
    
    Add the missing error check before modifying the return value.
    
    Fixes: acda945afb465 ("hwmon: (pmbus/mp2975) Fix PGOOD in READ_STATUS_WORD")
    Cc: [email protected]
    Signed-off-by: Sanman Pradhan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hwmon: (pmbus/q54sj108a2) fix stack overflow in debugfs read [+ + +]

Author: Sanman Pradhan <[email protected]>
Date:   Wed Mar 4 15:51:17 2026 -0800

    hwmon: (pmbus/q54sj108a2) fix stack overflow in debugfs read
    
    commit 25dd70a03b1f5f3aa71e1a5091ecd9cd2a13ee43 upstream.
    
    The q54sj108a2_debugfs_read function suffers from a stack buffer overflow
    due to incorrect arguments passed to bin2hex(). The function currently
    passes 'data' as the destination and 'data_char' as the source.
    
    Because bin2hex() converts each input byte into two hex characters, a
    32-byte block read results in 64 bytes of output. Since 'data' is only
    34 bytes (I2C_SMBUS_BLOCK_MAX + 2), this writes 30 bytes past the end
    of the buffer onto the stack.
    
    Additionally, the arguments were swapped: it was reading from the
    zero-initialized 'data_char' and writing to 'data', resulting in
    all-zero output regardless of the actual I2C read.
    
    Fix this by:
    1. Expanding 'data_char' to 66 bytes to safely hold the hex output.
    2. Correcting the bin2hex() argument order and using the actual read count.
    3. Using a pointer to select the correct output buffer for the final
       simple_read_from_buffer call.
    
    Fixes: d014538aa385 ("hwmon: (pmbus) Driver for Delta power supplies Q54SJ108A2")
    Cc: [email protected]
    Signed-off-by: Sanman Pradhan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i2c: cp2615: fix serial string NULL-deref at probe [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Mar 9 08:50:16 2026 +0100

    i2c: cp2615: fix serial string NULL-deref at probe
    
    commit aa79f996eb41e95aed85a1bd7f56bcd6a3842008 upstream.
    
    The cp2615 driver uses the USB device serial string as the i2c adapter
    name but does not make sure that the string exists.
    
    Verify that the device has a serial number before accessing it to avoid
    triggering a NULL-pointer dereference (e.g. with malicious devices).
    
    Fixes: 4a7695429ead ("i2c: cp2615: add i2c driver for Silicon Labs' CP2615 Digital Audio Bridge")
    Cc: [email protected]      # 5.13
    Cc: Bence Csókás <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Bence Csókás <[email protected]>
    Signed-off-by: Andi Shyti <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i2c: fsi: Fix a potential leak in fsi_i2c_probe() [+ + +]

Author: Christophe JAILLET <[email protected]>
Date:   Sun Mar 1 17:21:01 2026 +0100

    i2c: fsi: Fix a potential leak in fsi_i2c_probe()
    
    commit be627abcc0d5dbd5882873bd85fbc18aa3d189ed upstream.
    
    In the commit in Fixes:, when the code has been updated to use an explicit
    for loop, instead of for_each_available_child_of_node(), the assumption
    that a reference to a device_node structure would be released at each
    iteration has been broken.
    
    Now, an explicit of_node_put() is needed to release the reference.
    
    Fixes: 095561f476ab ("i2c: fsi: Create busses for all ports")
    Signed-off-by: Christophe JAILLET <[email protected]>
    Cc: <[email protected]> # v5.3+
    Signed-off-by: Andi Shyti <[email protected]>
    Link: https://lore.kernel.org/r/fd805c39f8de51edf303856103d782138a1633c8.1772382022.git.christophe.jaillet@wanadoo.fr
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i2c: pxa: defer reset on Armada 3700 when recovery is used [+ + +]

Author: Gabor Juhos <[email protected]>
Date:   Thu Feb 26 14:11:27 2026 +0100

    i2c: pxa: defer reset on Armada 3700 when recovery is used
    
    commit 78a6ee14f8b9e1c8f7c77612122444f3be8dc8cc upstream.
    
    The I2C communication is completely broken on the Armada 3700 platform
    since commit 0b01392c18b9 ("i2c: pxa: move to generic GPIO recovery").
    
    For example, on the Methode uDPU board, probing of the two onboard
    temperature sensors fails ...
    
      [    7.271713] i2c i2c-0: using pinctrl states for GPIO recovery
      [    7.277503] i2c i2c-0:  PXA I2C adapter
      [    7.282199] i2c i2c-1: using pinctrl states for GPIO recovery
      [    7.288241] i2c i2c-1:  PXA I2C adapter
      [    7.292947] sfp sfp-eth1: Host maximum power 3.0W
      [    7.299614] sfp sfp-eth0: Host maximum power 3.0W
      [    7.308178] lm75 1-0048: supply vs not found, using dummy regulator
      [   32.489631] lm75 1-0048: probe with driver lm75 failed with error -121
      [   32.496833] lm75 1-0049: supply vs not found, using dummy regulator
      [   82.890614] lm75 1-0049: probe with driver lm75 failed with error -121
    
    ... and accessing the plugged-in SFP modules also does not work:
    
      [  511.298537] sfp sfp-eth1: please wait, module slow to respond
      [  536.488530] sfp sfp-eth0: please wait, module slow to respond
      ...
      [ 1065.688536] sfp sfp-eth1: failed to read EEPROM: -EREMOTEIO
      [ 1090.888532] sfp sfp-eth0: failed to read EEPROM: -EREMOTEIO
    
    After a discussion [1], there was an attempt to fix the problem by
    reverting the offending change by commit 7b211c767121 ("Revert "i2c:
    pxa: move to generic GPIO recovery""), but that only helped to fix
    the issue in the 6.1.y stable tree. The reason behind the partial succes
    is that there was another change in commit 20cb3fce4d60 ("i2c: Set i2c
    pinctrl recovery info from it's device pinctrl") in the 6.3-rc1 cycle
    which broke things further.
    
    The cause of the problem is the same in case of both offending commits
    mentioned above. Namely, the I2C core code changes the pinctrl state to
    GPIO while running the recovery initialization code. Although the PXA
    specific initialization also does this, but the key difference is that
    it happens before the controller is getting enabled in i2c_pxa_reset(),
    whereas in the case of the generic initialization it happens after that.
    
    Change the code to reset the controller only before the first transfer
    instead of before registering the controller. This ensures that the
    controller is not enabled at the time when the generic recovery code
    performs the pinctrl state changes, thus avoids the problem described
    above.
    
    As the result this change restores the original behaviour, which in
    turn makes the I2C communication to work again as it can be seen from
    the following log:
    
      [    7.363250] i2c i2c-0: using pinctrl states for GPIO recovery
      [    7.369041] i2c i2c-0:  PXA I2C adapter
      [    7.373673] i2c i2c-1: using pinctrl states for GPIO recovery
      [    7.379742] i2c i2c-1:  PXA I2C adapter
      [    7.384506] sfp sfp-eth1: Host maximum power 3.0W
      [    7.393013] sfp sfp-eth0: Host maximum power 3.0W
      [    7.399266] lm75 1-0048: supply vs not found, using dummy regulator
      [    7.407257] hwmon hwmon0: temp1_input not attached to any thermal zone
      [    7.413863] lm75 1-0048: hwmon0: sensor 'tmp75c'
      [    7.418746] lm75 1-0049: supply vs not found, using dummy regulator
      [    7.426371] hwmon hwmon1: temp1_input not attached to any thermal zone
      [    7.432972] lm75 1-0049: hwmon1: sensor 'tmp75c'
      [    7.755092] sfp sfp-eth1: module MENTECHOPTO      POS22-LDCC-KR    rev 1.0  sn MNC208U90009     dc 200828
      [    7.764997] mvneta d0040000.ethernet eth1: unsupported SFP module: no common interface modes
      [    7.785362] sfp sfp-eth0: module Mikrotik         S-RJ01           rev 1.0  sn 61B103C55C58     dc 201022
      [    7.803426] hwmon hwmon2: temp1_input not attached to any thermal zone
    
    Link: https://lore.kernel.org/r/[email protected] #1
    
    Cc: [email protected] # 6.3+
    Fixes: 20cb3fce4d60 ("i2c: Set i2c pinctrl recovery info from it's device pinctrl")
    Signed-off-by: Gabor Juhos <[email protected]>
    Tested-by: Robert Marko <[email protected]>
    Reviewed-by: Linus Walleij <[email protected]>
    Signed-off-by: Andi Shyti <[email protected]>
    Link: https://lore.kernel.org/r/20260226-i2c-pxa-fix-i2c-communication-v4-1-797a091dae87@gmail.com
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i3c: dw-i3c-master: Set SIR_REJECT in DAT on device attach and reattach [+ + +]

Author: Adrian Ng Ho Yin <[email protected]>
Date:   Fri Feb 13 14:00:48 2026 +0800

    i3c: dw-i3c-master: Set SIR_REJECT in DAT on device attach and reattach
    
    [ Upstream commit f311a05784634febd299f03476b80f3f18489767 ]
    
    The DesignWare I3C master controller ACKs IBIs as soon as a valid
    Device Address Table (DAT) entry is present. This can create a race
    between device attachment (after DAA) and the point where the client
    driver enables IBIs via i3c_device_enable_ibi().
    
    Set DEV_ADDR_TABLE_SIR_REJECT in the DAT entry during
    attach_i3c_dev() and reattach_i3c_dev() so that IBIs are rejected
    by default. The bit is managed thereafter by the existing
    dw_i3c_master_set_sir_enabled() function, which clears it in
    enable_ibi() after ENEC is issued, and restores it in disable_ibi()
    after DISEC.
    
    Fixes: 1dd728f5d4d4 ("i3c: master: Add driver for Synopsys DesignWare IP")
    Signed-off-by: Adrian Ng Ho Yin <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Link: https://patch.msgid.link/53f5b8cbdd8af789ec38b95b02873f32f9182dd6.1770962368.git.adrianhoyin.ng@altera.com
    Signed-off-by: Alexandre Belloni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

i3c: mipi-i3c-hci: Add missing TID field to no-op command descriptor [+ + +]

Author: Adrian Hunter <[email protected]>
Date:   Fri Mar 6 09:24:46 2026 +0200

    i3c: mipi-i3c-hci: Add missing TID field to no-op command descriptor
    
    commit ec3cfd835f7c4bbd23bc9ad909d2fdc772a578bb upstream.
    
    The internal control command descriptor used for no-op commands includes a
    Transaction ID (TID) field, but the no-op command constructed in
    hci_dma_dequeue_xfer() omitted it.  As a result, the hardware receives a
    no-op descriptor without the expected TID.
    
    This bug has gone unnoticed because the TID is currently not validated in
    the no-op completion path, but the descriptor format requires it to be
    present.
    
    Add the missing TID field when generating a no-op descriptor so that its
    layout matches the defined command structure.
    
    Fixes: 9ad9a52cce282 ("i3c/master: introduce the mipi-i3c-hci driver")
    Cc: [email protected]
    Signed-off-by: Adrian Hunter <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Alexandre Belloni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i3c: mipi-i3c-hci: Restart DMA ring correctly after dequeue abort [+ + +]

Author: Adrian Hunter <[email protected]>
Date:   Fri Mar 6 09:24:47 2026 +0200

    i3c: mipi-i3c-hci: Restart DMA ring correctly after dequeue abort
    
    commit b6d586431ae20d5157ee468d0ef62ad26798ef13 upstream.
    
    The DMA dequeue path attempts to restart the ring after aborting an
    in-flight transfer, but the current sequence is incomplete. The controller
    must be brought out of the aborted state and the ring control registers
    must be programmed in the correct order: first clearing ABORT, then
    re-enabling the ring and asserting RUN_STOP to resume operation.
    
    Add the missing controller resume step and update the ring control writes
    so that the ring is restarted using the proper sequence.
    
    Fixes: 9ad9a52cce282 ("i3c/master: introduce the mipi-i3c-hci driver")
    Cc: [email protected]
    Signed-off-by: Adrian Hunter <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Alexandre Belloni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i3c: mipi-i3c-hci: Use ETIMEDOUT instead of ETIME for timeout errors [+ + +]

Author: Adrian Hunter <[email protected]>
Date:   Fri Mar 6 09:24:38 2026 +0200

    i3c: mipi-i3c-hci: Use ETIMEDOUT instead of ETIME for timeout errors
    
    commit 4167b8914463132654e01e16259847d097f8a7f7 upstream.
    
    The MIPI I3C HCI driver currently returns -ETIME for various timeout
    conditions, while other I3C master drivers consistently use -ETIMEDOUT
    for the same class of errors.  Align the HCI driver with the rest of the
    subsystem by replacing all uses of -ETIME with -ETIMEDOUT.
    
    Fixes: 9ad9a52cce282 ("i3c/master: introduce the mipi-i3c-hci driver")
    Cc: [email protected]
    Signed-off-by: Adrian Hunter <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Alexandre Belloni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i40e: fix src IP mask checks and memcpy argument names in cloud filter [+ + +]

Author: Alok Tiwari <[email protected]>
Date:   Mon Nov 10 11:13:38 2025 -0800

    i40e: fix src IP mask checks and memcpy argument names in cloud filter
    
    [ Upstream commit e809085f492842ce7a519c9ef72d40f4bca89c13 ]
    
    Fix following issues in the IPv4 and IPv6 cloud filter handling logic in
    both the add and delete paths:
    
    - The source-IP mask check incorrectly compares mask.src_ip[0] against
      tcf.dst_ip[0]. Update it to compare against tcf.src_ip[0]. This likely
      goes unnoticed because the check is in an "else if" path that only
      executes when dst_ip is not set, most cloud filter use cases focus on
      destination-IP matching, and the buggy condition can accidentally
      evaluate true in some cases.
    
    - memcpy() for the IPv4 source address incorrectly uses
      ARRAY_SIZE(tcf.dst_ip) instead of ARRAY_SIZE(tcf.src_ip), although
      both arrays are the same size.
    
    - The IPv4 memcpy operations used ARRAY_SIZE(tcf.dst_ip) and ARRAY_SIZE
      (tcf.src_ip), Update these to use sizeof(cfilter->ip.v4.dst_ip) and
      sizeof(cfilter->ip.v4.src_ip) to ensure correct and explicit copy size.
    
    - In the IPv6 delete path, memcmp() uses sizeof(src_ip6) when comparing
      dst_ip6 fields. Replace this with sizeof(dst_ip6) to make the intent
      explicit, even though both fields are struct in6_addr.
    
    Fixes: e284fc280473 ("i40e: Add and delete cloud filter")
    Signed-off-by: Alok Tiwari <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iavf: fix VLAN filter lost on add/delete race [+ + +]

Author: Petr Oros <[email protected]>
Date:   Wed Feb 25 11:01:37 2026 +0100

    iavf: fix VLAN filter lost on add/delete race
    
    [ Upstream commit fc9c69be594756b81b54c6bc40803fa6052f35ae ]
    
    When iavf_add_vlan() finds an existing filter in IAVF_VLAN_REMOVE
    state, it transitions the filter to IAVF_VLAN_ACTIVE assuming the
    pending delete can simply be cancelled. However, there is no guarantee
    that iavf_del_vlans() has not already processed the delete AQ request
    and removed the filter from the PF. In that case the filter remains in
    the driver's list as IAVF_VLAN_ACTIVE but is no longer programmed on
    the NIC. Since iavf_add_vlans() only picks up filters in
    IAVF_VLAN_ADD state, the filter is never re-added, and spoof checking
    drops all traffic for that VLAN.
    
      CPU0                       CPU1                     Workqueue
      ----                       ----                     ---------
      iavf_del_vlan(vlan 100)
        f->state = REMOVE
        schedule AQ_DEL_VLAN
                                 iavf_add_vlan(vlan 100)
                                   f->state = ACTIVE
                                                          iavf_del_vlans()
                                                            f is ACTIVE, skip
                                                          iavf_add_vlans()
                                                            f is ACTIVE, skip
    
      Filter is ACTIVE in driver but absent from NIC.
    
    Transition to IAVF_VLAN_ADD instead and schedule
    IAVF_FLAG_AQ_ADD_VLAN_FILTER so iavf_add_vlans() re-programs the
    filter.  A duplicate add is idempotent on the PF.
    
    Fixes: 0c0da0e95105 ("iavf: refactor VLAN filter states")
    Signed-off-by: Petr Oros <[email protected]>
    Tested-by: Rafal Romanowski <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ice: fix devlink reload call trace [+ + +]

Author: Paul Greenwalt <[email protected]>
Date:   Fri Feb 27 10:29:14 2026 +0800

    ice: fix devlink reload call trace
    
    [ Upstream commit d3f867e7a04678640ebcbfb81893c59f4af48586 ]
    
    Commit 4da71a77fc3b ("ice: read internal temperature sensor") introduced
    internal temperature sensor reading via HWMON. ice_hwmon_init() was added
    to ice_init_feature() and ice_hwmon_exit() was added to ice_remove(). As a
    result if devlink reload is used to reinit the device and then the driver
    is removed, a call trace can occur.
    
    BUG: unable to handle page fault for address: ffffffffc0fd4b5d
    Call Trace:
     string+0x48/0xe0
     vsnprintf+0x1f9/0x650
     sprintf+0x62/0x80
     name_show+0x1f/0x30
     dev_attr_show+0x19/0x60
    
    The call trace repeats approximately every 10 minutes when system
    monitoring tools (e.g., sadc) attempt to read the orphaned hwmon sysfs
    attributes that reference freed module memory.
    
    The sequence is:
    1. Driver load, ice_hwmon_init() gets called from ice_init_feature()
    2. Devlink reload down, flow does not call ice_remove()
    3. Devlink reload up, ice_hwmon_init() gets called from
       ice_init_feature() resulting in a second instance
    4. Driver unload, ice_hwmon_exit() called from ice_remove() leaving the
       first hwmon instance orphaned with dangling pointer
    
    Fix this by moving ice_hwmon_exit() from ice_remove() to
    ice_deinit_features() to ensure proper cleanup symmetry with
    ice_hwmon_init().
    
    Fixes: 4da71a77fc3b ("ice: read internal temperature sensor")
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Signed-off-by: Paul Greenwalt <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Tested-by: Rinitha S <[email protected]> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <[email protected]>
    [ Adjust context. The context change is irrelevant to the current patch
    logic. ]
    Signed-off-by: Wenshan Lan <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ice: fix retry for AQ command 0x06EE [+ + +]

Author: Jakub Staniszewski <[email protected]>
Date:   Tue Jan 13 20:38:17 2026 +0100

    ice: fix retry for AQ command 0x06EE
    
    commit fb4903b3354aed4a2301180cf991226f896c87ed upstream.
    
    Executing ethtool -m can fail reporting a netlink I/O error while firmware
    link management holds the i2c bus used to communicate with the module.
    
    According to Intel(R) Ethernet Controller E810 Datasheet Rev 2.8 [1]
    Section 3.3.10.4 Read/Write SFF EEPROM (0x06EE)
    request should to be retried upon receiving EBUSY from firmware.
    
    Commit e9c9692c8a81 ("ice: Reimplement module reads used by ethtool")
    implemented it only for part of ice_get_module_eeprom(), leaving all other
    calls to ice_aq_sff_eeprom() vulnerable to returning early on getting
    EBUSY without retrying.
    
    Remove the retry loop from ice_get_module_eeprom() and add Admin Queue
    (AQ) command with opcode 0x06EE to the list of commands that should be
    retried on receiving EBUSY from firmware.
    
    Cc: [email protected]
    Fixes: e9c9692c8a81 ("ice: Reimplement module reads used by ethtool")
    Signed-off-by: Jakub Staniszewski <[email protected]>
    Co-developed-by: Dawid Osuchowski <[email protected]>
    Signed-off-by: Dawid Osuchowski <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Reviewed-by: Przemek Kitszel <[email protected]>
    Link: https://www.intel.com/content/www/us/en/content-details/613875/intel-ethernet-controller-e810-datasheet.html [1]
    Reviewed-by: Paul Menzel <[email protected]>
    Tested-by: Rinitha S <[email protected]> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ice: reintroduce retry mechanism for indirect AQ [+ + +]

Author: Jakub Staniszewski <[email protected]>
Date:   Tue Jan 13 20:38:16 2026 +0100

    ice: reintroduce retry mechanism for indirect AQ
    
    commit 326256c0a72d4877cec1d4df85357da106233128 upstream.
    
    Add retry mechanism for indirect Admin Queue (AQ) commands. To do so we
    need to keep the command buffer.
    
    This technically reverts commit 43a630e37e25
    ("ice: remove unused buffer copy code in ice_sq_send_cmd_retry()"),
    but combines it with a fix in the logic by using a kmemdup() call,
    making it more robust and less likely to break in the future due to
    programmer error.
    
    Cc: Michal Schmidt <[email protected]>
    Cc: [email protected]
    Fixes: 3056df93f7a8 ("ice: Re-send some AQ commands, as result of EBUSY AQ error")
    Signed-off-by: Jakub Staniszewski <[email protected]>
    Co-developed-by: Dawid Osuchowski <[email protected]>
    Signed-off-by: Dawid Osuchowski <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Reviewed-by: Przemek Kitszel <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Tested-by: Rinitha S <[email protected]> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

icmp: fix NULL pointer dereference in icmp_tag_validation() [+ + +]

Author: Weiming Shi <[email protected]>
Date:   Wed Mar 18 21:06:01 2026 +0800

    icmp: fix NULL pointer dereference in icmp_tag_validation()
    
    [ Upstream commit 614aefe56af8e13331e50220c936fc0689cf5675 ]
    
    icmp_tag_validation() unconditionally dereferences the result of
    rcu_dereference(inet_protos[proto]) without checking for NULL.
    The inet_protos[] array is sparse -- only about 15 of 256 protocol
    numbers have registered handlers. When ip_no_pmtu_disc is set to 3
    (hardened PMTU mode) and the kernel receives an ICMP Fragmentation
    Needed error with a quoted inner IP header containing an unregistered
    protocol number, the NULL dereference causes a kernel panic in
    softirq context.
    
     Oops: general protection fault, probably for non-canonical address 0xdffffc0000000002: 0000 [#1] SMP KASAN NOPTI
     KASAN: null-ptr-deref in range [0x0000000000000010-0x0000000000000017]
     RIP: 0010:icmp_unreach (net/ipv4/icmp.c:1085 net/ipv4/icmp.c:1143)
     Call Trace:
      <IRQ>
      icmp_rcv (net/ipv4/icmp.c:1527)
      ip_protocol_deliver_rcu (net/ipv4/ip_input.c:207)
      ip_local_deliver_finish (net/ipv4/ip_input.c:242)
      ip_local_deliver (net/ipv4/ip_input.c:262)
      ip_rcv (net/ipv4/ip_input.c:573)
      __netif_receive_skb_one_core (net/core/dev.c:6164)
      process_backlog (net/core/dev.c:6628)
      handle_softirqs (kernel/softirq.c:561)
      </IRQ>
    
    Add a NULL check before accessing icmp_strict_tag_validation. If the
    protocol has no registered handler, return false since it cannot
    perform strict tag validation.
    
    Fixes: 8ed1dc44d3e9 ("ipv4: introduce hardened ip_no_pmtu_disc mode")
    Reported-by: Xiang Mei <[email protected]>
    Signed-off-by: Weiming Shi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

igc: fix missing update of skb->tail in igc_xmit_frame() [+ + +]

Author: Kohei Enju <[email protected]>
Date:   Sat Feb 14 19:46:32 2026 +0000

    igc: fix missing update of skb->tail in igc_xmit_frame()
    
    [ Upstream commit 0ffba246652faf4a36aedc66059c2f94e4c83ea5 ]
    
    igc_xmit_frame() misses updating skb->tail when the packet size is
    shorter than the minimum one.
    Use skb_put_padto() in alignment with other Intel Ethernet drivers.
    
    Fixes: 0507ef8a0372 ("igc: Add transmit and receive fastpath and interrupt handlers")
    Signed-off-by: Kohei Enju <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Tested-by: Avigail Dahan <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

igc: fix page fault in XDP TX timestamps handling [+ + +]

Author: Zdenek Bouska <[email protected]>
Date:   Wed Feb 25 10:58:29 2026 +0100

    igc: fix page fault in XDP TX timestamps handling
    
    [ Upstream commit 45b33e805bd39f615d9353a7194b2da5281332df ]
    
    If an XDP application that requested TX timestamping is shutting down
    while the link of the interface in use is still up the following kernel
    splat is reported:
    
    [  883.803618] [   T1554] BUG: unable to handle page fault for address: ffffcfb6200fd008
    ...
    [  883.803650] [   T1554] Call Trace:
    [  883.803652] [   T1554]  <TASK>
    [  883.803654] [   T1554]  igc_ptp_tx_tstamp_event+0xdf/0x160 [igc]
    [  883.803660] [   T1554]  igc_tsync_interrupt+0x2d5/0x300 [igc]
    ...
    
    During shutdown of the TX ring the xsk_meta pointers are left behind, so
    that the IRQ handler is trying to touch them.
    
    This issue is now being fixed by cleaning up the stale xsk meta data on
    TX shutdown. TX timestamps on other queues remain unaffected.
    
    Fixes: 15fd021bc427 ("igc: Add Tx hardware timestamp request for AF_XDP zero-copy packet")
    Signed-off-by: Zdenek Bouska <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Reviewed-by: Florian Bezdeka <[email protected]>
    Tested-by: Avigail Dahan <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iio: buffer: Fix wait_queue not being removed [+ + +]

Author: Nuno Sá <[email protected]>
Date:   Mon Feb 16 13:24:27 2026 +0000

    iio: buffer: Fix wait_queue not being removed
    
    commit 064234044056c93a3719d6893e6e5a26a94a61b6 upstream.
    
    In the edge case where the IIO device is unregistered while we're
    buffering, we were directly returning an error without removing the wait
    queue. Instead, set 'ret' and break out of the loop.
    
    Fixes: 9eeee3b0bf19 ("iio: Add output buffer support")
    Signed-off-by: Nuno Sá <[email protected]>
    Reviewed-by: David Lechner <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: chemical: bme680: Fix measurement wait duration calculation [+ + +]

Author: Chris Spencer <[email protected]>
Date:   Thu Feb 5 14:55:45 2026 +0000

    iio: chemical: bme680: Fix measurement wait duration calculation
    
    commit f55b9510cd9437da3a0efa08b089caeb47595ff1 upstream.
    
    This function refers to the Bosch BME680 API as the source of the
    calculation, but one of the constants does not match the Bosch
    implementation. This appears to be a simple transposition of two digits,
    resulting in a wait time that is too short. This can cause the following
    'device measurement cycle incomplete' check to occasionally fail, returning
    EBUSY to user space.
    
    Adjust the constant to match the Bosch implementation and resolve the EBUSY
    errors.
    
    Fixes: 4241665e6ea0 ("iio: chemical: bme680: Fix sensor data read operation")
    Link: https://github.com/boschsensortec/BME68x_SensorAPI/blob/v4.4.8/bme68x.c#L521
    Signed-off-by: Chris Spencer <[email protected]>
    Acked-by: Vasileios Amoiridis <[email protected]>
    Cc: [email protected]
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: chemical: sps30_i2c: fix buffer size in sps30_i2c_read_meas() [+ + +]

Author: Antoniu Miclaus <[email protected]>
Date:   Thu Feb 12 14:46:07 2026 +0200

    iio: chemical: sps30_i2c: fix buffer size in sps30_i2c_read_meas()
    
    commit 216345f98cae7fcc84f49728c67478ac00321c87 upstream.
    
    sizeof(num) evaluates to sizeof(size_t) (8 bytes on 64-bit) instead
    of the intended __be32 element size (4 bytes). Use sizeof(*meas) to
    correctly match the buffer element type.
    
    Fixes: 8f3f13085278 ("iio: sps30: separate core and interface specific code")
    Signed-off-by: Antoniu Miclaus <[email protected]>
    Acked-by: Tomasz Duszynski <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: chemical: sps30_serial: fix buffer size in sps30_serial_read_meas() [+ + +]

Author: Antoniu Miclaus <[email protected]>
Date:   Thu Feb 12 14:46:08 2026 +0200

    iio: chemical: sps30_serial: fix buffer size in sps30_serial_read_meas()
    
    commit c3914ce1963c4db25e186112c90fa5d2361e9e0a upstream.
    
    sizeof(num) evaluates to sizeof(size_t) which is 8 bytes on 64-bit,
    but the buffer elements are only 4 bytes. The same function already
    uses sizeof(*meas) on line 312, making the mismatch evident. Use
    sizeof(*meas) consistently.
    
    Fixes: b2e171f5a5c6 ("iio: sps30: add support for serial interface")
    Signed-off-by: Antoniu Miclaus <[email protected]>
    Acked-by: Tomasz Duszynski <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: dac: ds4424: reject -128 RAW value [+ + +]

Author: Oleksij Rempel <[email protected]>
Date:   Wed Feb 4 15:00:33 2026 +0100

    iio: dac: ds4424: reject -128 RAW value
    
    commit 5187e03b817c26c1c3bcb2645a612ea935c4be89 upstream.
    
    The DS442x DAC uses sign-magnitude encoding, so -128 cannot be represented
    in hardware (7-bit magnitude).
    
    Previously, passing -128 resulted in a truncated value that programmed
    0mA (magnitude 0) instead of the expected maximum negative current,
    effectively failing silently.
    
    Reject -128 to avoid producing the wrong current.
    
    Fixes: d632a2bd8ffc ("iio: dac: ds4422/ds4424 dac driver")
    Cc: [email protected]
    Signed-off-by: Oleksij Rempel <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: frequency: adf4377: Fix duplicated soft reset mask [+ + +]

Author: SeungJu Cheon <[email protected]>
Date:   Sat Jan 24 04:47:58 2026 +0900

    iio: frequency: adf4377: Fix duplicated soft reset mask
    
    commit 6c8bf4b604a8a6346ca71f1c027fa01c2c2e04cb upstream.
    
    The regmap_read_poll_timeout() uses ADF4377_0000_SOFT_RESET_R_MSK
    twice instead of checking both SOFT_RESET_MSK (bit 0) and
    SOFT_RESET_R_MSK (bit 7). This causes an incomplete reset status check.
    
    The code first sets both SOFT_RESET and SOFT_RESET_R bits to 1 via
    regmap_update_bits(), then polls for them to be cleared. Since we set
    both bits before polling, we should be waiting for both to clear.
    
    Fix by using both masks as done in regmap_update_bits() above.
    
    Fixes: eda549e2e524 ("iio: frequency: adf4377: add support for ADF4377")
    Signed-off-by: SeungJu Cheon <[email protected]>
    Cc: [email protected]
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: gyro: mpu3050-core: fix pm_runtime error handling [+ + +]

Author: Antoniu Miclaus <[email protected]>
Date:   Mon Feb 16 11:57:56 2026 +0200

    iio: gyro: mpu3050-core: fix pm_runtime error handling
    
    commit acc3949aab3e8094641a9c7c2768de1958c88378 upstream.
    
    The return value of pm_runtime_get_sync() is not checked, allowing
    the driver to access hardware that may fail to resume. The device
    usage count is also unconditionally incremented. Use
    pm_runtime_resume_and_get() which propagates errors and avoids
    incrementing the usage count on failure.
    
    In preenable, add pm_runtime_put_autosuspend() on set_8khz_samplerate()
    failure since postdisable does not run when preenable fails.
    
    Fixes: 3904b28efb2c ("iio: gyro: Add driver for the MPU-3050 gyroscope")
    Reviewed-by: Linus Walleij <[email protected]>
    Signed-off-by: Antoniu Miclaus <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: gyro: mpu3050-i2c: fix pm_runtime error handling [+ + +]

Author: Antoniu Miclaus <[email protected]>
Date:   Mon Feb 16 11:57:55 2026 +0200

    iio: gyro: mpu3050-i2c: fix pm_runtime error handling
    
    commit 91f950b4cbb1aa9ea4eb3999f1463e8044b717fb upstream.
    
    The return value of pm_runtime_get_sync() is not checked, and the
    function always returns success. This allows I2C mux operations to
    proceed even when the device fails to resume.
    
    Use pm_runtime_resume_and_get() and propagate its return value to
    properly handle resume failures.
    
    Fixes: 3904b28efb2c ("iio: gyro: Add driver for the MPU-3050 gyroscope")
    Signed-off-by: Antoniu Miclaus <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: imu: inv-mpu9150: fix irq ack preventing irq storms [+ + +]

Author: Andreas Kemnade <[email protected]>
Date:   Wed Dec 31 22:14:16 2025 +0100

    iio: imu: inv-mpu9150: fix irq ack preventing irq storms
    
    [ Upstream commit d23d763e00ace4e9c59f8d33e0713d401133ba88 ]
    
    IRQ needs to be acked. for some odd reasons, reading from irq status does
    not reliable help, enable acking from any register to be on the safe side
    and read the irq status register. Comments in the code indicate a known
    unreliability with that register.
    The blamed commit was tested with mpu6050 in lg,p895 and lg,p880 according
    to Tested-bys. But with the MPU9150 in the Epson Moverio BT-200 this leads
    to irq storms without properly acking the irq.
    
    Fixes: 0a3b517c8089 ("iio: imu: inv_mpu6050: fix interrupt status read for old buggy chips")
    Signed-off-by: Andreas Kemnade <[email protected]>
    Acked-by: Jean-Baptiste Maneyrol <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iio: imu: inv_icm42600: fix odr switch to the same value [+ + +]

Author: Jean-Baptiste Maneyrol <[email protected]>
Date:   Fri Jan 30 16:38:47 2026 +0100

    iio: imu: inv_icm42600: fix odr switch to the same value
    
    commit c9f3a593137d862d424130343e77d4b5260a4f5a upstream.
    
    ODR switch is done in 2 steps when FIFO is on : change the ODR register
    value and acknowledge change when reading the FIFO ODR change flag.
    When we are switching to the same odr value, we end up waiting for a
    FIFO ODR flag that is never happening.
    
    Fix the issue by doing nothing and exiting properly when we are
    switching to the same ODR value.
    
    Fixes: ec74ae9fd37c ("iio: imu: inv_icm42600: add accurate timestamping")
    Signed-off-by: Jean-Baptiste Maneyrol <[email protected]>
    Cc: [email protected]
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: imu: inv_icm42600: fix odr switch when turning buffer off [+ + +]

Author: Jean-Baptiste Maneyrol <[email protected]>
Date:   Fri Jan 30 17:10:23 2026 +0100

    iio: imu: inv_icm42600: fix odr switch when turning buffer off
    
    commit ffd32db8263d2d785a2c419486a450dc80693235 upstream.
    
    ODR switch is done in 2 steps when FIFO is on : change the ODR register
    value and acknowledge change when reading the FIFO ODR change flag.
    When we are switching odr and turning buffer off just afterward, we are
    losing the FIFO ODR change flag and ODR switch is blocked.
    
    Fix the issue by force applying any waiting ODR change when turning
    buffer off.
    
    Fixes: ec74ae9fd37c ("iio: imu: inv_icm42600: add accurate timestamping")
    Signed-off-by: Jean-Baptiste Maneyrol <[email protected]>
    Cc: [email protected]
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: potentiometer: mcp4131: fix double application of wiper shift [+ + +]

Author: Lukas Schmid <[email protected]>
Date:   Mon Feb 2 21:15:35 2026 +0100

    iio: potentiometer: mcp4131: fix double application of wiper shift
    
    commit 85e4614524dca6c0a43874f475a17de2b9725648 upstream.
    
    The MCP4131 wiper address is shifted twice when preparing the SPI
    command in mcp4131_write_raw().
    
    The address is already shifted when assigned to the local variable
    "address", but is then shifted again when written to data->buf[0].
    This results in an incorrect command being sent to the device and
    breaks wiper writes to the second channel.
    
    Remove the second shift and use the pre-shifted address directly
    when composing the SPI transfer.
    
    Fixes: 22d199a53910 ("iio: potentiometer: add driver for Microchip MCP413X/414X/415X/416X/423X/424X/425X/426X")
    Signed-off-by: Lukas Schmid <[email protected]>#
    Cc: <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: proximity: hx9023s: Protect against division by zero in set_samp_freq [+ + +]

Author: Yasin Lee <[email protected]>
Date:   Fri Feb 13 23:14:44 2026 +0800

    iio: proximity: hx9023s: Protect against division by zero in set_samp_freq
    
    commit a318cfc0853706f1d6ce682dba660bc455d674ef upstream.
    
    Avoid division by zero when sampling frequency is unspecified.
    
    Fixes: 60df548277b7 ("iio: proximity: Add driver support for TYHX's HX9023S capacitive proximity sensor")
    Signed-off-by: Yasin Lee <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

io_uring/kbuf: check if target buffer list is still legacy on recycle [+ + +]

Author: Jens Axboe <[email protected]>
Date:   Thu Mar 12 08:59:25 2026 -0600

    io_uring/kbuf: check if target buffer list is still legacy on recycle
    
    Commit c2c185be5c85d37215397c8e8781abf0a69bec1f upstream.
    
    There's a gap between when the buffer was grabbed and when it
    potentially gets recycled, where if the list is empty, someone could've
    upgraded it to a ring provided type. This can happen if the request
    is forced via io-wq. The legacy recycling is missing checking if the
    buffer_list still exists, and if it's of the correct type. Add those
    checks.
    
    Cc: [email protected]
    Fixes: c7fb19428d67 ("io_uring: add support for ring mapped supplied buffers")
    Reported-by: Keenan Dong <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

io_uring/kbuf: propagate BUF_MORE through early buffer commit path [+ + +]

Author: Jens Axboe <[email protected]>
Date:   Thu Mar 19 14:29:20 2026 -0600

    io_uring/kbuf: propagate BUF_MORE through early buffer commit path
    
    Commit 418eab7a6f3c002d8e64d6e95ec27118017019af upstream.
    
    When io_should_commit() returns true (eg for non-pollable files), buffer
    commit happens at buffer selection time and sel->buf_list is set to
    NULL. When __io_put_kbufs() generates CQE flags at completion time, it
    calls __io_put_kbuf_ring() which finds a NULL buffer_list and hence
    cannot determine whether the buffer was consumed or not. This means that
    IORING_CQE_F_BUF_MORE is never set for non-pollable input with
    incrementally consumed buffers.
    
    Likewise for io_buffers_select(), which always commits upfront and
    discards the return value of io_kbuf_commit().
    
    Add REQ_F_BUF_MORE to store the result of io_kbuf_commit() during early
    commit. Then __io_put_kbuf_ring() can check this flag and set
    IORING_F_BUF_MORE accordingy.
    
    Reported-by: Martin Michaelis <[email protected]>
    Cc: [email protected]
    Fixes: ae98dbf43d75 ("io_uring/kbuf: add support for incremental buffer consumption")
    Link: https://github.com/axboe/liburing/issues/1553
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

io_uring/uring_cmd: fix too strict requirement on ioctl [+ + +]

Author: Asbjørn Sloth Tønnesen <[email protected]>
Date:   Sat Feb 28 18:08:53 2026 +0000

    io_uring/uring_cmd: fix too strict requirement on ioctl
    
    [ Upstream commit 600b665b903733bd60334e86031b157cc823ee55 ]
    
    Attempting SOCKET_URING_OP_SETSOCKOPT on an AF_NETLINK socket resulted
    in an -EOPNOTSUPP, as AF_NETLINK doesn't have an ioctl in its struct
    proto, but only in struct proto_ops.
    
    Prior to the blamed commit, io_uring_cmd_sock() only had two cmd_op
    operations, both requiring ioctl, thus the check was warranted.
    
    Since then, 4 new cmd_op operations have been added, none of which
    depend on ioctl. This patch moves the ioctl check, so it only applies
    to the original operations.
    
    AFAICT, the ioctl requirement was unintentional, and it wasn't
    visible in the blamed patch within 3 lines of context.
    
    Cc: [email protected]
    Fixes: a5d2f99aff6b ("io_uring/cmd: Introduce SOCKET_URING_OP_GETSOCKOPT")
    Signed-off-by: Asbjørn Sloth Tønnesen <[email protected]>
    Reviewed-by: Gabriel Krisman Bertazi <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    [Asbjørn: function moved in commit 91db6edc573b; updated subject prefix]
    Signed-off-by: Asbjørn Sloth Tønnesen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iomap: reject delalloc mappings during writeback [+ + +]

Author: Darrick J. Wong <[email protected]>
Date:   Tue Mar 17 20:27:04 2026 -0400

    iomap: reject delalloc mappings during writeback
    
    [ Upstream commit d320f160aa5ff36cdf83c645cca52b615e866e32 ]
    
    Filesystems should never provide a delayed allocation mapping to
    writeback; they're supposed to allocate the space before replying.
    This can lead to weird IO errors and crashes in the block layer if the
    filesystem is being malicious, or if it hadn't set iomap->dev because
    it's a delalloc mapping.
    
    Fix this by failing writeback on delalloc mappings.  Currently no
    filesystems actually misbehave in this manner, but we ought to be
    stricter about things like that.
    
    Cc: [email protected] # v5.5
    Fixes: 598ecfbaa742ac ("iomap: lift the xfs writeback code to iomap")
    Signed-off-by: Darrick J. Wong <[email protected]>
    Link: https://patch.msgid.link/20260302173002.GL13829@frogsfrogsfrogs
    Reviewed-by: Christoph Hellwig <[email protected]>
    Reviewed-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Christian Brauner <[email protected]>
    [ iomap_add_to_ioend() => iomap_writepage_map_blocks() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/vt-d: Fix intel iommu iotlb sync hardlockup and retry [+ + +]

Author: Guanghui Feng <[email protected]>
Date:   Mon Mar 16 15:16:39 2026 +0800

    iommu/vt-d: Fix intel iommu iotlb sync hardlockup and retry
    
    commit fe89277c9ceb0d6af0aa665bcf24a41d8b1b79cd upstream.
    
    During the qi_check_fault process after an IOMMU ITE event, requests at
    odd-numbered positions in the queue are set to QI_ABORT, only satisfying
    single-request submissions. However, qi_submit_sync now supports multiple
    simultaneous submissions, and can't guarantee that the wait_desc will be
    at an odd-numbered position. Therefore, if an item times out, IOMMU can't
    re-initiate the request, resulting in an infinite polling wait.
    
    This modifies the process by setting the status of all requests already
    fetched by IOMMU and recorded as QI_IN_USE status (including wait_desc
    requests) to QI_ABORT, thus enabling multiple requests to be resubmitted.
    
    Fixes: 8a1d82462540 ("iommu/vt-d: Multiple descriptors per qi_submit_sync()")
    Cc: [email protected]
    Signed-off-by: Guanghui Feng <[email protected]>
    Tested-by: Shuai Xue <[email protected]>
    Reviewed-by: Shuai Xue <[email protected]>
    Reviewed-by: Samiullah Khawaja <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Lu Baolu <[email protected]>
    Fixes: 8a1d82462540 ("iommu/vt-d: Multiple descriptors per  qi_submit_sync()")
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ipv6: use RCU in ip6_xmit() [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Thu Aug 28 19:58:18 2025 +0000

    ipv6: use RCU in ip6_xmit()
    
    commit 9085e56501d93af9f2d7bd16f7fcfacdde47b99c upstream.
    
    Use RCU in ip6_xmit() in order to use dst_dev_rcu() to prevent
    possible UAF.
    
    Fixes: 4a6ce2b6f2ec ("net: introduce a new function dst_dev_put()")
    Signed-off-by: Eric Dumazet <[email protected]>
    Reviewed-by: David Ahern <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Keerthana K <[email protected]>
    Signed-off-by: Shivani Agarwal <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

irqchip/gic-v3-its: Limit number of per-device MSIs to the range the ITS supports [+ + +]

Author: Marc Zyngier <[email protected]>
Date:   Fri Feb 6 15:48:16 2026 +0000

    irqchip/gic-v3-its: Limit number of per-device MSIs to the range the ITS supports
    
    commit ce9e40a9a5e5cff0b1b0d2fa582b3d71a8ce68e8 upstream.
    
    The ITS driver blindly assumes that EventIDs are in abundant supply, to the
    point where it never checks how many the hardware actually supports.
    
    It turns out that some pretty esoteric integrations make it so that only a
    few bits are available, all the way down to a single bit.
    
    Enforce the advertised limitation at the point of allocating the device
    structure, and hope that the endpoint driver can deal with such limitation.
    
    Fixes: 84a6a2e7fc18d ("irqchip: GICv3: ITS: device allocation and configuration")
    Signed-off-by: Marc Zyngier <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Reviewed-by: Robin Murphy <[email protected]>
    Reviewed-by: Zenghui Yu <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ixgbevf: fix link setup issue [+ + +]

Author: Jedrzej Jagielski <[email protected]>
Date:   Wed Dec 10 12:26:51 2025 +0100

    ixgbevf: fix link setup issue
    
    commit feae40a6a178bb525a15f19288016e5778102a99 upstream.
    
    It may happen that VF spawned for E610 adapter has problem with setting
    link up. This happens when ixgbevf supporting mailbox API 1.6 cooperates
    with PF driver which doesn't support this version of API, and hence
    doesn't support new approach for getting PF link data.
    
    In that case VF asks PF to provide link data but as PF doesn't support
    it, returns -EOPNOTSUPP what leads to early bail from link configuration
    sequence.
    
    Avoid such situation by using legacy VFLINKS approach whenever negotiated
    API version is less than 1.6.
    
    To reproduce the issue just create VF and set its link up - adapter must
    be any from the E610 family, ixgbevf must support API 1.6 or higher while
    ixgbevf must not.
    
    Fixes: 53f0eb62b4d2 ("ixgbevf: fix getting link speed data for E610 devices")
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Reviewed-by: Piotr Kwapulinski <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Cc: [email protected]
    Signed-off-by: Jedrzej Jagielski <[email protected]>
    Tested-by: Rafal Romanowski <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kbuild: Disable CC_HAS_ASM_GOTO_OUTPUT on clang < 17 [+ + +]

Author: Thomas Gleixner <[email protected]>
Date:   Tue Sep 16 15:21:51 2025 +0200

    kbuild: Disable CC_HAS_ASM_GOTO_OUTPUT on clang < 17
    
    [ Upstream commit e2ffa15b9baa447e444d654ffd47123ba6443ae4 ]
    
    clang < 17 fails to use scope local labels with CONFIG_CC_HAS_ASM_GOTO_OUTPUT=y:
    
         {
            __label__ local_lbl;
            ...
            unsafe_get_user(uval, uaddr, local_lbl);
            ...
            return 0;
            local_lbl:
                    return -EFAULT;
         }
    
    when two such scopes exist in the same function:
    
      error: cannot jump from this asm goto statement to one of its possible targets
    
    There are other failure scenarios. Shuffling code around slightly makes it
    worse and fail even with one instance.
    
    That issue prevents using local labels for a cleanup based user access
    mechanism.
    
    After failed attempts to provide a simple enough test case for the 'depends
    on' test in Kconfig, the initial cure was to mark ASM goto broken on clang
    versions < 17 to get this road block out of the way.
    
    But Nathan pointed out that this is a known clang issue and indeed affects
    clang < version 17 in combination with cleanup(). It's not even required to
    use local labels for that.
    
    The clang issue tracker has a small enough test case, which can be used as
    a test in the 'depends on' section of CC_HAS_ASM_GOTO_OUTPUT:
    
    void bar(void **);
    void* baz(void);
    
    int  foo (void) {
        {
                asm goto("jmp %l0"::::l0);
                return 0;
    l0:
                return 1;
        }
        void *x __attribute__((cleanup(bar))) = baz();
        {
                asm goto("jmp %l0"::::l1);
                return 42;
    l1:
                return 0xff;
        }
    }
    
    Add another dependency to config CC_HAS_ASM_GOTO_OUTPUT for it and use the
    clang issue tracker test case for detection by condensing it to obfuscated
    C-code contest format. This reliably catches the problem on clang < 17 and
    did not show any issues on the non broken GCC versions.
    
    That test might be sufficient to catch all issues and therefore could
    replace the existing test, but keeping that around does no harm either.
    
    Thanks to Nathan for pointing to the relevant clang issue!
    
    Suggested-by: Nathan Chancellor <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Cc: Nathan Chancellor <[email protected]>
    Reviewed-by: Nathan Chancellor <[email protected]>
    Link: https://github.com/ClangBuiltLinux/linux/issues/1886
    Link: https://github.com/llvm/llvm-project/commit/f023f5cdb2e6c19026f04a15b5a935c041835d14
    Signed-off-by: Sasha Levin <[email protected]>

kbuild: Leave objtool binary around with 'make clean' [+ + +]

Author: Nathan Chancellor <[email protected]>
Date:   Mon Mar 9 14:36:58 2026 -0400

    kbuild: Leave objtool binary around with 'make clean'
    
    [ Upstream commit fdb12c8a24a453bdd6759979b6ef1e04ebd4beb4 ]
    
    The difference between 'make clean' and 'make mrproper' is documented in
    'make help' as:
    
      clean     - Remove most generated files but keep the config and
                  enough build support to build external modules
      mrproper  - Remove all generated files + config + various backup files
    
    After commit 68b4fe32d737 ("kbuild: Add objtool to top-level clean
    target"), running 'make clean' then attempting to build an external
    module with the resulting build directory fails with
    
      $ make ARCH=x86_64 O=build clean
    
      $ make -C build M=... MO=...
      ...
      /bin/sh: line 1: .../build/tools/objtool/objtool: No such file or directory
    
    as 'make clean' removes the objtool binary.
    
    Split the objtool clean target into mrproper and clean like Kbuild does
    and remove all generated artifacts with 'make clean' except for the
    objtool binary, which is removed with 'make mrproper'. To avoid a small
    race when running the objtool clean target through both objtool_mrproper
    and objtool_clean when running 'make mrproper', modify objtool's clean
    up find command to avoid using find's '-delete' command by piping the
    files into 'xargs rm -f' like the rest of Kbuild does.
    
    Cc: [email protected]
    Fixes: 68b4fe32d737 ("kbuild: Add objtool to top-level clean target")
    Reported-by: Michal Suchanek <[email protected]>
    Closes: https://lore.kernel.org/[email protected]/
    Reported-by: Rainer Fiebig <[email protected]>
    Closes: https://lore.kernel.org/[email protected]/
    Acked-by: Josh Poimboeuf <[email protected]>
    Acked-by: Peter Zijlstra (Intel) <[email protected]>
    Reviewed-by: Nicolas Schier <[email protected]>
    Tested-by: Nicolas Schier <[email protected]>
    Link: https://patch.msgid.link/20260227-avoid-objtool-binary-removal-clean-v1-1-122f3e55eae9@kernel.org
    Signed-off-by: Nathan Chancellor <[email protected]>
    [ Context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kexec: Consolidate machine_kexec_mask_interrupts() implementation [+ + +]

Author: Eliav Farber <[email protected]>
Date:   Wed Dec 4 14:20:02 2024 +0000

    kexec: Consolidate machine_kexec_mask_interrupts() implementation
    
    [ Upstream commit bad6722e478f5b17a5ceb039dfb4c680cf2c0b48 ]
    
    Consolidate the machine_kexec_mask_interrupts implementation into a common
    function located in a new file: kernel/irq/kexec.c. This removes duplicate
    implementations from architecture-specific files in arch/arm, arch/arm64,
    arch/powerpc, and arch/riscv, reducing code duplication and improving
    maintainability.
    
    The new implementation retains architecture-specific behavior for
    CONFIG_GENERIC_IRQ_KEXEC_CLEAR_VM_FORWARD, which was previously implemented
    for ARM64. When enabled (currently for ARM64), it clears the active state
    of interrupts forwarded to virtual machines (VMs) before handling other
    interrupt masking operations.
    
    Signed-off-by: Eliav Farber <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://lore.kernel.org/all/[email protected]
    Stable-dep-of: 20197b967a6a ("powerpc/kexec/core: use big-endian types for crash variables")
    Signed-off-by: Sasha Levin <[email protected]>

kexec: Include kernel-end even without crashkernel [+ + +]

Author: Ben Collins <[email protected]>
Date:   Mon Apr 21 22:36:46 2025 -0400

    kexec: Include kernel-end even without crashkernel
    
    [ Upstream commit 38c64dfe0af12778953846df5f259e913275cfe5 ]
    
    Certain versions of kexec don't even work without kernel-end being
    added to the device-tree. Add it even if crash-kernel is disabled.
    
    Signed-off-by: Ben Collins <[email protected]>
    Reviewed-by: Sourabh Jain <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/2025042122-inescapable-mandrill-8a5ff2@boujee-and-buff
    Stable-dep-of: 20197b967a6a ("powerpc/kexec/core: use big-endian types for crash variables")
    Signed-off-by: Sasha Levin <[email protected]>

kprobes: avoid crash when rmmod/insmod after ftrace killed [+ + +]

Author: Masami Hiramatsu (Google) <[email protected]>
Date:   Fri Mar 13 23:14:14 2026 +0900

    kprobes: avoid crash when rmmod/insmod after ftrace killed
    
    commit e113f0b46d19626ec15388bcb91432c9a4fd6261 upstream.
    
    After we hit ftrace is killed by some errors, the kernel crash if
    we remove modules in which kprobe probes.
    
    BUG: unable to handle page fault for address: fffffbfff805000d
    PGD 817fcc067 P4D 817fcc067 PUD 817fc8067 PMD 101555067 PTE 0
    Oops: Oops: 0000 [#1] SMP KASAN PTI
    CPU: 4 UID: 0 PID: 2012 Comm: rmmod Tainted: G        W  OE
    Tainted: [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
    RIP: 0010:kprobes_module_callback+0x89/0x790
    RSP: 0018:ffff88812e157d30 EFLAGS: 00010a02
    RAX: 1ffffffff805000d RBX: dffffc0000000000 RCX: ffffffff86a8de90
    RDX: ffffed1025c2af9b RSI: 0000000000000008 RDI: ffffffffc0280068
    RBP: 0000000000000000 R08: 0000000000000001 R09: ffffed1025c2af9a
    R10: ffff88812e157cd7 R11: 205d323130325420 R12: 0000000000000002
    R13: ffffffffc0290488 R14: 0000000000000002 R15: ffffffffc0280040
    FS:  00007fbc450dd740(0000) GS:ffff888420331000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: fffffbfff805000d CR3: 000000010f624000 CR4: 00000000000006f0
    Call Trace:
     <TASK>
     notifier_call_chain+0xc6/0x280
     blocking_notifier_call_chain+0x60/0x90
     __do_sys_delete_module.constprop.0+0x32a/0x4e0
     do_syscall_64+0x5d/0xfa0
     entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    This is because the kprobe on ftrace does not correctly handles
    the kprobe_ftrace_disabled flag set by ftrace_kill().
    
    To prevent this error, check kprobe_ftrace_disabled in
    __disarm_kprobe_ftrace() and skip all ftrace related operations.
    
    Link: https://lore.kernel.org/all/176473947565.1727781.13110060700668331950.stgit@mhiramat.tok.corp.google.com/
    
    Reported-by: Ye Bin <[email protected]>
    Closes: https://lore.kernel.org/all/[email protected]/
    Fixes: ae6aa16fdc16 ("kprobes: introduce ftrace based optimization")
    Cc: [email protected]
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Acked-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kprobes: Remove unneeded goto [+ + +]

Author: Masami Hiramatsu (Google) <[email protected]>
Date:   Thu Mar 19 09:39:12 2026 -0400

    kprobes: Remove unneeded goto
    
    [ Upstream commit 5e5b8b49335971b68b54afeb0e7ded004945af07 ]
    
    Remove unneeded gotos. Since the labels referred by these gotos have
    only one reference for each, we can replace those gotos with the
    referred code.
    
    Link: https://lore.kernel.org/all/173371211203.480397.13988907319659165160.stgit@devnote2/
    
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Stable-dep-of: 5ef268cb7a0a ("kprobes: Remove unneeded warnings from __arm_kprobe_ftrace()")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kprobes: Remove unneeded warnings from __arm_kprobe_ftrace() [+ + +]

Author: Masami Hiramatsu (Google) <[email protected]>
Date:   Thu Mar 19 09:39:13 2026 -0400

    kprobes: Remove unneeded warnings from __arm_kprobe_ftrace()
    
    [ Upstream commit 5ef268cb7a0aac55521fd9881f1939fa94a8988e ]
    
    Remove unneeded warnings for handled errors from __arm_kprobe_ftrace()
    because all caller handled the error correctly.
    
    Link: https://lore.kernel.org/all/177261531182.1312989.8737778408503961141.stgit@mhiramat.tok.corp.google.com/
    
    Reported-by: Zw Tang <[email protected]>
    Closes: https://lore.kernel.org/all/CAPHJ_V+J6YDb_wX2nhXU6kh466Dt_nyDSas-1i_Y8s7tqY-Mzw@mail.gmail.com/
    Fixes: 9c89bb8e3272 ("kprobes: treewide: Cleanup the error messages for kprobes")
    Cc: [email protected]
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: call ksmbd_vfs_kern_path_end_removing() on some error paths [+ + +]

Author: Fedor Pchelkin <[email protected]>
Date:   Tue Feb 24 21:12:15 2026 -0500

    ksmbd: call ksmbd_vfs_kern_path_end_removing() on some error paths
    
    [ Upstream commit a09dc10d1353f0e92c21eae2a79af1c2b1ddcde8 ]
    
    There are two places where ksmbd_vfs_kern_path_end_removing() needs to be
    called in order to balance what the corresponding successful call to
    ksmbd_vfs_kern_path_start_removing() has done, i.e. drop inode locks and
    put the taken references.  Otherwise there might be potential deadlocks
    and unbalanced locks which are caught like:
    
    BUG: workqueue leaked lock or atomic: kworker/5:21/0x00000000/7596
         last function: handle_ksmbd_work
    2 locks held by kworker/5:21/7596:
     #0: ffff8881051ae448 (sb_writers#3){.+.+}-{0:0}, at: ksmbd_vfs_kern_path_locked+0x142/0x660
     #1: ffff888130e966c0 (&type->i_mutex_dir_key#3/1){+.+.}-{4:4}, at: ksmbd_vfs_kern_path_locked+0x17d/0x660
    CPU: 5 PID: 7596 Comm: kworker/5:21 Not tainted 6.1.162-00456-gc29b353f383b #138
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-debian-1.17.0-1 04/01/2014
    Workqueue: ksmbd-io handle_ksmbd_work
    Call Trace:
     <TASK>
     dump_stack_lvl+0x44/0x5b
     process_one_work.cold+0x57/0x5c
     worker_thread+0x82/0x600
     kthread+0x153/0x190
     ret_from_fork+0x22/0x30
     </TASK>
    
    Found by Linux Verification Center (linuxtesting.org).
    
    Fixes: d5fc1400a34b ("smb/server: avoid deadlock when linking with ReplaceIfExists")
    Cc: [email protected]
    Signed-off-by: Fedor Pchelkin <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    [ ksmbd_vfs_kern_path_end_removing() call -> ksmbd_vfs_kern_path_unlock() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: Compare MACs in constant time [+ + +]

Author: Eric Biggers <[email protected]>
Date:   Tue Mar 10 12:52:51 2026 -0700

    ksmbd: Compare MACs in constant time
    
    commit c5794709bc9105935dbedef8b9cf9c06f2b559fa upstream.
    
    To prevent timing attacks, MAC comparisons need to be constant-time.
    Replace the memcmp() with the correct function, crypto_memneq().
    
    Fixes: e2f34481b24d ("cifsd: add server-side procedures for SMB3")
    Cc: [email protected]
    Signed-off-by: Eric Biggers <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: Don't log keys in SMB3 signing and encryption key generation [+ + +]

Author: Thorsten Blum <[email protected]>
Date:   Wed Mar 18 10:36:20 2026 -0400

    ksmbd: Don't log keys in SMB3 signing and encryption key generation
    
    [ Upstream commit 441336115df26b966575de56daf7107ed474faed ]
    
    When KSMBD_DEBUG_AUTH logging is enabled, generate_smb3signingkey() and
    generate_smb3encryptionkey() log the session, signing, encryption, and
    decryption key bytes. Remove the logs to avoid exposing credentials.
    
    Fixes: e2f34481b24d ("cifsd: add server-side procedures for SMB3")
    Cc: [email protected]
    Signed-off-by: Thorsten Blum <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    [ Context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix use-after-free by using call_rcu() for oplock_info [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Sat Mar 7 11:32:31 2026 +0900

    ksmbd: fix use-after-free by using call_rcu() for oplock_info
    
    commit 1dfd062caa165ec9d7ee0823087930f3ab8a6294 upstream.
    
    ksmbd currently frees oplock_info immediately using kfree(), even
    though it is accessed under RCU read-side critical sections in places
    like opinfo_get() and proc_show_files().
    
    Since there is no RCU grace period delay between nullifying the pointer
    and freeing the memory, a reader can still access oplock_info
    structure after it has been freed. This can leads to a use-after-free
    especially in opinfo_get() where atomic_inc_not_zero() is called on
    already freed memory.
    
    Fix this by switching to deferred freeing using call_rcu().
    
    Fixes: 18b4fac5ef17 ("ksmbd: fix use-after-free in smb_break_all_levII_oplock()")
    Cc: [email protected]
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix use-after-free in durable v2 replay of active file handles [+ + +]

Author: Hyunwoo Kim <[email protected]>
Date:   Thu Mar 12 17:15:51 2026 +0900

    ksmbd: fix use-after-free in durable v2 replay of active file handles
    
    [ Upstream commit b425e4d0eb321a1116ddbf39636333181675d8f4 ]
    
    parse_durable_handle_context() unconditionally assigns dh_info->fp->conn
    to the current connection when handling a DURABLE_REQ_V2 context with
    SMB2_FLAGS_REPLAY_OPERATION. ksmbd_lookup_fd_cguid() does not filter by
    fp->conn, so it returns file handles that are already actively connected.
    The unconditional overwrite replaces fp->conn, and when the overwriting
    connection is subsequently freed, __ksmbd_close_fd() dereferences the
    stale fp->conn via spin_lock(&fp->conn->llist_lock), causing a
    use-after-free.
    
    KASAN report:
    
    [    7.349357] ==================================================================
    [    7.349607] BUG: KASAN: slab-use-after-free in _raw_spin_lock+0x75/0xe0
    [    7.349811] Write of size 4 at addr ffff8881056ac18c by task kworker/1:2/108
    [    7.350010]
    [    7.350064] CPU: 1 UID: 0 PID: 108 Comm: kworker/1:2 Not tainted 7.0.0-rc3+ #58 PREEMPTLAZY
    [    7.350068] Hardware name: QEMU Ubuntu 24.04 PC v2 (i440FX + PIIX, arch_caps fix, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    [    7.350070] Workqueue: ksmbd-io handle_ksmbd_work
    [    7.350083] Call Trace:
    [    7.350087]  <TASK>
    [    7.350087]  dump_stack_lvl+0x64/0x80
    [    7.350094]  print_report+0xce/0x660
    [    7.350100]  ? __pfx__raw_spin_lock_irqsave+0x10/0x10
    [    7.350101]  ? __pfx___mod_timer+0x10/0x10
    [    7.350106]  ? _raw_spin_lock+0x75/0xe0
    [    7.350108]  kasan_report+0xce/0x100
    [    7.350109]  ? _raw_spin_lock+0x75/0xe0
    [    7.350114]  kasan_check_range+0x105/0x1b0
    [    7.350116]  _raw_spin_lock+0x75/0xe0
    [    7.350118]  ? __pfx__raw_spin_lock+0x10/0x10
    [    7.350119]  ? __call_rcu_common.constprop.0+0x25e/0x780
    [    7.350125]  ? close_id_del_oplock+0x2cc/0x4e0
    [    7.350128]  __ksmbd_close_fd+0x27f/0xaf0
    [    7.350131]  ksmbd_close_fd+0x135/0x1b0
    [    7.350133]  smb2_close+0xb19/0x15b0
    [    7.350142]  ? __pfx_smb2_close+0x10/0x10
    [    7.350143]  ? xas_load+0x18/0x270
    [    7.350146]  ? _raw_spin_lock+0x84/0xe0
    [    7.350148]  ? __pfx__raw_spin_lock+0x10/0x10
    [    7.350150]  ? _raw_spin_unlock+0xe/0x30
    [    7.350151]  ? ksmbd_smb2_check_message+0xeb2/0x24c0
    [    7.350153]  ? ksmbd_tree_conn_lookup+0xcd/0xf0
    [    7.350154]  handle_ksmbd_work+0x40f/0x1080
    [    7.350156]  process_one_work+0x5fa/0xef0
    [    7.350162]  ? assign_work+0x122/0x3e0
    [    7.350163]  worker_thread+0x54b/0xf70
    [    7.350165]  ? __pfx_worker_thread+0x10/0x10
    [    7.350166]  kthread+0x346/0x470
    [    7.350170]  ? recalc_sigpending+0x19b/0x230
    [    7.350176]  ? __pfx_kthread+0x10/0x10
    [    7.350178]  ret_from_fork+0x4fb/0x6c0
    [    7.350183]  ? __pfx_ret_from_fork+0x10/0x10
    [    7.350185]  ? __switch_to+0x36c/0xbe0
    [    7.350188]  ? __pfx_kthread+0x10/0x10
    [    7.350190]  ret_from_fork_asm+0x1a/0x30
    [    7.350197]  </TASK>
    [    7.350197]
    [    7.355160] Allocated by task 123:
    [    7.355261]  kasan_save_stack+0x33/0x60
    [    7.355373]  kasan_save_track+0x14/0x30
    [    7.355484]  __kasan_kmalloc+0x8f/0xa0
    [    7.355593]  ksmbd_conn_alloc+0x44/0x6d0
    [    7.355711]  ksmbd_kthread_fn+0x243/0xd70
    [    7.355839]  kthread+0x346/0x470
    [    7.355942]  ret_from_fork+0x4fb/0x6c0
    [    7.356051]  ret_from_fork_asm+0x1a/0x30
    [    7.356164]
    [    7.356214] Freed by task 134:
    [    7.356305]  kasan_save_stack+0x33/0x60
    [    7.356416]  kasan_save_track+0x14/0x30
    [    7.356527]  kasan_save_free_info+0x3b/0x60
    [    7.356646]  __kasan_slab_free+0x43/0x70
    [    7.356761]  kfree+0x1ca/0x430
    [    7.356862]  ksmbd_tcp_disconnect+0x59/0xe0
    [    7.356993]  ksmbd_conn_handler_loop+0x77e/0xd40
    [    7.357138]  kthread+0x346/0x470
    [    7.357240]  ret_from_fork+0x4fb/0x6c0
    [    7.357350]  ret_from_fork_asm+0x1a/0x30
    [    7.357463]
    [    7.357513] The buggy address belongs to the object at ffff8881056ac000
    [    7.357513]  which belongs to the cache kmalloc-1k of size 1024
    [    7.357857] The buggy address is located 396 bytes inside of
    [    7.357857]  freed 1024-byte region [ffff8881056ac000, ffff8881056ac400)
    
    Fix by removing the unconditional fp->conn assignment and rejecting the
    replay when fp->conn is non-NULL. This is consistent with
    ksmbd_lookup_durable_fd(), which also rejects file handles with a
    non-NULL fp->conn. For disconnected file handles (fp->conn == NULL),
    ksmbd_reopen_durable_fd() handles setting fp->conn.
    
    Fixes: c8efcc786146 ("ksmbd: add support for durable handles v1/v2")
    Signed-off-by: Hyunwoo Kim <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ksmbd: fix use-after-free in smb_lazy_parent_lease_break_close() [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Mon Mar 2 12:55:02 2026 +0900

    ksmbd: fix use-after-free in smb_lazy_parent_lease_break_close()
    
    commit eac3361e3d5dd8067b3258c69615888eb45e9f25 upstream.
    
    opinfo pointer obtained via rcu_dereference(fp->f_opinfo) is being
    accessed after rcu_read_unlock() has been called. This creates a
    race condition where the memory could be freed by a concurrent
    writer between the unlock and the subsequent pointer dereferences
    (opinfo->is_lease, etc.), leading to a use-after-free.
    
    Fixes: 5fb282ba4fef ("ksmbd: fix possible null-deref in smb_lazy_parent_lease_break_close")
    Cc: [email protected]
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix use-after-free of share_conf in compound request [+ + +]

Author: Hyunwoo Kim <[email protected]>
Date:   Thu Mar 12 17:17:02 2026 +0900

    ksmbd: fix use-after-free of share_conf in compound request
    
    [ Upstream commit c33615f995aee80657b9fdfbc4ee7f49c2bd733d ]
    
    smb2_get_ksmbd_tcon() reuses work->tcon in compound requests without
    validating tcon->t_state. ksmbd_tree_conn_lookup() checks t_state ==
    TREE_CONNECTED on the initial lookup path, but the compound reuse path
    bypasses this check entirely.
    
    If a prior command in the compound (SMB2_TREE_DISCONNECT) sets t_state
    to TREE_DISCONNECTED and frees share_conf via ksmbd_share_config_put(),
    subsequent commands dereference the freed share_conf through
    work->tcon->share_conf.
    
    KASAN report:
    
    [    4.144653] ==================================================================
    [    4.145059] BUG: KASAN: slab-use-after-free in smb2_write+0xc74/0xe70
    [    4.145415] Read of size 4 at addr ffff88810430c194 by task kworker/1:1/44
    [    4.145772]
    [    4.145867] CPU: 1 UID: 0 PID: 44 Comm: kworker/1:1 Not tainted 7.0.0-rc3+ #60 PREEMPTLAZY
    [    4.145871] Hardware name: QEMU Ubuntu 24.04 PC v2 (i440FX + PIIX, arch_caps fix, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    [    4.145875] Workqueue: ksmbd-io handle_ksmbd_work
    [    4.145888] Call Trace:
    [    4.145892]  <TASK>
    [    4.145894]  dump_stack_lvl+0x64/0x80
    [    4.145910]  print_report+0xce/0x660
    [    4.145919]  ? __pfx__raw_spin_lock_irqsave+0x10/0x10
    [    4.145928]  ? smb2_write+0xc74/0xe70
    [    4.145931]  kasan_report+0xce/0x100
    [    4.145934]  ? smb2_write+0xc74/0xe70
    [    4.145937]  smb2_write+0xc74/0xe70
    [    4.145939]  ? __pfx_smb2_write+0x10/0x10
    [    4.145942]  ? _raw_spin_unlock+0xe/0x30
    [    4.145945]  ? ksmbd_smb2_check_message+0xeb2/0x24c0
    [    4.145948]  ? smb2_tree_disconnect+0x31c/0x480
    [    4.145951]  handle_ksmbd_work+0x40f/0x1080
    [    4.145953]  process_one_work+0x5fa/0xef0
    [    4.145962]  ? assign_work+0x122/0x3e0
    [    4.145964]  worker_thread+0x54b/0xf70
    [    4.145967]  ? __pfx_worker_thread+0x10/0x10
    [    4.145970]  kthread+0x346/0x470
    [    4.145976]  ? recalc_sigpending+0x19b/0x230
    [    4.145980]  ? __pfx_kthread+0x10/0x10
    [    4.145984]  ret_from_fork+0x4fb/0x6c0
    [    4.145992]  ? __pfx_ret_from_fork+0x10/0x10
    [    4.145995]  ? __switch_to+0x36c/0xbe0
    [    4.145999]  ? __pfx_kthread+0x10/0x10
    [    4.146003]  ret_from_fork_asm+0x1a/0x30
    [    4.146013]  </TASK>
    [    4.146014]
    [    4.149858] Allocated by task 44:
    [    4.149953]  kasan_save_stack+0x33/0x60
    [    4.150061]  kasan_save_track+0x14/0x30
    [    4.150169]  __kasan_kmalloc+0x8f/0xa0
    [    4.150274]  ksmbd_share_config_get+0x1dd/0xdd0
    [    4.150401]  ksmbd_tree_conn_connect+0x7e/0x600
    [    4.150529]  smb2_tree_connect+0x2e6/0x1000
    [    4.150645]  handle_ksmbd_work+0x40f/0x1080
    [    4.150761]  process_one_work+0x5fa/0xef0
    [    4.150873]  worker_thread+0x54b/0xf70
    [    4.150978]  kthread+0x346/0x470
    [    4.151071]  ret_from_fork+0x4fb/0x6c0
    [    4.151176]  ret_from_fork_asm+0x1a/0x30
    [    4.151286]
    [    4.151332] Freed by task 44:
    [    4.151418]  kasan_save_stack+0x33/0x60
    [    4.151526]  kasan_save_track+0x14/0x30
    [    4.151634]  kasan_save_free_info+0x3b/0x60
    [    4.151751]  __kasan_slab_free+0x43/0x70
    [    4.151861]  kfree+0x1ca/0x430
    [    4.151952]  __ksmbd_tree_conn_disconnect+0xc8/0x190
    [    4.152088]  smb2_tree_disconnect+0x1cd/0x480
    [    4.152211]  handle_ksmbd_work+0x40f/0x1080
    [    4.152326]  process_one_work+0x5fa/0xef0
    [    4.152438]  worker_thread+0x54b/0xf70
    [    4.152545]  kthread+0x346/0x470
    [    4.152638]  ret_from_fork+0x4fb/0x6c0
    [    4.152743]  ret_from_fork_asm+0x1a/0x30
    [    4.152853]
    [    4.152900] The buggy address belongs to the object at ffff88810430c180
    [    4.152900]  which belongs to the cache kmalloc-96 of size 96
    [    4.153226] The buggy address is located 20 bytes inside of
    [    4.153226]  freed 96-byte region [ffff88810430c180, ffff88810430c1e0)
    [    4.153549]
    [    4.153596] The buggy address belongs to the physical page:
    [    4.153750] page: refcount:0 mapcount:0 mapping:0000000000000000 index:0xffff88810430ce80 pfn:0x10430c
    [    4.154000] flags: 0x100000000000200(workingset|node=0|zone=2)
    [    4.154160] page_type: f5(slab)
    [    4.154251] raw: 0100000000000200 ffff888100041280 ffff888100040110 ffff888100040110
    [    4.154461] raw: ffff88810430ce80 0000000800200009 00000000f5000000 0000000000000000
    [    4.154668] page dumped because: kasan: bad access detected
    [    4.154820]
    [    4.154866] Memory state around the buggy address:
    [    4.155002]  ffff88810430c080: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [    4.155196]  ffff88810430c100: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [    4.155391] >ffff88810430c180: fa fb fb fb fb fb fb fb fb fb fb fb fc fc fc fc
    [    4.155587]                          ^
    [    4.155693]  ffff88810430c200: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [    4.155891]  ffff88810430c280: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [    4.156087] ==================================================================
    
    Add the same t_state validation to the compound reuse path, consistent
    with ksmbd_tree_conn_lookup().
    
    Fixes: 5005bcb42191 ("ksmbd: validate session id and tree id in the compound request")
    Signed-off-by: Hyunwoo Kim <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ksmbd: unset conn->binding on failed binding request [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Fri Mar 13 10:00:58 2026 +0900

    ksmbd: unset conn->binding on failed binding request
    
    commit 282343cf8a4a5a3603b1cb0e17a7083e4a593b03 upstream.
    
    When a multichannel SMB2_SESSION_SETUP request with
    SMB2_SESSION_REQ_FLAG_BINDING fails ksmbd sets conn->binding = true
    but never clears it on the error path. This leaves the connection in
    a binding state where all subsequent ksmbd_session_lookup_all() calls
    fall back to the global sessions table. This fix it by clearing
    conn->binding = false in the error path.
    
    Cc: [email protected]
    Reported-by: Hyunwoo Kim <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: use volume UUID in FS_OBJECT_ID_INFORMATION [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Fri Mar 13 10:01:29 2026 +0900

    ksmbd: use volume UUID in FS_OBJECT_ID_INFORMATION
    
    commit 3a64125730cabc34fccfbc230c2667c2e14f7308 upstream.
    
    Use sb->s_uuid for a proper volume identifier as the primary choice.
    For filesystems that do not provide a UUID, fall back to stfs.f_fsid
    obtained from vfs_statfs().
    
    Cc: [email protected]
    Reported-by: Hyunwoo Kim <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: nVMX: Add consistency checks for CR0.WP and CR4.CET [+ + +]

Author: Chao Gao <[email protected]>
Date:   Mon Mar 16 13:20:02 2026 -0400

    KVM: nVMX: Add consistency checks for CR0.WP and CR4.CET
    
    [ Upstream commit 8060b2bd2dd05a19ad7ec248489d374f2bd2b057 ]
    
    Add consistency checks for CR4.CET and CR0.WP in guest-state or host-state
    area in the VMCS12. This ensures that configurations with CR4.CET set and
    CR0.WP not set result in VM-entry failure, aligning with architectural
    behavior.
    
    Tested-by: Mathias Krause <[email protected]>
    Tested-by: John Allen <[email protected]>
    Tested-by: Rick Edgecombe <[email protected]>
    Signed-off-by: Chao Gao <[email protected]>
    Reviewed-by: Binbin Wu <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Stable-dep-of: e2ffe85b6d2b ("KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: SVM: Add a helper to look up the max physical ID for AVIC [+ + +]

Author: Naveen N Rao <[email protected]>
Date:   Mon Mar 16 15:18:44 2026 -0400

    KVM: SVM: Add a helper to look up the max physical ID for AVIC
    
    [ Upstream commit f2f6e67a56dc88fea7e9b10c4e79bb01d97386b7 ]
    
    To help with a future change, add a helper to look up the maximum
    physical ID depending on the vCPU AVIC mode. No functional change
    intended.
    
    Suggested-by: Sean Christopherson <[email protected]>
    Signed-off-by: Naveen N Rao (AMD) <[email protected]>
    Link: https://lore.kernel.org/r/0ab9bf5e20a3463a4aa3a5ea9bbbac66beedf1d1.1757009416.git.naveen@kernel.org
    Signed-off-by: Sean Christopherson <[email protected]>
    Stable-dep-of: 87d0f901a9bd ("KVM: SVM: Set/clear CR8 write interception when AVIC is (de)activated")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: SVM: Initialize AVIC VMCB fields if AVIC is enabled with in-kernel APIC [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Tue Feb 3 11:07:09 2026 -0800

    KVM: SVM: Initialize AVIC VMCB fields if AVIC is enabled with in-kernel APIC
    
    commit 3989a6d036c8ec82c0de3614bed23a1dacd45de5 upstream.
    
    Initialize all per-vCPU AVIC control fields in the VMCB if AVIC is enabled
    in KVM and the VM has an in-kernel local APIC, i.e. if it's _possible_ the
    vCPU could activate AVIC at any point in its lifecycle.  Configuring the
    VMCB if and only if AVIC is active "works" purely because of optimizations
    in kvm_create_lapic() to speculatively set apicv_active if AVIC is enabled
    *and* to defer updates until the first KVM_RUN.  In quotes because KVM
    likely won't do the right thing if kvm_apicv_activated() is false, i.e. if
    a vCPU is created while APICv is inhibited at the VM level for whatever
    reason.  E.g. if the inhibit is *removed* before KVM_REQ_APICV_UPDATE is
    handled in KVM_RUN, then __kvm_vcpu_update_apicv() will elide calls to
    vendor code due to seeing "apicv_active == activate".
    
    Cleaning up the initialization code will also allow fixing a bug where KVM
    incorrectly leaves CR8 interception enabled when AVIC is activated without
    creating a mess with respect to whether AVIC is activated or not.
    
    Cc: [email protected]
    Fixes: 67034bb9dd5e ("KVM: SVM: Add irqchip_split() checks before enabling AVIC")
    Fixes: 6c3e4422dd20 ("svm: Add support for dynamic APICv")
    Reviewed-by: Naveen N Rao (AMD) <[email protected]>
    Reviewed-by: Jim Mattson <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Paolo Bonzini <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: SVM: Limit AVIC physical max index based on configured max_vcpu_ids [+ + +]

Author: Naveen N Rao <[email protected]>
Date:   Mon Mar 16 15:18:43 2026 -0400

    KVM: SVM: Limit AVIC physical max index based on configured max_vcpu_ids
    
    [ Upstream commit 574ef752d4aea04134bc121294d717f4422c2755 ]
    
    KVM allows VMMs to specify the maximum possible APIC ID for a virtual
    machine through KVM_CAP_MAX_VCPU_ID capability so as to limit data
    structures related to APIC/x2APIC. Utilize the same to set the AVIC
    physical max index in the VMCB, similar to VMX. This helps hardware
    limit the number of entries to be scanned in the physical APIC ID table
    speeding up IPI broadcasts for virtual machines with smaller number of
    vCPUs.
    
    Unlike VMX, SVM AVIC requires a single page to be allocated for the
    Physical APIC ID table and the Logical APIC ID table, so retain the
    existing approach of allocating those during VM init.
    
    Signed-off-by: Naveen N Rao (AMD) <[email protected]>
    Link: https://lore.kernel.org/r/adb07ccdb3394cd79cb372ba6bcc69a4e4d4ef54.1757009416.git.naveen@kernel.org
    Signed-off-by: Sean Christopherson <[email protected]>
    Stable-dep-of: 87d0f901a9bd ("KVM: SVM: Set/clear CR8 write interception when AVIC is (de)activated")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: SVM: Set/clear CR8 write interception when AVIC is (de)activated [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Mon Mar 16 15:18:45 2026 -0400

    KVM: SVM: Set/clear CR8 write interception when AVIC is (de)activated
    
    [ Upstream commit 87d0f901a9bd8ae6be57249c737f20ac0cace93d ]
    
    Explicitly set/clear CR8 write interception when AVIC is (de)activated to
    fix a bug where KVM leaves the interception enabled after AVIC is
    activated.  E.g. if KVM emulates INIT=>WFS while AVIC is deactivated, CR8
    will remain intercepted in perpetuity.
    
    On its own, the dangling CR8 intercept is "just" a performance issue, but
    combined with the TPR sync bug fixed by commit d02e48830e3f ("KVM: SVM:
    Sync TPR from LAPIC into VMCB::V_TPR even if AVIC is active"), the danging
    intercept is fatal to Windows guests as the TPR seen by hardware gets
    wildly out of sync with reality.
    
    Note, VMX isn't affected by the bug as TPR_THRESHOLD is explicitly ignored
    when Virtual Interrupt Delivery is enabled, i.e. when APICv is active in
    KVM's world.  I.e. there's no need to trigger update_cr8_intercept(), this
    is firmly an SVM implementation flaw/detail.
    
    WARN if KVM gets a CR8 write #VMEXIT while AVIC is active, as KVM should
    never enter the guest with AVIC enabled and CR8 writes intercepted.
    
    Fixes: 3bbf3565f48c ("svm: Do not intercept CR8 when enable AVIC")
    Cc: [email protected]
    Cc: Jim Mattson <[email protected]>
    Cc: Naveen N Rao (AMD) <[email protected]>
    Cc: Maciej S. Szmigiero <[email protected]>
    Reviewed-by: Naveen N Rao (AMD) <[email protected]>
    Reviewed-by: Jim Mattson <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    [Squash fix to avic_deactivate_vmcb. - Paolo]
    Signed-off-by: Paolo Bonzini <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Allow vendor code to disable quirks [+ + +]

Author: Paolo Bonzini <[email protected]>
Date:   Mon Mar 16 13:19:59 2026 -0400

    KVM: x86: Allow vendor code to disable quirks
    
    [ Upstream commit a4dae7c7a41d803a05192015b2d47aca8aca4abf ]
    
    In some cases, the handling of quirks is split between platform-specific
    code and generic code, or it is done entirely in generic code, but the
    relevant bug does not trigger on some platforms; for example,
    this will be the case for "ignore guest PAT".  Allow unaffected vendor
    modules to disable handling of a quirk for all VMs via a new entry in
    kvm_caps.
    
    Such quirks remain available in KVM_CAP_DISABLE_QUIRKS2, because that API
    tells userspace that KVM *knows* that some of its past behavior was bogus
    or just undesirable.  In other words, it's plausible for userspace to
    refuse to run if a quirk is not listed by KVM_CAP_DISABLE_QUIRKS2, so
    preserve that and make it part of the API.
    
    As an example, mark KVM_X86_QUIRK_CD_NW_CLEARED as auto-disabled on
    Intel systems.
    
    Signed-off-by: Paolo Bonzini <[email protected]>
    Stable-dep-of: e2ffe85b6d2b ("KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Co-locate initialization of feature MSRs in kvm_arch_vcpu_create() [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Mon Mar 16 13:19:56 2026 -0400

    KVM: x86: Co-locate initialization of feature MSRs in kvm_arch_vcpu_create()
    
    [ Upstream commit 2142ac663a6a72ac868d0768681b1355e3a703eb ]
    
    Bunch all of the feature MSR initialization in kvm_arch_vcpu_create() so
    that it can be easily quirked in a future patch.
    
    No functional change intended.
    
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Stable-dep-of: e2ffe85b6d2b ("KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: do not allow re-enabling quirks [+ + +]

Author: Paolo Bonzini <[email protected]>
Date:   Mon Mar 16 13:19:58 2026 -0400

    KVM: x86: do not allow re-enabling quirks
    
    [ Upstream commit 9966b7822b3f49b3aea5d926ece4bc92f1f0a700 ]
    
    Allowing arbitrary re-enabling of quirks puts a limit on what the
    quirks themselves can do, since you cannot assume that the quirk
    prevents a particular state.  More important, it also prevents
    KVM from disabling a quirk at VM creation time, because userspace
    can always go back and re-enable that.
    
    Signed-off-by: Paolo Bonzini <[email protected]>
    Stable-dep-of: e2ffe85b6d2b ("KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Introduce Intel specific quirk KVM_X86_QUIRK_IGNORE_GUEST_PAT [+ + +]

Author: Yan Zhao <[email protected]>
Date:   Mon Mar 16 13:20:01 2026 -0400

    KVM: x86: Introduce Intel specific quirk KVM_X86_QUIRK_IGNORE_GUEST_PAT
    
    [ Upstream commit c9c1e20b4c7d60fa084b3257525d21a49fe651a1 ]
    
    Introduce an Intel specific quirk KVM_X86_QUIRK_IGNORE_GUEST_PAT to have
    KVM ignore guest PAT when this quirk is enabled.
    
    On AMD platforms, KVM always honors guest PAT.  On Intel however there are
    two issues.  First, KVM *cannot* honor guest PAT if CPU feature self-snoop
    is not supported. Second, UC access on certain Intel platforms can be very
    slow[1] and honoring guest PAT on those platforms may break some old
    guests that accidentally specify video RAM as UC. Those old guests may
    never expect the slowness since KVM always forces WB previously. See [2].
    
    So, introduce a quirk that KVM can enable by default on all Intel platforms
    to avoid breaking old unmodifiable guests. Newer userspace can disable this
    quirk if it wishes KVM to honor guest PAT; disabling the quirk will fail
    if self-snoop is not supported, i.e. if KVM cannot obey the wish.
    
    The quirk is a no-op on AMD and also if any assigned devices have
    non-coherent DMA.  This is not an issue, as KVM_X86_QUIRK_CD_NW_CLEARED is
    another example of a quirk that is sometimes automatically disabled.
    
    Suggested-by: Paolo Bonzini <[email protected]>
    Suggested-by: Sean Christopherson <[email protected]>
    Cc: Kevin Tian <[email protected]>
    Signed-off-by: Yan Zhao <[email protected]>
    Link: https://lore.kernel.org/all/[email protected] # [1]
    Link: https://lore.kernel.org/all/[email protected] # [2]
    Message-ID: <[email protected]>
    [Use supported_quirks/inapplicable_quirks to support both AMD and
     no-self-snoop cases, as well as to remove the shadow_memtype_mask check
     from kvm_mmu_may_ignore_guest_pat(). - Paolo]
    Signed-off-by: Paolo Bonzini <[email protected]>
    Stable-dep-of: e2ffe85b6d2b ("KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM [+ + +]

Author: Jim Mattson <[email protected]>
Date:   Mon Mar 16 13:20:03 2026 -0400

    KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM
    
    [ Upstream commit e2ffe85b6d2bb7780174b87aa4468a39be17eb81 ]
    
    Add KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM to allow L1 to set
    FREEZE_IN_SMM in vmcs12's GUEST_IA32_DEBUGCTL field, as permitted
    prior to commit 6b1dd26544d0 ("KVM: VMX: Preserve host's
    DEBUGCTLMSR_FREEZE_IN_SMM while running the guest").  Enable the quirk
    by default for backwards compatibility (like all quirks); userspace
    can disable it via KVM_CAP_DISABLE_QUIRKS2 for consistency with the
    constraints on WRMSR(IA32_DEBUGCTL).
    
    Note that the quirk only bypasses the consistency check.  The vmcs02 bit is
    still owned by the host, and PMCs are not frozen during virtualized SMM.
    In particular, if a host administrator decides that PMCs should not be
    frozen during physical SMM, then L1 has no say in the matter.
    
    Fixes: 095686e6fcb4 ("KVM: nVMX: Check vmcs12->guest_ia32_debugctl on nested VM-Enter")
    Cc: [email protected]
    Signed-off-by: Jim Mattson <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [sean: tag for stable@, clean-up and fix goofs in the comment and docs]
    Signed-off-by: Sean Christopherson <[email protected]>
    [Rename quirk. - Paolo]
    Signed-off-by: Paolo Bonzini <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Introduce supported_quirks to block disabling quirks [+ + +]

Author: Yan Zhao <[email protected]>
Date:   Mon Mar 16 13:20:00 2026 -0400

    KVM: x86: Introduce supported_quirks to block disabling quirks
    
    [ Upstream commit bd7d5362b4c4ac8b951385867a0fadfae0ba3c07 ]
    
    Introduce supported_quirks in kvm_caps to store platform-specific force-enabled
    quirks.
    
    No functional changes intended.
    
    Signed-off-by: Yan Zhao <[email protected]>
    Message-ID: <[email protected]>
    [Remove unsupported quirks at KVM_ENABLE_CAP time. - Paolo]
    Signed-off-by: Paolo Bonzini <[email protected]>
    Stable-dep-of: e2ffe85b6d2b ("KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Quirk initialization of feature MSRs to KVM's max configuration [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Mon Mar 16 13:19:57 2026 -0400

    KVM: x86: Quirk initialization of feature MSRs to KVM's max configuration
    
    [ Upstream commit dcb988cdac85bad177de86fbf409524eda4f9467 ]
    
    Add a quirk to control KVM's misguided initialization of select feature
    MSRs to KVM's max configuration, as enabling features by default violates
    KVM's approach of letting userspace own the vCPU model, and is actively
    problematic for MSRs that are conditionally supported, as the vCPU will
    end up with an MSR value that userspace can't restore.  E.g. if the vCPU
    is configured with PDCM=0, userspace will save and attempt to restore a
    non-zero PERF_CAPABILITIES, thanks to KVM's meddling.
    
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Stable-dep-of: e2ffe85b6d2b ("KVM: x86: Introduce KVM_X86_QUIRK_VMCS12_ALLOW_FREEZE_IN_SMM")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

lib/bootconfig: check bounds before writing in __xbc_open_brace() [+ + +]

Author: Josh Law <[email protected]>
Date:   Thu Mar 12 19:11:42 2026 +0000

    lib/bootconfig: check bounds before writing in __xbc_open_brace()
    
    commit 560f763baa0f2c9a44da4294c06af071405ac46f upstream.
    
    The bounds check for brace_index happens after the array write.
    While the current call pattern prevents an actual out-of-bounds
    access (the previous call would have returned an error), the
    write-before-check pattern is fragile and would become a real
    out-of-bounds write if the error return were ever not propagated.
    
    Move the bounds check before the array write so the function is
    self-contained and safe regardless of caller behavior.
    
    Link: https://lore.kernel.org/all/[email protected]/
    
    Fixes: ead1e19ad905 ("lib/bootconfig: Fix a bug of breaking existing tree nodes")
    Cc: [email protected]
    Signed-off-by: Josh Law <[email protected]>
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

lib/bootconfig: check xbc_init_node() return in override path [+ + +]

Author: Josh Law <[email protected]>
Date:   Thu Mar 19 08:43:05 2026 +0900

    lib/bootconfig: check xbc_init_node() return in override path
    
    [ Upstream commit bb288d7d869e86d382f35a0e26242c5ccb05ca82 ]
    
    The ':=' override path in xbc_parse_kv() calls xbc_init_node() to
    re-initialize an existing value node but does not check the return
    value. If xbc_init_node() fails (data offset out of range), parsing
    silently continues with stale node data.
    
    Add the missing error check to match the xbc_add_node() call path
    which already checks for failure.
    
    In practice, a bootconfig using ':=' to override a value near the
    32KB data limit could silently retain the old value, meaning a
    security-relevant boot parameter override (e.g., a trace filter or
    debug setting) would not take effect as intended.
    
    Link: https://lore.kernel.org/all/[email protected]/
    
    Fixes: e5efaeb8a8f5 ("bootconfig: Support mixing a value and subkeys under a key")
    Signed-off-by: Josh Law <[email protected]>
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

lib/bootconfig: fix off-by-one in xbc_verify_tree() unclosed brace error [+ + +]

Author: Josh Law <[email protected]>
Date:   Thu Mar 12 19:11:41 2026 +0000

    lib/bootconfig: fix off-by-one in xbc_verify_tree() unclosed brace error
    
    commit 39ebc8d7f561e1b64eca87353ef9b18e2825e591 upstream.
    
    __xbc_open_brace() pushes entries with post-increment
    (open_brace[brace_index++]), so brace_index always points one past
    the last valid entry.  xbc_verify_tree() reads open_brace[brace_index]
    to report which brace is unclosed, but this is one past the last
    pushed entry and contains stale/zero data, causing the error message
    to reference the wrong node.
    
    Use open_brace[brace_index - 1] to correctly identify the unclosed
    brace.  brace_index is known to be > 0 here since we are inside the
    if (brace_index) guard.
    
    Link: https://lore.kernel.org/all/[email protected]/
    
    Fixes: ead1e19ad905 ("lib/bootconfig: Fix a bug of breaking existing tree nodes")
    Cc: [email protected]
    Signed-off-by: Josh Law <[email protected]>
    Reviewed-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

lib/bootconfig: fix snprintf truncation check in xbc_node_compose_key_after() [+ + +]

Author: Josh Law <[email protected]>
Date:   Thu Mar 12 19:11:43 2026 +0000

    lib/bootconfig: fix snprintf truncation check in xbc_node_compose_key_after()
    
    commit 1120a36bb1e9b9e22de75ecb4ef0b998f73a97f1 upstream.
    
    snprintf() returns the number of characters that would have been
    written excluding the NUL terminator.  Output is truncated when the
    return value is >= the buffer size, not just > the buffer size.
    
    When ret == size, the current code takes the non-truncated path,
    advancing buf by ret and reducing size to 0.  This is wrong because
    the output was actually truncated (the last character was replaced by
    NUL).  Fix by using >= so the truncation path is taken correctly.
    
    Link: https://lore.kernel.org/all/[email protected]/
    
    Fixes: 76db5a27a827 ("bootconfig: Add Extra Boot Config support")
    Cc: [email protected]
    Signed-off-by: Josh Law <[email protected]>
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

libceph: admit message frames only in CEPH_CON_S_OPEN state [+ + +]

Author: Ilya Dryomov <[email protected]>
Date:   Sun Mar 8 17:57:23 2026 +0100

    libceph: admit message frames only in CEPH_CON_S_OPEN state
    
    commit a5a373705081d7cc6363e16990e2361b0b362314 upstream.
    
    Similar checks are performed for all control frames, but an early check
    for message frames was missing.  process_message() is already set up to
    terminate the loop in case the state changes while con->ops->dispatch()
    handler is being executed.
    
    Cc: [email protected]
    Signed-off-by: Ilya Dryomov <[email protected]>
    Reviewed-by: Alex Markuze <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

libceph: Fix potential out-of-bounds access in ceph_handle_auth_reply() [+ + +]

Author: Raphael Zimmer <[email protected]>
Date:   Tue Mar 10 15:28:15 2026 +0100

    libceph: Fix potential out-of-bounds access in ceph_handle_auth_reply()
    
    commit b282c43ed156ae15ea76748fc15cd5c39dc9ab72 upstream.
    
    This patch fixes an out-of-bounds access in ceph_handle_auth_reply()
    that can be triggered by a message of type CEPH_MSG_AUTH_REPLY. In
    ceph_handle_auth_reply(), the value of the payload_len field of such a
    message is stored in a variable of type int. A value greater than
    INT_MAX leads to an integer overflow and is interpreted as a negative
    value. This leads to decrementing the pointer address by this value and
    subsequently accessing it because ceph_decode_need() only checks that
    the memory access does not exceed the end address of the allocation.
    
    This patch fixes the issue by changing the data type of payload_len to
    u32. Additionally, the data type of result_msg_len is changed to u32,
    as it is also a variable holding a non-negative length.
    
    Also, an additional layer of sanity checks is introduced, ensuring that
    directly after reading it from the message, payload_len and
    result_msg_len are not greater than the overall segment length.
    
    BUG: KASAN: slab-out-of-bounds in ceph_handle_auth_reply+0x642/0x7a0 [libceph]
    Read of size 4 at addr ffff88811404df14 by task kworker/20:1/262
    
    CPU: 20 UID: 0 PID: 262 Comm: kworker/20:1 Not tainted 6.19.2 #5 PREEMPT(voluntary)
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    Workqueue: ceph-msgr ceph_con_workfn [libceph]
    Call Trace:
     <TASK>
     dump_stack_lvl+0x76/0xa0
     print_report+0xd1/0x620
     ? __pfx__raw_spin_lock_irqsave+0x10/0x10
     ? kasan_complete_mode_report_info+0x72/0x210
     kasan_report+0xe7/0x130
     ? ceph_handle_auth_reply+0x642/0x7a0 [libceph]
     ? ceph_handle_auth_reply+0x642/0x7a0 [libceph]
     __asan_report_load_n_noabort+0xf/0x20
     ceph_handle_auth_reply+0x642/0x7a0 [libceph]
     mon_dispatch+0x973/0x23d0 [libceph]
     ? apparmor_socket_recvmsg+0x6b/0xa0
     ? __pfx_mon_dispatch+0x10/0x10 [libceph]
     ? __kasan_check_write+0x14/0x30i
     ? mutex_unlock+0x7f/0xd0
     ? __pfx_mutex_unlock+0x10/0x10
     ? __pfx_do_recvmsg+0x10/0x10 [libceph]
     ceph_con_process_message+0x1f1/0x650 [libceph]
     process_message+0x1e/0x450 [libceph]
     ceph_con_v2_try_read+0x2e48/0x6c80 [libceph]
     ? __pfx_ceph_con_v2_try_read+0x10/0x10 [libceph]
     ? save_fpregs_to_fpstate+0xb0/0x230
     ? raw_spin_rq_unlock+0x17/0xa0
     ? finish_task_switch.isra.0+0x13b/0x760
     ? __switch_to+0x385/0xda0
     ? __kasan_check_write+0x14/0x30
     ? mutex_lock+0x8d/0xe0
     ? __pfx_mutex_lock+0x10/0x10
     ceph_con_workfn+0x248/0x10c0 [libceph]
     process_one_work+0x629/0xf80
     ? __kasan_check_write+0x14/0x30
     worker_thread+0x87f/0x1570
     ? __pfx__raw_spin_lock_irqsave+0x10/0x10
     ? __pfx_try_to_wake_up+0x10/0x10
     ? kasan_print_address_stack_frame+0x1f7/0x280
     ? __pfx_worker_thread+0x10/0x10
     kthread+0x396/0x830
     ? __pfx__raw_spin_lock_irq+0x10/0x10
     ? __pfx_kthread+0x10/0x10
     ? __kasan_check_write+0x14/0x30
     ? recalc_sigpending+0x180/0x210
     ? __pfx_kthread+0x10/0x10
     ret_from_fork+0x3f7/0x610
     ? __pfx_ret_from_fork+0x10/0x10
     ? __switch_to+0x385/0xda0
     ? __pfx_kthread+0x10/0x10
     ret_from_fork_asm+0x1a/0x30
     </TASK>
    
    [ idryomov: replace if statements with ceph_decode_need() for
      payload_len and result_msg_len ]
    
    Cc: [email protected]
    Signed-off-by: Raphael Zimmer <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Reviewed-by: Ilya Dryomov <[email protected]>
    Signed-off-by: Ilya Dryomov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

libceph: prevent potential out-of-bounds reads in process_message_header() [+ + +]

Author: Ilya Dryomov <[email protected]>
Date:   Sun Mar 8 17:38:00 2026 +0100

    libceph: prevent potential out-of-bounds reads in process_message_header()
    
    commit 69fb5d91bba44ecf7eb80530b85fa4fb028921d5 upstream.
    
    If the message frame is (maliciously) corrupted in a way that the
    length of the control segment ends up being less than the size of the
    message header or a different frame is made to look like a message
    frame, out-of-bounds reads may ensue in process_message_header().
    
    Perform an explicit bounds check before decoding the message header.
    
    Cc: [email protected]
    Reported-by: Raphael Zimmer <[email protected]>
    Signed-off-by: Ilya Dryomov <[email protected]>
    Reviewed-by: Alex Markuze <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

libceph: reject preamble if control segment is empty [+ + +]

Author: Ilya Dryomov <[email protected]>
Date:   Sun Mar 8 20:01:27 2026 +0100

    libceph: reject preamble if control segment is empty
    
    commit c4c22b846eceff05b1129b8844a80310e55a7f87 upstream.
    
    While head_onwire_len() has a branch to handle ctrl_len == 0 case,
    prepare_read_control() always sets up a kvec for the CRC meaning that
    a non-empty control segment is effectively assumed.  All frames that
    clients deal with meet that assumption, so let's make it official and
    treat the preamble with an empty control segment as malformed.
    
    Cc: [email protected]
    Signed-off-by: Ilya Dryomov <[email protected]>
    Reviewed-by: Alex Markuze <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

libceph: Use u32 for non-negative values in ceph_monmap_decode() [+ + +]

Author: Raphael Zimmer <[email protected]>
Date:   Thu Feb 26 16:07:31 2026 +0100

    libceph: Use u32 for non-negative values in ceph_monmap_decode()
    
    commit 770444611f047dbfd4517ec0bc1b179d40c2f346 upstream.
    
    This patch fixes unnecessary implicit conversions that change signedness
    of blob_len and num_mon in ceph_monmap_decode().
    Currently blob_len and num_mon are (signed) int variables. They are used
    to hold values that are always non-negative and get assigned in
    ceph_decode_32_safe(), which is meant to assign u32 values. Both
    variables are subsequently used as unsigned values, and the value of
    num_mon is further assigned to monmap->num_mon, which is of type u32.
    Therefore, both variables should be of type u32. This is especially
    relevant for num_mon. If the value read from the incoming message is
    very large, it is interpreted as a negative value, and the check for
    num_mon > CEPH_MAX_MON does not catch it. This leads to the attempt to
    allocate a very large chunk of memory for monmap, which will most likely
    fail. In this case, an unnecessary attempt to allocate memory is
    performed, and -ENOMEM is returned instead of -EINVAL.
    
    Cc: [email protected]
    Signed-off-by: Raphael Zimmer <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Reviewed-by: Ilya Dryomov <[email protected]>
    Signed-off-by: Ilya Dryomov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Linux: Linux 6.12.78 [+ + +]

Author: Greg Kroah-Hartman <[email protected]>
Date:   Wed Mar 25 11:08:58 2026 +0100

    Linux 6.12.78
    
    Link: https://lore.kernel.org/r/[email protected]
    Tested-by: Brett A C Sheffield <[email protected]>
    Tested-by: Peter Schneider <[email protected]>
    Tested-by: Pavel Machek (CIP) <[email protected]>
    Tested-by: Florian Fainelli <[email protected]>
    Tested-by: Salvatore Bonaccorso <[email protected]>
    Tested-by: Shuah Khan <[email protected]>
    Tested-by: Harshit Mogalapalli <[email protected]>
    Tested-by: Ron Economos <[email protected]>
    Tested-by: Francesco Dolcini <[email protected]>
    Tested-by: Jon Hunter <[email protected]>
    Tested-by: Mark Brown <[email protected]>
    Tested-by: Barry K. Nathan <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Give more information if kmem access failed [+ + +]

Author: Tiezhu Yang <[email protected]>
Date:   Mon Mar 16 10:36:01 2026 +0800

    LoongArch: Give more information if kmem access failed
    
    commit a47f0754bdd01f971c9715acdbdd3a07515c8f83 upstream.
    
    If memory access such as copy_{from, to}_kernel_nofault() failed, its
    users do not know what happened, so it is very useful to print the
    exception code for such cases. Furthermore, it is better to print the
    caller function to know where is the entry.
    
    Here are the low level call chains:
    
      copy_from_kernel_nofault()
        copy_from_kernel_nofault_loop()
          __get_kernel_nofault()
    
      copy_to_kernel_nofault()
        copy_to_kernel_nofault_loop()
          __put_kernel_nofault()
    
    Cc: [email protected]
    Signed-off-by: Tiezhu Yang <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mac80211: fix crash in ieee80211_chan_bw_change for AP_VLAN stations [+ + +]

Author: Felix Fietkau <[email protected]>
Date:   Thu Mar 5 17:08:12 2026 +0000

    mac80211: fix crash in ieee80211_chan_bw_change for AP_VLAN stations
    
    commit 672e5229e1ecfc2a3509b53adcb914d8b024a853 upstream.
    
    ieee80211_chan_bw_change() iterates all stations and accesses
    link->reserved.oper via sta->sdata->link[link_id]. For stations on
    AP_VLAN interfaces (e.g. 4addr WDS clients), sta->sdata points to
    the VLAN sdata, whose link never participates in chanctx reservations.
    This leaves link->reserved.oper zero-initialized with chan == NULL,
    causing a NULL pointer dereference in __ieee80211_sta_cap_rx_bw()
    when accessing chandef->chan->band during CSA.
    
    Resolve the VLAN sdata to its parent AP sdata using get_bss_sdata()
    before accessing link data.
    
    Cc: [email protected]
    Signed-off-by: Felix Fietkau <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [also change sta->sdata in ARRAY_SIZE even if it doesn't matter]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mctp: i2c: fix skb memory leak in receive path [+ + +]

Author: Haiyue Wang <[email protected]>
Date:   Thu Mar 5 22:32:34 2026 +0800

    mctp: i2c: fix skb memory leak in receive path
    
    [ Upstream commit e3f5e0f22cfc2371e7471c9fd5b4da78f9df7c69 ]
    
    When 'midev->allow_rx' is false, the newly allocated skb isn't consumed
    by netif_rx(), it needs to free the skb directly.
    
    Fixes: f5b8abf9fc3d ("mctp i2c: MCTP I2C binding driver")
    Signed-off-by: Haiyue Wang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mctp: route: hold key->lock in mctp_flow_prepare_output() [+ + +]

Author: Chengfeng Ye <[email protected]>
Date:   Fri Mar 6 03:14:02 2026 +0000

    mctp: route: hold key->lock in mctp_flow_prepare_output()
    
    [ Upstream commit 7d86aa41c073c4e7eb75fd2e674f1fd8f289728a ]
    
    mctp_flow_prepare_output() checks key->dev and may call
    mctp_dev_set_key(), but it does not hold key->lock while doing so.
    
    mctp_dev_set_key() and mctp_dev_release_key() are annotated with
    __must_hold(&key->lock), so key->dev access is intended to be
    serialized by key->lock. The mctp_sendmsg() transmit path reaches
    mctp_flow_prepare_output() via mctp_local_output() -> mctp_dst_output()
    without holding key->lock, so the check-and-set sequence is racy.
    
    Example interleaving:
    
      CPU0                                  CPU1
      ----                                  ----
      mctp_flow_prepare_output(key, devA)
        if (!key->dev)  // sees NULL
                                            mctp_flow_prepare_output(
                                                key, devB)
                                              if (!key->dev)  // still NULL
                                              mctp_dev_set_key(devB, key)
                                                mctp_dev_hold(devB)
                                                key->dev = devB
        mctp_dev_set_key(devA, key)
          mctp_dev_hold(devA)
          key->dev = devA   // overwrites devB
    
    Now both devA and devB references were acquired, but only the final
    key->dev value is tracked for release. One reference can be lost,
    causing a resource leak as mctp_dev_release_key() would only decrease
    the reference on one dev.
    
    Fix by taking key->lock around the key->dev check and
    mctp_dev_set_key() call.
    
    Fixes: 67737c457281 ("mctp: Pass flow data & flow release events to drivers")
    Signed-off-by: Chengfeng Ye <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

media: dvb-net: fix OOB access in ULE extension header tables [+ + +]

Author: Ariel Silver <[email protected]>
Date:   Sat Feb 21 15:26:00 2026 +0100

    media: dvb-net: fix OOB access in ULE extension header tables
    
    commit 24d87712727a5017ad142d63940589a36cd25647 upstream.
    
    The ule_mandatory_ext_handlers[] and ule_optional_ext_handlers[] tables
    in handle_one_ule_extension() are declared with 255 elements (valid
    indices 0-254), but the index htype is derived from network-controlled
    data as (ule_sndu_type & 0x00FF), giving a range of 0-255. When
    htype equals 255, an out-of-bounds read occurs on the function pointer
    table, and the OOB value may be called as a function pointer.
    
    Add a bounds check on htype against the array size before either table
    is accessed. Out-of-range values now cause the SNDU to be discarded.
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Reported-by: Ariel Silver <[email protected]>
    Signed-off-by: Ariel Silver <[email protected]>
    Cc: [email protected]
    Signed-off-by: Mauro Carvalho Chehab <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/kfence: disable KFENCE upon KASAN HW tags enablement [+ + +]

Author: Alexander Potapenko <[email protected]>
Date:   Fri Feb 13 10:54:10 2026 +0100

    mm/kfence: disable KFENCE upon KASAN HW tags enablement
    
    commit 09833d99db36d74456a4d13eb29c32d56ff8f2b6 upstream.
    
    KFENCE does not currently support KASAN hardware tags.  As a result, the
    two features are incompatible when enabled simultaneously.
    
    Given that MTE provides deterministic protection and KFENCE is a
    sampling-based debugging tool, prioritize the stronger hardware
    protections.  Disable KFENCE initialization and free the pre-allocated
    pool if KASAN hardware tags are detected to ensure the system maintains
    the security guarantees provided by MTE.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 0ce20dd84089 ("mm: add Kernel Electric-Fence infrastructure")
    Signed-off-by: Alexander Potapenko <[email protected]>
    Suggested-by: Marco Elver <[email protected]>
    Reviewed-by: Marco Elver <[email protected]>
    Cc: Andrey Konovalov <[email protected]>
    Cc: Andrey Ryabinin <[email protected]>
    Cc: Dmitry Vyukov <[email protected]>
    Cc: Ernesto Martinez Garcia <[email protected]>
    Cc: Greg KH <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/kfence: fix KASAN hardware tag faults during late enablement [+ + +]

Author: Alexander Potapenko <[email protected]>
Date:   Tue Mar 17 07:50:54 2026 -0400

    mm/kfence: fix KASAN hardware tag faults during late enablement
    
    [ Upstream commit d155aab90fffa00f93cea1f107aef0a3d548b2ff ]
    
    When KASAN hardware tags are enabled, re-enabling KFENCE late (via
    /sys/module/kfence/parameters/sample_interval) causes KASAN faults.
    
    This happens because the KFENCE pool and metadata are allocated via the
    page allocator, which tags the memory, while KFENCE continues to access it
    using untagged pointers during initialization.
    
    Use __GFP_SKIP_KASAN for late KFENCE pool and metadata allocations to
    ensure the memory remains untagged, consistent with early allocations from
    memblock.  To support this, add __GFP_SKIP_KASAN to the allowlist in
    __alloc_contig_verify_gfp_mask().
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 0ce20dd84089 ("mm: add Kernel Electric-Fence infrastructure")
    Signed-off-by: Alexander Potapenko <[email protected]>
    Suggested-by: Ernesto Martinez Garcia <[email protected]>
    Cc: Andrey Konovalov <[email protected]>
    Cc: Andrey Ryabinin <[email protected]>
    Cc: Dmitry Vyukov <[email protected]>
    Cc: Greg KH <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: Marco Elver <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/page_alloc: forward the gfp flags from alloc_contig_range() to post_alloc_hook() [+ + +]

Author: David Hildenbrand <[email protected]>
Date:   Tue Mar 17 07:50:53 2026 -0400

    mm/page_alloc: forward the gfp flags from alloc_contig_range() to post_alloc_hook()
    
    [ Upstream commit 7b755570064fcb9cde37afd48f6bc65151097ba7 ]
    
    In the __GFP_COMP case, we already pass the gfp_flags to
    prep_new_page()->post_alloc_hook().  However, in the !__GFP_COMP case, we
    essentially pass only hardcoded __GFP_MOVABLE to post_alloc_hook(),
    preventing some action modifiers from being effective..
    
    Let's pass our now properly adjusted gfp flags there as well.
    
    This way, we can now support __GFP_ZERO for alloc_contig_*().
    
    As a side effect, we now also support __GFP_SKIP_ZERO and__GFP_ZEROTAGS;
    but we'll keep the more special stuff (KASAN, NOLOCKDEP) disabled for now.
    
    It's worth noting that with __GFP_ZERO, we might unnecessarily zero pages
    when we have to release part of our range using free_contig_range() again.
    This can be optimized in the future, if ever required; the caller we'll
    be converting (powernv/memtrace) next won't trigger this.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: David Hildenbrand <[email protected]>
    Reviewed-by: Vlastimil Babka <[email protected]>
    Reviewed-by: Oscar Salvador <[email protected]>
    Cc: Christophe Leroy <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Cc: Naveen N Rao <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: Vishal Moola (Oracle) <[email protected]>
    Cc: Zi Yan <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Stable-dep-of: d155aab90fff ("mm/kfence: fix KASAN hardware tag faults during late enablement")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/page_alloc: move set_page_refcounted() to callers of post_alloc_hook() [+ + +]

Author: Matthew Wilcox (Oracle) <[email protected]>
Date:   Tue Mar 17 07:50:51 2026 -0400

    mm/page_alloc: move set_page_refcounted() to callers of post_alloc_hook()
    
    [ Upstream commit 8fd10a892a8db797fffb59a9a60bce23a56eef46 ]
    
    In preparation for allocating frozen pages, stop initialising the page
    refcount in post_alloc_hook().
    
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: Matthew Wilcox (Oracle) <[email protected]>
    Reviewed-by: Miaohe Lin <[email protected]>
    Reviewed-by: Zi Yan <[email protected]>
    Acked-by: David Hildenbrand <[email protected]>
    Reviewed-by: Vlastimil Babka <[email protected]>
    Cc: Hyeonggon Yoo <[email protected]>
    Cc: Mel Gorman <[email protected]>
    Cc: Muchun Song <[email protected]>
    Cc: William Kucharski <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Stable-dep-of: d155aab90fff ("mm/kfence: fix KASAN hardware tag faults during late enablement")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/page_alloc: sort out the alloc_contig_range() gfp flags mess [+ + +]

Author: David Hildenbrand <[email protected]>
Date:   Tue Mar 17 07:50:52 2026 -0400

    mm/page_alloc: sort out the alloc_contig_range() gfp flags mess
    
    [ Upstream commit f6037a4a686523dee1967ef7620349822e019ff8 ]
    
    It's all a bit complicated for alloc_contig_range().  For example, we
    don't support many flags, so let's start bailing out on unsupported ones
    -- ignoring the placement hints, as we are already given the range to
    allocate.
    
    While we currently set cc.gfp_mask, in __alloc_contig_migrate_range() we
    simply create yet another GFP mask whereby we ignore the reclaim flags
    specify by the caller.  That looks very inconsistent.
    
    Let's clean it up, constructing the gfp flags used for
    compaction/migration exactly once.  Update the documentation of the
    gfp_mask parameter for alloc_contig_range() and alloc_contig_pages().
    
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: David Hildenbrand <[email protected]>
    Acked-by: Zi Yan <[email protected]>
    Reviewed-by: Vlastimil Babka <[email protected]>
    Reviewed-by: Oscar Salvador <[email protected]>
    Cc: Christophe Leroy <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Cc: Naveen N Rao <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: Vishal Moola (Oracle) <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Stable-dep-of: d155aab90fff ("mm/kfence: fix KASAN hardware tag faults during late enablement")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/shmem, swap: avoid redundant Xarray lookup during swapin [+ + +]

Author: Kairui Song <[email protected]>
Date:   Mon Mar 23 02:43:31 2026 -0700

    mm/shmem, swap: avoid redundant Xarray lookup during swapin
    
    commit 0cfc0e7e3d062b93e9eec6828de000981cdfb152 upstream.
    
    Currently shmem calls xa_get_order to get the swap radix entry order,
    requiring a full tree walk.  This can be easily combined with the swap
    entry value checking (shmem_confirm_swap) to avoid the duplicated lookup
    and abort early if the entry is gone already.  Which should improve the
    performance.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: Kairui Song <[email protected]>
    Reviewed-by: Kemeng Shi <[email protected]>
    Reviewed-by: Dev Jain <[email protected]>
    Reviewed-by: Baolin Wang <[email protected]>
    Cc: Baoquan He <[email protected]>
    Cc: Barry Song <[email protected]>
    Cc: Chris Li <[email protected]>
    Cc: Hugh Dickins <[email protected]>
    Cc: Matthew Wilcox (Oracle) <[email protected]>
    Cc: Nhat Pham <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    
    Stable-dep-of: 8a1968bd997f ("mm/shmem, swap: fix race of truncate and swap entry split")
    [ hughd: removed series cover letter and skip_swapcache dependencies ]
    Signed-off-by: Hugh Dickins <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/shmem, swap: improve cached mTHP handling and fix potential hang [+ + +]

Author: Kairui Song <[email protected]>
Date:   Mon Mar 23 02:40:16 2026 -0700

    mm/shmem, swap: improve cached mTHP handling and fix potential hang
    
    commit 5c241ed8d031693dadf33dd98ed2e7cc363e9b66 upstream.
    
    The current swap-in code assumes that, when a swap entry in shmem mapping
    is order 0, its cached folios (if present) must be order 0 too, which
    turns out not always correct.
    
    The problem is shmem_split_large_entry is called before verifying the
    folio will eventually be swapped in, one possible race is:
    
        CPU1                          CPU2
    shmem_swapin_folio
    /* swap in of order > 0 swap entry S1 */
      folio = swap_cache_get_folio
      /* folio = NULL */
      order = xa_get_order
      /* order > 0 */
      folio = shmem_swap_alloc_folio
      /* mTHP alloc failure, folio = NULL */
      <... Interrupted ...>
                                     shmem_swapin_folio
                                     /* S1 is swapped in */
                                     shmem_writeout
                                     /* S1 is swapped out, folio cached */
      shmem_split_large_entry(..., S1)
      /* S1 is split, but the folio covering it has order > 0 now */
    
    Now any following swapin of S1 will hang: `xa_get_order` returns 0, and
    folio lookup will return a folio with order > 0.  The
    `xa_get_order(&mapping->i_pages, index) != folio_order(folio)` will always
    return false causing swap-in to return -EEXIST.
    
    And this looks fragile.  So fix this up by allowing seeing a larger folio
    in swap cache, and check the whole shmem mapping range covered by the
    swapin have the right swap value upon inserting the folio.  And drop the
    redundant tree walks before the insertion.
    
    This will actually improve performance, as it avoids two redundant Xarray
    tree walks in the hot path, and the only side effect is that in the
    failure path, shmem may redundantly reallocate a few folios causing
    temporary slight memory pressure.
    
    And worth noting, it may seems the order and value check before inserting
    might help reducing the lock contention, which is not true.  The swap
    cache layer ensures raced swapin will either see a swap cache folio or
    failed to do a swapin (we have SWAP_HAS_CACHE bit even if swap cache is
    bypassed), so holding the folio lock and checking the folio flag is
    already good enough for avoiding the lock contention.  The chance that a
    folio passes the swap entry value check but the shmem mapping slot has
    changed should be very low.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 809bc86517cc ("mm: shmem: support large folio swap out")
    Signed-off-by: Kairui Song <[email protected]>
    Reviewed-by: Kemeng Shi <[email protected]>
    Reviewed-by: Baolin Wang <[email protected]>
    Tested-by: Baolin Wang <[email protected]>
    Cc: Baoquan He <[email protected]>
    Cc: Barry Song <[email protected]>
    Cc: Chris Li <[email protected]>
    Cc: Hugh Dickins <[email protected]>
    Cc: Matthew Wilcox (Oracle) <[email protected]>
    Cc: Nhat Pham <[email protected]>
    Cc: Dev Jain <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    
    [ hughd: removed skip_swapcache dependencies ]
    Signed-off-by: Hugh Dickins <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/tracing: rss_stat: ensure curr is false from kthread context [+ + +]

Author: Kalesh Singh <[email protected]>
Date:   Thu Feb 19 15:36:56 2026 -0800

    mm/tracing: rss_stat: ensure curr is false from kthread context
    
    commit 079c24d5690262e83ee476e2a548e416f3237511 upstream.
    
    The rss_stat trace event allows userspace tools, like Perfetto [1], to
    inspect per-process RSS metric changes over time.
    
    The curr field was introduced to rss_stat in commit e4dcad204d3a
    ("rss_stat: add support to detect RSS updates of external mm").  Its
    intent is to indicate whether the RSS update is for the mm_struct of the
    current execution context; and is set to false when operating on a remote
    mm_struct (e.g., via kswapd or a direct reclaimer).
    
    However, an issue arises when a kernel thread temporarily adopts a user
    process's mm_struct.  Kernel threads do not have their own mm_struct and
    normally have current->mm set to NULL.  To operate on user memory, they
    can "borrow" a memory context using kthread_use_mm(), which sets
    current->mm to the user process's mm.
    
    This can be observed, for example, in the USB Function Filesystem (FFS)
    driver.  The ffs_user_copy_worker() handles AIO completions and uses
    kthread_use_mm() to copy data to a user-space buffer.  If a page fault
    occurs during this copy, the fault handler executes in the kthread's
    context.
    
    At this point, current is the kthread, but current->mm points to the user
    process's mm.  Since the rss_stat event (from the page fault) is for that
    same mm, the condition current->mm == mm becomes true, causing curr to be
    incorrectly set to true when the trace event is emitted.
    
    This is misleading because it suggests the mm belongs to the kthread,
    confusing userspace tools that track per-process RSS changes and
    corrupting their mm_id-to-process association.
    
    Fix this by ensuring curr is always false when the trace event is emitted
    from a kthread context by checking for the PF_KTHREAD flag.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://perfetto.dev/ [1]
    Fixes: e4dcad204d3a ("rss_stat: add support to detect RSS updates of external mm")
    Signed-off-by: Kalesh Singh <[email protected]>
    Acked-by: Zi Yan <[email protected]>
    Acked-by: SeongJae Park <[email protected]>
    Reviewed-by: Pedro Falcato <[email protected]>
    Cc: "David Hildenbrand (Arm)" <[email protected]>
    Cc: Joel Fernandes <[email protected]>
    Cc: Lorenzo Stoakes <[email protected]>
    Cc: Minchan Kim <[email protected]>
    Cc: Steven Rostedt <[email protected]>
    Cc: Suren Baghdasaryan <[email protected]>
    Cc: <[email protected]>    [5.10+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm: shmem: avoid unpaired folio_unlock() in shmem_swapin_folio() [+ + +]

Author: Kemeng Shi <[email protected]>
Date:   Mon Mar 23 02:37:35 2026 -0700

    mm: shmem: avoid unpaired folio_unlock() in shmem_swapin_folio()
    
    commit e08d5f515613a9860bfee7312461a19f422adb5e upstream.
    
    If we get a folio from swap_cache_get_folio() successfully but encounter a
    failure before the folio is locked, we will unlock the folio which was not
    previously locked.
    
    Put the folio and set it to NULL when a failure occurs before the folio is
    locked to fix the issue.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 058313515d5a ("mm: shmem: fix potential data corruption during shmem swapin")
    Signed-off-by: Kemeng Shi <[email protected]>
    Reviewed-by: Baolin Wang <[email protected]>
    Reviewed-by: Kairui Song <[email protected]>
    Cc: Hugh Dickins <[email protected]>
    Cc: kernel test robot <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    
    [ hughd: removed series cover letter comments ]
    Signed-off-by: Hugh Dickins <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm: shmem: fix potential data corruption during shmem swapin [+ + +]

Author: Baolin Wang <[email protected]>
Date:   Mon Mar 23 02:34:19 2026 -0700

    mm: shmem: fix potential data corruption during shmem swapin
    
    commit 058313515d5aab10d0a01dd634f92ed4a4e71d4c upstream.
    
    Alex and Kairui reported some issues (system hang or data corruption) when
    swapping out or swapping in large shmem folios.  This is especially easy
    to reproduce when the tmpfs is mount with the 'huge=within_size'
    parameter.  Thanks to Kairui's reproducer, the issue can be easily
    replicated.
    
    The root cause of the problem is that swap readahead may asynchronously
    swap in order 0 folios into the swap cache, while the shmem mapping can
    still store large swap entries.  Then an order 0 folio is inserted into
    the shmem mapping without splitting the large swap entry, which overwrites
    the original large swap entry, leading to data corruption.
    
    When getting a folio from the swap cache, we should split the large swap
    entry stored in the shmem mapping if the orders do not match, to fix this
    issue.
    
    Link: https://lkml.kernel.org/r/2fe47c557e74e9df5fe2437ccdc6c9115fa1bf70.1740476943.git.baolin.wang@linux.alibaba.com
    Fixes: 809bc86517cc ("mm: shmem: support large folio swap out")
    Signed-off-by: Baolin Wang <[email protected]>
    Reported-by: Alex Xu (Hello71) <[email protected]>
    Reported-by: Kairui Song <[email protected]>
    Closes: https://lore.kernel.org/all/1738717785.im3r5g2vxc.none@localhost/
    Tested-by: Kairui Song <[email protected]>
    Cc: David Hildenbrand <[email protected]>
    Cc: Lance Yang <[email protected]>
    Cc: Matthew Wilcow <[email protected]>
    Cc: Hugh Dickins <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    
    [ hughd: removed skip_swapcache dependencies ]
    Signed-off-by: Hugh Dickins <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm: thp: deny THP for files on anonymous inodes [+ + +]

Author: Deepanshu Kartikey <[email protected]>
Date:   Sat Feb 14 05:45:35 2026 +0530

    mm: thp: deny THP for files on anonymous inodes
    
    commit dd085fe9a8ebfc5d10314c60452db38d2b75e609 upstream.
    
    file_thp_enabled() incorrectly allows THP for files on anonymous inodes
    (e.g. guest_memfd and secretmem). These files are created via
    alloc_file_pseudo(), which does not call get_write_access() and leaves
    inode->i_writecount at 0. Combined with S_ISREG(inode->i_mode) being
    true, they appear as read-only regular files when
    CONFIG_READ_ONLY_THP_FOR_FS is enabled, making them eligible for THP
    collapse.
    
    Anonymous inodes can never pass the inode_is_open_for_write() check
    since their i_writecount is never incremented through the normal VFS
    open path. The right thing to do is to exclude them from THP eligibility
    altogether, since CONFIG_READ_ONLY_THP_FOR_FS was designed for real
    filesystem files (e.g. shared libraries), not for pseudo-filesystem
    inodes.
    
    For guest_memfd, this allows khugepaged and MADV_COLLAPSE to create
    large folios in the page cache via the collapse path, but the
    guest_memfd fault handler does not support large folios. This triggers
    WARN_ON_ONCE(folio_test_large(folio)) in kvm_gmem_fault_user_mapping().
    
    For secretmem, collapse_file() tries to copy page contents through the
    direct map, but secretmem pages are removed from the direct map. This
    can result in a kernel crash:
    
        BUG: unable to handle page fault for address: ffff88810284d000
        RIP: 0010:memcpy_orig+0x16/0x130
        Call Trace:
         collapse_file
         hpage_collapse_scan_file
         madvise_collapse
    
    Secretmem is not affected by the crash on upstream as the memory failure
    recovery handles the failed copy gracefully, but it still triggers
    confusing false memory failure reports:
    
        Memory failure: 0x106d96f: recovery action for clean unevictable
        LRU page: Recovered
    
    Check IS_ANON_FILE(inode) in file_thp_enabled() to deny THP for all
    anonymous inode files.
    
    Link: https://syzkaller.appspot.com/bug?extid=33a04338019ac7e43a44
    Link: https://lore.kernel.org/linux-mm/CAEvNRgHegcz3ro35ixkDw39ES8=U6rs6S7iP0gkR9enr7HoGtA@mail.gmail.com
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 7fbb5e188248 ("mm: remove VM_EXEC requirement for THP eligibility")
    Signed-off-by: Deepanshu Kartikey <[email protected]>
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=33a04338019ac7e43a44
    Tested-by: [email protected]
    Tested-by: Lance Yang <[email protected]>
    Acked-by: David Hildenbrand (Arm) <[email protected]>
    Reviewed-by: Barry Song <[email protected]>
    Reviewed-by: Ackerley Tng <[email protected]>
    Tested-by: Ackerley Tng <[email protected]>
    Reviewed-by: Lorenzo Stoakes <[email protected]>
    Cc: Baolin Wang <[email protected]>
    Cc: Dev Jain <[email protected]>
    Cc: Fangrui Song <[email protected]>
    Cc: Liam Howlett <[email protected]>
    Cc: Nico Pache <[email protected]>
    Cc: Ryan Roberts <[email protected]>
    Cc: Yang Shi <[email protected]>
    Cc: Zi Yan <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    [ Ackerley: we don't have IS_ANON_FILE() yet. As guest_memfd does
      not apply yet, simply check for secretmem explicitly. ]
    Signed-off-by: Ackerley Tng <[email protected]>
    Reviewed-by: David Hildenbrand (Arm) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: core: Avoid bitfield RMW for claim/retune flags [+ + +]

Author: Penghe Geng <[email protected]>
Date:   Thu Feb 19 15:29:54 2026 -0500

    mmc: core: Avoid bitfield RMW for claim/retune flags
    
    commit 901084c51a0a8fb42a3f37d2e9c62083c495f824 upstream.
    
    Move claimed and retune control flags out of the bitfield word to
    avoid unrelated RMW side effects in asynchronous contexts.
    
    The host->claimed bit shared a word with retune flags. Writes to claimed
    in __mmc_claim_host() or retune_now in mmc_mq_queue_rq() can overwrite
    other bits when concurrent updates happen in other contexts, triggering
    spurious WARN_ON(!host->claimed). Convert claimed, can_retune,
    retune_now and retune_paused to bool to remove shared-word coupling.
    
    Fixes: 6c0cedd1ef952 ("mmc: core: Introduce host claiming by context")
    Fixes: 1e8e55b67030c ("mmc: block: Add CQE support")
    Cc: [email protected]
    Suggested-by: Adrian Hunter <[email protected]>
    Signed-off-by: Penghe Geng <[email protected]>
    Acked-by: Adrian Hunter <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: dw_mmc-rockchip: Add memory clock auto-gating support [+ + +]

Author: Shawn Lin <[email protected]>
Date:   Tue Mar 17 08:17:17 2026 -0400

    mmc: dw_mmc-rockchip: Add memory clock auto-gating support
    
    [ Upstream commit ff6f0286c896f062853552097220dd93961be9c4 ]
    
    Per design recommendations, the memory clock can be gated when there
    is no in-flight transfer, which helps save power. This feature is
    introduced alongside internal phase support, and this patch enables it.
    
    Signed-off-by: Shawn Lin <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Stable-dep-of: 6465a8bbb0f6 ("mmc: dw_mmc-rockchip: Fix runtime PM support for internal phase support")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: dw_mmc-rockchip: Fix runtime PM support for internal phase support [+ + +]

Author: Shawn Lin <[email protected]>
Date:   Tue Mar 17 08:17:18 2026 -0400

    mmc: dw_mmc-rockchip: Fix runtime PM support for internal phase support
    
    [ Upstream commit 6465a8bbb0f6ad98aeb66dc9ea19c32c193a610b ]
    
    RK3576 is the first platform to introduce internal phase support, and
    subsequent platforms are expected to adopt a similar design. In this
    architecture, runtime suspend powers off the attached power domain, which
    resets registers, including vendor-specific ones such as SDMMC_TIMING_CON0,
    SDMMC_TIMING_CON1, and SDMMC_MISC_CON. These registers must be saved and
    restored, a requirement that falls outside the scope of the dw_mmc core.
    
    Fixes: 59903441f5e4 ("mmc: dw_mmc-rockchip: Add internal phase support")
    Signed-off-by: Shawn Lin <[email protected]>
    Tested-by: Marco Schirrmeister <[email protected]>
    Reviewed-by: Heiko Stuebner <[email protected]>
    Cc: [email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: dw_mmc-rockchip: use modern PM macros [+ + +]

Author: Jisheng Zhang <[email protected]>
Date:   Tue Mar 17 08:17:16 2026 -0400

    mmc: dw_mmc-rockchip: use modern PM macros
    
    [ Upstream commit 4b43f2bcc84dd550c1a847318db02165d2829573 ]
    
    Use the modern PM macros for the suspend and resume functions to be
    automatically dropped by the compiler when CONFIG_PM or
    CONFIG_PM_SLEEP are disabled, without having to use #ifdef guards.
    
    This has the advantage of always compiling these functions in,
    independently of any Kconfig option. Thanks to that, bugs and other
    regressions are subsequently easier to catch.
    
    Signed-off-by: Jisheng Zhang <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Stable-dep-of: 6465a8bbb0f6 ("mmc: dw_mmc-rockchip: Fix runtime PM support for internal phase support")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: mmci: Fix device_node reference leak in of_get_dml_pipe_index() [+ + +]

Author: Felix Gu <[email protected]>
Date:   Tue Jan 20 22:26:46 2026 +0800

    mmc: mmci: Fix device_node reference leak in of_get_dml_pipe_index()
    
    commit af12e64ae0661546e8b4f5d30d55c5f53a11efe7 upstream.
    
    When calling of_parse_phandle_with_args(), the caller is responsible
    to call of_node_put() to release the reference of device node.
    In of_get_dml_pipe_index(), it does not release the reference.
    
    Fixes: 9cb15142d0e3 ("mmc: mmci: Add qcom dml support to the driver.")
    Signed-off-by: Felix Gu <[email protected]>
    Cc: [email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: sdhci-pci-gli: fix GL9750 DMA write corruption [+ + +]

Author: Matthew Schwartz <[email protected]>
Date:   Mon Mar 2 13:07:17 2026 -0800

    mmc: sdhci-pci-gli: fix GL9750 DMA write corruption
    
    commit 2b76e0cc7803e5ab561c875edaba7f6bbd87fbb0 upstream.
    
    The GL9750 SD host controller has intermittent data corruption during
    DMA write operations. The GM_BURST register's R_OSRC_Lmt field
    (bits 17:16), which limits outstanding DMA read requests from system
    memory, is not being cleared during initialization. The Windows driver
    sets R_OSRC_Lmt to zero, limiting requests to the smallest unit.
    
    Clear R_OSRC_Lmt to match the Windows driver behavior. This eliminates
    write corruption verified with f3write/f3read tests while maintaining
    DMA performance.
    
    Cc: [email protected]
    Fixes: e51df6ce668a ("mmc: host: sdhci-pci: Add Genesys Logic GL975x support")
    Closes: https://lore.kernel.org/linux-mmc/[email protected]/
    Acked-by: Adrian Hunter <[email protected]>
    Signed-off-by: Matthew Schwartz <[email protected]>
    Reviewed-by: Ben Chuang <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: sdhci: fix timing selection for 1-bit bus width [+ + +]

Author: Luke Wang <[email protected]>
Date:   Wed Mar 11 17:50:06 2026 +0800

    mmc: sdhci: fix timing selection for 1-bit bus width
    
    commit 5e3486e64094c28a526543f1e8aa0d5964b7f02d upstream.
    
    When 1-bit bus width is used with HS200/HS400 capabilities set,
    mmc_select_hs200() returns 0 without actually switching. This
    causes mmc_select_timing() to skip mmc_select_hs(), leaving eMMC
    in legacy mode (26MHz) instead of High Speed SDR (52MHz).
    
    Per JEDEC eMMC spec section 5.3.2, 1-bit mode supports High Speed
    SDR. Drop incompatible HS200/HS400/UHS/DDR caps early so timing
    selection falls through to mmc_select_hs() correctly.
    
    Fixes: f2119df6b764 ("mmc: sd: add support for signal voltage switch procedure")
    Signed-off-by: Luke Wang <[email protected]>
    Acked-by: Adrian Hunter <[email protected]>
    Cc: [email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mpls: add missing unregister_netdevice_notifier to mpls_init [+ + +]

Author: Sabrina Dubroca <[email protected]>
Date:   Wed Mar 11 23:35:09 2026 +0100

    mpls: add missing unregister_netdevice_notifier to mpls_init
    
    [ Upstream commit 99600f79b28c83c68bae199a3d8e95049a758308 ]
    
    If mpls_init() fails after registering mpls_dev_notifier, it never
    gets removed. Add the missing unregister_netdevice_notifier() call to
    the error handling path.
    
    Fixes: 5be2062e3080 ("mpls: Handle error of rtnl_register_module().")
    Signed-off-by: Sabrina Dubroca <[email protected]>
    Link: https://patch.msgid.link/7c55363c4f743d19e2306204a134407c90a69bbb.1773228081.git.sd@queasysnail.net
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mptcp: pm: avoid sending RM_ADDR over same subflow [+ + +]

Author: Matthieu Baerts (NGI0) <[email protected]>
Date:   Mon Mar 9 10:57:40 2026 -0400

    mptcp: pm: avoid sending RM_ADDR over same subflow
    
    [ Upstream commit fb8d0bccb221080630efcd9660c9f9349e53cc9e ]
    
    RM_ADDR are sent over an active subflow, the first one in the subflows
    list. There is then a high chance the initial subflow is picked. With
    the in-kernel PM, when an endpoint is removed, a RM_ADDR is sent, then
    linked subflows are closed. This is done for each active MPTCP
    connection.
    
    MPTCP endpoints are likely removed because the attached network is no
    longer available or usable. In this case, it is better to avoid sending
    this RM_ADDR over the subflow that is going to be removed, but prefer
    sending it over another active and non stale subflow, if any.
    
    This modification avoids situations where the other end is not notified
    when a subflow is no longer usable: typically when the endpoint linked
    to the initial subflow is removed, especially on the server side.
    
    Fixes: 8dd5efb1f91b ("mptcp: send ack for rm_addr")
    Cc: [email protected]
    Reported-by: Frank Lorenz <[email protected]>
    Closes: https://github.com/multipath-tcp/mptcp_net-next/issues/612
    Reviewed-by: Mat Martineau <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20260303-net-mptcp-misc-fixes-7-0-rc2-v1-2-4b5462b6f016@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    [ adapted to _nl-prefixed function names in pm_netlink.c and omitted stale subflow fallback ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mptcp: pm: in-kernel: always mark signal+subflow endp as used [+ + +]

Author: Matthieu Baerts (NGI0) <[email protected]>
Date:   Mon Mar 9 11:38:46 2026 -0400

    mptcp: pm: in-kernel: always mark signal+subflow endp as used
    
    [ Upstream commit 579a752464a64cb5f9139102f0e6b90a1f595ceb ]
    
    Syzkaller managed to find a combination of actions that was generating
    this warning:
    
      msk->pm.local_addr_used == 0
      WARNING: net/mptcp/pm_kernel.c:1071 at __mark_subflow_endp_available net/mptcp/pm_kernel.c:1071 [inline], CPU#1: syz.2.17/961
      WARNING: net/mptcp/pm_kernel.c:1071 at mptcp_nl_remove_subflow_and_signal_addr net/mptcp/pm_kernel.c:1103 [inline], CPU#1: syz.2.17/961
      WARNING: net/mptcp/pm_kernel.c:1071 at mptcp_pm_nl_del_addr_doit+0x81d/0x8f0 net/mptcp/pm_kernel.c:1210, CPU#1: syz.2.17/961
      Modules linked in:
      CPU: 1 UID: 0 PID: 961 Comm: syz.2.17 Not tainted 6.19.0-08368-gfafda3b4b06b #22 PREEMPT(full)
      Hardware name: QEMU Ubuntu 25.10 PC v2 (i440FX + PIIX, + 10.1 machine, 1996), BIOS 1.17.0-debian-1.17.0-1build1 04/01/2014
      RIP: 0010:__mark_subflow_endp_available net/mptcp/pm_kernel.c:1071 [inline]
      RIP: 0010:mptcp_nl_remove_subflow_and_signal_addr net/mptcp/pm_kernel.c:1103 [inline]
      RIP: 0010:mptcp_pm_nl_del_addr_doit+0x81d/0x8f0 net/mptcp/pm_kernel.c:1210
      Code: 89 c5 e8 46 30 6f fe e9 21 fd ff ff 49 83 ed 80 e8 38 30 6f fe 4c 89 ef be 03 00 00 00 e8 db 49 df fe eb ac e8 24 30 6f fe 90 <0f> 0b 90 e9 1d ff ff ff e8 16 30 6f fe eb 05 e8 0f 30 6f fe e8 9a
      RSP: 0018:ffffc90001663880 EFLAGS: 00010293
      RAX: ffffffff82de1a6c RBX: 0000000000000000 RCX: ffff88800722b500
      RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
      RBP: ffff8880158b22d0 R08: 0000000000010425 R09: ffffffffffffffff
      R10: ffffffff82de18ba R11: 0000000000000000 R12: ffff88800641a640
      R13: ffff8880158b1880 R14: ffff88801ec3c900 R15: ffff88800641a650
      FS:  00005555722c3500(0000) GS:ffff8880f909d000(0000) knlGS:0000000000000000
      CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      CR2: 00007f66346e0f60 CR3: 000000001607c000 CR4: 0000000000350ef0
      Call Trace:
       <TASK>
       genl_family_rcv_msg_doit+0x117/0x180 net/netlink/genetlink.c:1115
       genl_family_rcv_msg net/netlink/genetlink.c:1195 [inline]
       genl_rcv_msg+0x3a8/0x3f0 net/netlink/genetlink.c:1210
       netlink_rcv_skb+0x16d/0x240 net/netlink/af_netlink.c:2550
       genl_rcv+0x28/0x40 net/netlink/genetlink.c:1219
       netlink_unicast_kernel net/netlink/af_netlink.c:1318 [inline]
       netlink_unicast+0x3e9/0x4c0 net/netlink/af_netlink.c:1344
       netlink_sendmsg+0x4aa/0x5b0 net/netlink/af_netlink.c:1894
       sock_sendmsg_nosec net/socket.c:727 [inline]
       __sock_sendmsg+0xc9/0xf0 net/socket.c:742
       ____sys_sendmsg+0x272/0x3b0 net/socket.c:2592
       ___sys_sendmsg+0x2de/0x320 net/socket.c:2646
       __sys_sendmsg net/socket.c:2678 [inline]
       __do_sys_sendmsg net/socket.c:2683 [inline]
       __se_sys_sendmsg net/socket.c:2681 [inline]
       __x64_sys_sendmsg+0x110/0x1a0 net/socket.c:2681
       do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
       do_syscall_64+0x143/0x440 arch/x86/entry/syscall_64.c:94
       entry_SYSCALL_64_after_hwframe+0x77/0x7f
      RIP: 0033:0x7f66346f826d
      Code: ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 e8 ff ff ff f7 d8 64 89 01 48
      RSP: 002b:00007ffc83d8bdc8 EFLAGS: 00000246 ORIG_RAX: 000000000000002e
      RAX: ffffffffffffffda RBX: 00007f6634985fa0 RCX: 00007f66346f826d
      RDX: 00000000040000b0 RSI: 0000200000000740 RDI: 0000000000000007
      RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
      R10: 0000000000000000 R11: 0000000000000246 R12: 00007f6634985fa8
      R13: 00007f6634985fac R14: 0000000000000000 R15: 0000000000001770
       </TASK>
    
    The actions that caused that seem to be:
    
     - Set the MPTCP subflows limit to 0
     - Create an MPTCP endpoint with both the 'signal' and 'subflow' flags
     - Create a new MPTCP connection from a different address: an ADD_ADDR
       linked to the MPTCP endpoint will be sent ('signal' flag), but no
       subflows is initiated ('subflow' flag)
     - Remove the MPTCP endpoint
    
    In this case, msk->pm.local_addr_used has been kept to 0 -- because no
    subflows have been created -- but the corresponding bit in
    msk->pm.id_avail_bitmap has been cleared when the ADD_ADDR has been
    sent. This later causes a splat when removing the MPTCP endpoint because
    msk->pm.local_addr_used has been kept to 0.
    
    Now, if an endpoint has both the signal and subflow flags, but it is not
    possible to create subflows because of the limits or the c-flag case,
    then the local endpoint counter is still incremented: the endpoint is
    used at the end. This avoids issues later when removing the endpoint and
    calling __mark_subflow_endp_available(), which expects
    msk->pm.local_addr_used to have been previously incremented if the
    endpoint was marked as used according to msk->pm.id_avail_bitmap.
    
    Note that signal_and_subflow variable is reset to false when the limits
    and the c-flag case allows subflows creation. Also, local_addr_used is
    only incremented for non ID0 subflows.
    
    Fixes: 85df533a787b ("mptcp: pm: do not ignore 'subflow' if 'signal' flag is also set")
    Cc: [email protected]
    Closes: https://github.com/multipath-tcp/mptcp_net-next/issues/613
    Reviewed-by: Mat Martineau <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20260303-net-mptcp-misc-fixes-7-0-rc2-v1-4-4b5462b6f016@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    [ pm_kernel.c => pm_netlink.c ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mptcp: pm: in-kernel: always set ID as avail when rm endp [+ + +]

Author: Matthieu Baerts (NGI0) <[email protected]>
Date:   Fri Mar 6 18:48:14 2026 +0100

    mptcp: pm: in-kernel: always set ID as avail when rm endp
    
    commit d191101dee25567c2af3b28565f45346c33d65f5 upstream.
    
    Syzkaller managed to find a combination of actions that was generating
    this warning:
    
      WARNING: net/mptcp/pm_kernel.c:1074 at __mark_subflow_endp_available net/mptcp/pm_kernel.c:1074 [inline], CPU#1: syz.7.48/2535
      WARNING: net/mptcp/pm_kernel.c:1074 at mptcp_pm_nl_fullmesh net/mptcp/pm_kernel.c:1446 [inline], CPU#1: syz.7.48/2535
      WARNING: net/mptcp/pm_kernel.c:1074 at mptcp_pm_nl_set_flags_all net/mptcp/pm_kernel.c:1474 [inline], CPU#1: syz.7.48/2535
      WARNING: net/mptcp/pm_kernel.c:1074 at mptcp_pm_nl_set_flags+0x5de/0x640 net/mptcp/pm_kernel.c:1538, CPU#1: syz.7.48/2535
      Modules linked in:
      CPU: 1 UID: 0 PID: 2535 Comm: syz.7.48 Not tainted 6.18.0-03987-gea5f5e676cf5 #17 PREEMPT(voluntary)
      Hardware name: QEMU Ubuntu 25.10 PC (i440FX + PIIX, 1996), BIOS 1.17.0-debian-1.17.0-1 04/01/2014
      RIP: 0010:__mark_subflow_endp_available net/mptcp/pm_kernel.c:1074 [inline]
      RIP: 0010:mptcp_pm_nl_fullmesh net/mptcp/pm_kernel.c:1446 [inline]
      RIP: 0010:mptcp_pm_nl_set_flags_all net/mptcp/pm_kernel.c:1474 [inline]
      RIP: 0010:mptcp_pm_nl_set_flags+0x5de/0x640 net/mptcp/pm_kernel.c:1538
      Code: 89 c7 e8 c5 8c 73 fe e9 f7 fd ff ff 49 83 ef 80 e8 b7 8c 73 fe 4c 89 ff be 03 00 00 00 e8 4a 29 e3 fe eb ac e8 a3 8c 73 fe 90 <0f> 0b 90 e9 3d ff ff ff e8 95 8c 73 fe b8 a1 ff ff ff eb 1a e8 89
      RSP: 0018:ffffc9001535b820 EFLAGS: 00010287
      netdevsim0: tun_chr_ioctl cmd 1074025677
      RAX: ffffffff82da294d RBX: 0000000000000001 RCX: 0000000000080000
      RDX: ffffc900096d0000 RSI: 00000000000006d6 RDI: 00000000000006d7
      netdevsim0: linktype set to 823
      RBP: ffff88802cdb2240 R08: 00000000000104ae R09: ffffffffffffffff
      R10: ffffffff82da27d4 R11: 0000000000000000 R12: 0000000000000000
      R13: ffff88801246d8c0 R14: ffffc9001535b8b8 R15: ffff88802cdb1800
      FS:  00007fc6ac5a76c0(0000) GS:ffff8880f90c8000(0000) knlGS:0000000000000000
      netlink: 'syz.3.50': attribute type 5 has an invalid length.
      CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      netlink: 1232 bytes leftover after parsing attributes in process `syz.3.50'.
      CR2: 0000200000010000 CR3: 0000000025b1a000 CR4: 0000000000350ef0
      Call Trace:
       <TASK>
       mptcp_pm_set_flags net/mptcp/pm_netlink.c:277 [inline]
       mptcp_pm_nl_set_flags_doit+0x1d7/0x210 net/mptcp/pm_netlink.c:282
       genl_family_rcv_msg_doit+0x117/0x180 net/netlink/genetlink.c:1115
       genl_family_rcv_msg net/netlink/genetlink.c:1195 [inline]
       genl_rcv_msg+0x3a8/0x3f0 net/netlink/genetlink.c:1210
       netlink_rcv_skb+0x16d/0x240 net/netlink/af_netlink.c:2550
       genl_rcv+0x28/0x40 net/netlink/genetlink.c:1219
       netlink_unicast_kernel net/netlink/af_netlink.c:1318 [inline]
       netlink_unicast+0x3e9/0x4c0 net/netlink/af_netlink.c:1344
       netlink_sendmsg+0x4ab/0x5b0 net/netlink/af_netlink.c:1894
       sock_sendmsg_nosec net/socket.c:718 [inline]
       __sock_sendmsg+0xc9/0xf0 net/socket.c:733
       ____sys_sendmsg+0x272/0x3b0 net/socket.c:2608
       ___sys_sendmsg+0x2de/0x320 net/socket.c:2662
       __sys_sendmsg net/socket.c:2694 [inline]
       __do_sys_sendmsg net/socket.c:2699 [inline]
       __se_sys_sendmsg net/socket.c:2697 [inline]
       __x64_sys_sendmsg+0x110/0x1a0 net/socket.c:2697
       do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
       do_syscall_64+0xed/0x360 arch/x86/entry/syscall_64.c:94
       entry_SYSCALL_64_after_hwframe+0x77/0x7f
      RIP: 0033:0x7fc6adb66f6d
      Code: ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 e8 ff ff ff f7 d8 64 89 01 48
      RSP: 002b:00007fc6ac5a6ff8 EFLAGS: 00000246 ORIG_RAX: 000000000000002e
      RAX: ffffffffffffffda RBX: 00007fc6addf5fa0 RCX: 00007fc6adb66f6d
      RDX: 0000000000048084 RSI: 00002000000002c0 RDI: 000000000000000e
      RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
      R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
      netlink: 'syz.5.51': attribute type 2 has an invalid length.
      R13: 00007fff25e91fe0 R14: 00007fc6ac5a7ce4 R15: 00007fff25e920d7
       </TASK>
    
    The actions that caused that seem to be:
    
     - Create an MPTCP endpoint for address A without any flags
     - Create a new MPTCP connection from address A
     - Remove the MPTCP endpoint: the corresponding subflows will be removed
     - Recreate the endpoint with the same ID, but with the subflow flag
     - Change the same endpoint to add the fullmesh flag
    
    In this case, msk->pm.local_addr_used has been kept to 0 as expected,
    but the corresponding bit in msk->pm.id_avail_bitmap was still unset
    after having removed the endpoint, causing the splat later on.
    
    When removing an endpoint, the corresponding endpoint ID was only marked
    as available for "signal" types with an announced address, plus all
    "subflow" types, but not the other types like an endpoint corresponding
    to the initial subflow. In these cases, re-creating an endpoint with the
    same ID didn't signal/create anything. Here, adding the fullmesh flag
    was creating the splat when calling __mark_subflow_endp_available() from
    mptcp_pm_nl_fullmesh(), because msk->pm.local_addr_used was set to 0
    while the ID was marked as used.
    
    To fix this issue, the corresponding bit in msk->pm.id_avail_bitmap can
    always be set as available when removing an MPTCP in-kernel endpoint. In
    other words, moving the call to __set_bit() to do it in all cases,
    except for "subflow" types where this bit is handled in a dedicated
    helper.
    
    Note: instead of adding a new spin_(un)lock_bh that would be taken in
    all cases, do all the actions requiring the spin lock under the same
    block.
    
    This modification potentially fixes another issue reported by syzbot,
    see [1]. But without a reproducer or more details about what exactly
    happened before, it is hard to confirm.
    
    Fixes: e255683c06df ("mptcp: pm: re-using ID of unused removed ADD_ADDR")
    Cc: [email protected]
    Closes: https://github.com/multipath-tcp/mptcp_net-next/issues/606
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/[email protected] [1]
    Reviewed-by: Mat Martineau <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20260205-net-mptcp-misc-fixes-6-19-rc8-v2-1-c2720ce75c34@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    [ Conflict in pm_netlink.c, because commit 8617e85e04bd ("mptcp: pm:
      split in-kernel PM specific code") is not in this version, and move
      code from pm_netlink.c to pm_kernel.c. Also, commit 636113918508
      ("mptcp: pm: remove '_nl' from mptcp_pm_nl_rm_addr_received") renamed
      mptcp_pm_nl_rm_subflow_received() to mptcp_pm_rm_subflow(). Apart from
      that, the same patch can be applied in pm_netlink.c. ]
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: Avoid boot crash in RedBoot partition table parser [+ + +]

Author: Finn Thain <[email protected]>
Date:   Mon Feb 16 18:01:30 2026 +1100

    mtd: Avoid boot crash in RedBoot partition table parser
    
    commit 8e2f8020270af7777d49c2e7132260983e4fc566 upstream.
    
    Given CONFIG_FORTIFY_SOURCE=y and a recent compiler,
    commit 439a1bcac648 ("fortify: Use __builtin_dynamic_object_size() when
    available") produces the warning below and an oops.
    
        Searching for RedBoot partition table in 50000000.flash at offset 0x7e0000
        ------------[ cut here ]------------
        WARNING: lib/string_helpers.c:1035 at 0xc029e04c, CPU#0: swapper/0/1
        memcmp: detected buffer overflow: 15 byte read of buffer size 14
        Modules linked in:
        CPU: 0 UID: 0 PID: 1 Comm: swapper/0 Not tainted 6.19.0 #1 NONE
    
    As Kees said, "'names' is pointing to the final 'namelen' many bytes
    of the allocation ... 'namelen' could be basically any length at all.
    This fortify warning looks legit to me -- this code used to be reading
    beyond the end of the allocation."
    
    Since the size of the dynamic allocation is calculated with strlen()
    we can use strcmp() instead of memcmp() and remain within bounds.
    
    Cc: Kees Cook <[email protected]>
    Cc: [email protected]
    Cc: [email protected]
    Link: https://lore.kernel.org/all/202602151911.AD092DFFCD@keescook/
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Suggested-by: Kees Cook <[email protected]>
    Signed-off-by: Finn Thain <[email protected]>
    Signed-off-by: Miquel Raynal <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: rawnand: brcmnand: skip DMA during panic write [+ + +]

Author: Kamal Dasu <[email protected]>
Date:   Thu Mar 5 15:21:57 2026 -0500

    mtd: rawnand: brcmnand: skip DMA during panic write
    
    [ Upstream commit da9ba4dcc01e7cf52b7676f0ee9607b8358c2171 ]
    
    When oops_panic_write is set, the driver disables interrupts and
    switches to PIO polling mode but still falls through into the DMA
    path. DMA cannot be used reliably in panic context, so make the
    DMA path an else branch to ensure only PIO is used during panic
    writes.
    
    Fixes: c1ac2dc34b51 ("mtd: rawnand: brcmnand: When oops in progress use pio and interrupt polling")
    Signed-off-by: Kamal Dasu <[email protected]>
    Reviewed-by: William Zhang <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Signed-off-by: Miquel Raynal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mtd: rawnand: cadence: Fix error check for dma_alloc_coherent() in cadence_nand_init() [+ + +]

Author: Chen Ni <[email protected]>
Date:   Mon Feb 9 15:56:18 2026 +0800

    mtd: rawnand: cadence: Fix error check for dma_alloc_coherent() in cadence_nand_init()
    
    commit 0410e1a4c545c769c59c6eda897ad5d574d0c865 upstream.
    
    Fix wrong variable used for error checking after dma_alloc_coherent()
    call. The function checks cdns_ctrl->dma_cdma_desc instead of
    cdns_ctrl->cdma_desc, which could lead to incorrect error handling.
    
    Fixes: ec4ba01e894d ("mtd: rawnand: Add new Cadence NAND driver to MTD subsystem")
    Cc: [email protected]
    Signed-off-by: Chen Ni <[email protected]>
    Reviewed-by: Alok Tiwari <[email protected]>
    Signed-off-by: Miquel Raynal <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: rawnand: pl353: make sure optimal timings are applied [+ + +]

Author: Olivier Sobrie <[email protected]>
Date:   Tue Mar 17 18:18:07 2026 +0100

    mtd: rawnand: pl353: make sure optimal timings are applied
    
    commit b9465b04de4b90228de03db9a1e0d56b00814366 upstream.
    
    Timings of the nand are adjusted by pl35x_nfc_setup_interface() but
    actually applied by the pl35x_nand_select_target() function.
    If there is only one nand chip, the pl35x_nand_select_target() will only
    apply the timings once since the test at its beginning will always be true
    after the first call to this function. As a result, the hardware will
    keep using the default timings set at boot to detect the nand chip, not
    the optimal ones.
    
    With this patch, we program directly the new timings when
    pl35x_nfc_setup_interface() is called.
    
    Fixes: 08d8c62164a3 ("mtd: rawnand: pl353: Add support for the ARM PL353 SMC NAND controller")
    Signed-off-by: Olivier Sobrie <[email protected]>
    Cc: [email protected]
    Signed-off-by: Miquel Raynal <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: rawnand: serialize lock/unlock against other NAND operations [+ + +]

Author: Kamal Dasu <[email protected]>
Date:   Thu Mar 5 14:49:06 2026 -0500

    mtd: rawnand: serialize lock/unlock against other NAND operations
    
    [ Upstream commit bab2bc6e850a697a23b9e5f0e21bb8c187615e95 ]
    
    nand_lock() and nand_unlock() call into chip->ops.lock_area/unlock_area
    without holding the NAND device lock. On controllers that implement
    SET_FEATURES via multiple low-level PIO commands, these can race with
    concurrent UBI/UBIFS background erase/write operations that hold the
    device lock, resulting in cmd_pending conflicts on the NAND controller.
    
    Add nand_get_device()/nand_release_device() around the lock/unlock
    operations to serialize them against all other NAND controller access.
    
    Fixes: 92270086b7e5 ("mtd: rawnand: Add support for manufacturer specific lock/unlock operation")
    Signed-off-by: Kamal Dasu <[email protected]>
    Reviewed-by: William Zhang <[email protected]>
    Signed-off-by: Miquel Raynal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mtd: spi-nor: core: avoid odd length/address reads on 8D-8D-8D mode [+ + +]

Author: Pratyush Yadav <[email protected]>
Date:   Tue Jul 8 17:16:45 2025 +0800

    mtd: spi-nor: core: avoid odd length/address reads on 8D-8D-8D mode
    
    commit f156b23df6a84efb2f6686156be94d4988568954 upstream.
    
    On Octal DTR capable flashes like Micron Xcella reads cannot start or
    end at an odd address in Octal DTR mode. Extra bytes need to be read at
    the start or end to make sure both the start address and length remain
    even.
    
    To avoid allocating too much extra memory, thereby putting unnecessary
    memory pressure on the system, the temporary buffer containing the extra
    padding bytes is capped at PAGE_SIZE bytes. The rest of the 2-byte
    aligned part should be read directly in the main buffer.
    
    Signed-off-by: Pratyush Yadav <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Luke Wang <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: spi-nor: core: avoid odd length/address writes in 8D-8D-8D mode [+ + +]

Author: Pratyush Yadav <[email protected]>
Date:   Tue Jul 8 17:16:46 2025 +0800

    mtd: spi-nor: core: avoid odd length/address writes in 8D-8D-8D mode
    
    commit 17926cd770ec837ed27d9856cf07f2da8dda4131 upstream.
    
    On Octal DTR capable flashes like Micron Xcella the writes cannot start
    or end at an odd address in Octal DTR mode. Extra 0xff bytes need to be
    appended or prepended to make sure the start address and end address are
    even. 0xff is used because on NOR flashes a program operation can only
    flip bits from 1 to 0, not the other way round. 0 to 1 flip needs to
    happen via erases.
    
    Signed-off-by: Pratyush Yadav <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Luke Wang <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/mlx5: Fix crash when moving to switchdev mode [+ + +]

Author: Patrisious Haddad <[email protected]>
Date:   Thu Mar 5 16:26:30 2026 +0200

    net/mlx5: Fix crash when moving to switchdev mode
    
    [ Upstream commit 24b2795f9683e092dc22a68f487e7aaaf2ddafea ]
    
    When moving to switchdev mode when the device doesn't support IPsec,
    we try to clean up the IPsec resources anyway which causes the crash
    below, fix that by correctly checking for IPsec support before trying
    to clean up its resources.
    
    [27642.515799] WARNING: arch/x86/mm/fault.c:1276 at
    do_user_addr_fault+0x18a/0x680, CPU#4: devlink/6490
    [27642.517159] Modules linked in: xt_conntrack xt_MASQUERADE
    ip6table_nat ip6table_filter ip6_tables iptable_nat nf_nat xt_addrtype
    rpcsec_gss_krb5 auth_rpcgss oid_registry overlay mlx5_fwctl nfnetlink
    zram zsmalloc mlx5_ib fuse rpcrdma rdma_ucm ib_uverbs ib_iser libiscsi
    scsi_transport_iscsi ib_umad rdma_cm ib_ipoib iw_cm ib_cm mlx5_core
    ib_core
    [27642.521358] CPU: 4 UID: 0 PID: 6490 Comm: devlink Not tainted
    6.19.0-rc5_for_upstream_min_debug_2026_01_14_16_47 #1 NONE
    [27642.522923] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS
    rel-1.16.3-0-ga6ed6b701f0a-prebuilt.qemu.org 04/01/2014
    [27642.524528] RIP: 0010:do_user_addr_fault+0x18a/0x680
    [27642.525362] Code: ff 0f 84 75 03 00 00 48 89 ee 4c 89 e7 e8 5e b9 22
    00 49 89 c0 48 85 c0 0f 84 a8 02 00 00 f7 c3 60 80 00 00 74 22 31 c9 eb
       ae <0f> 0b 48 83 c4 10 48 89 ea 48 89 de 4c 89 f7 5b 5d 41 5c 41 5d
    41
    [27642.528166] RSP: 0018:ffff88810770f6b8 EFLAGS: 00010046
    [27642.529038] RAX: 0000000000000000 RBX: 0000000000000002 RCX:
    ffff88810b980f00
    [27642.530158] RDX: 00000000000000a0 RSI: 0000000000000002 RDI:
    ffff88810770f728
    [27642.531270] RBP: 00000000000000a0 R08: 0000000000000000 R09:
    0000000000000000
    [27642.532383] R10: 0000000000000000 R11: 0000000000000000 R12:
    ffff888103f3c4c0
    [27642.533499] R13: 0000000000000000 R14: ffff88810770f728 R15:
    0000000000000000
    [27642.534614] FS:  00007f197c741740(0000) GS:ffff88856a94c000(0000)
    knlGS:0000000000000000
    [27642.535915] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [27642.536858] CR2: 00000000000000a0 CR3: 000000011334c003 CR4:
    0000000000172eb0
    [27642.537982] Call Trace:
    [27642.538466]  <TASK>
    [27642.538907]  exc_page_fault+0x76/0x140
    [27642.539583]  asm_exc_page_fault+0x22/0x30
    [27642.540282] RIP: 0010:_raw_spin_lock_irqsave+0x10/0x30
    [27642.541134] Code: 07 85 c0 75 11 ba ff 00 00 00 f0 0f b1 17 75 06 b8
    01 00 00 00 c3 31 c0 c3 90 0f 1f 44 00 00 53 9c 5b fa 31 c0 ba 01 00 00
       00 <f0> 0f b1 17 75 05 48 89 d8 5b c3 89 c6 e8 7e 02 00 00 48 89 d8
          5b
    [27642.543936] RSP: 0018:ffff88810770f7d8 EFLAGS: 00010046
    [27642.544803] RAX: 0000000000000000 RBX: 0000000000000202 RCX:
    ffff888113ad96d8
    [27642.545916] RDX: 0000000000000001 RSI: ffff88810770f818 RDI:
    00000000000000a0
    [27642.547027] RBP: 0000000000000098 R08: 0000000000000400 R09:
    ffff88810b980f00
    [27642.548140] R10: 0000000000000001 R11: ffff888101845a80 R12:
    00000000000000a8
    [27642.549263] R13: ffffffffa02a9060 R14: 00000000000000a0 R15:
    ffff8881130d8a40
    [27642.550379]  complete_all+0x20/0x90
    [27642.551010]  mlx5e_ipsec_disable_events+0xb6/0xf0 [mlx5_core]
    [27642.552022]  mlx5e_nic_disable+0x12d/0x220 [mlx5_core]
    [27642.552929]  mlx5e_detach_netdev+0x66/0xf0 [mlx5_core]
    [27642.553822]  mlx5e_netdev_change_profile+0x5b/0x120 [mlx5_core]
    [27642.554821]  mlx5e_vport_rep_load+0x419/0x590 [mlx5_core]
    [27642.555757]  ? xa_load+0x53/0x90
    [27642.556361]  __esw_offloads_load_rep+0x54/0x70 [mlx5_core]
    [27642.557328]  mlx5_esw_offloads_rep_load+0x45/0xd0 [mlx5_core]
    [27642.558320]  esw_offloads_enable+0xb4b/0xc90 [mlx5_core]
    [27642.559247]  mlx5_eswitch_enable_locked+0x34e/0x4f0 [mlx5_core]
    [27642.560257]  ? mlx5_rescan_drivers_locked+0x222/0x2d0 [mlx5_core]
    [27642.561284]  mlx5_devlink_eswitch_mode_set+0x5ac/0x9c0 [mlx5_core]
    [27642.562334]  ? devlink_rate_set_ops_supported+0x21/0x3a0
    [27642.563220]  devlink_nl_eswitch_set_doit+0x67/0xe0
    [27642.564026]  genl_family_rcv_msg_doit+0xe0/0x130
    [27642.564816]  genl_rcv_msg+0x183/0x290
    [27642.565466]  ? __devlink_nl_pre_doit.isra.0+0x160/0x160
    [27642.566329]  ? devlink_nl_eswitch_get_doit+0x290/0x290
    [27642.567181]  ? devlink_nl_pre_doit_parent_dev_optional+0x20/0x20
    [27642.568147]  ? genl_family_rcv_msg_dumpit+0xf0/0xf0
    [27642.568966]  netlink_rcv_skb+0x4b/0xf0
    [27642.569629]  genl_rcv+0x24/0x40
    [27642.570215]  netlink_unicast+0x255/0x380
    [27642.570901]  ? __alloc_skb+0xfa/0x1e0
    [27642.571560]  netlink_sendmsg+0x1f3/0x420
    [27642.572249]  __sock_sendmsg+0x38/0x60
    [27642.572911]  __sys_sendto+0x119/0x180
    [27642.573561]  ? __sys_recvmsg+0x5c/0xb0
    [27642.574227]  __x64_sys_sendto+0x20/0x30
    [27642.574904]  do_syscall_64+0x55/0xc10
    [27642.575554]  entry_SYSCALL_64_after_hwframe+0x4b/0x53
    [27642.576391] RIP: 0033:0x7f197c85e807
    [27642.577050] Code: c7 c0 ff ff ff ff eb be 66 2e 0f 1f 84 00 00 00 00
    00 90 f3 0f 1e fa 80 3d 45 08 0d 00 00 41 89 ca 74 10 b8 2c 00 00 00 0f
       05 <48> 3d 00 f0 ff ff 77 69 c3 55 48 89 e5 53 48 83 ec 38 44 89 4d
          d0
    [27642.579846] RSP: 002b:00007ffebd4e2248 EFLAGS: 00000202 ORIG_RAX:
    000000000000002c
    [27642.581082] RAX: ffffffffffffffda RBX: 000055cfcd9cd2a0 RCX:
    00007f197c85e807
    [27642.582200] RDX: 0000000000000038 RSI: 000055cfcd9cd490 RDI:
    0000000000000003
    [27642.583320] RBP: 00007ffebd4e2290 R08: 00007f197c942200 R09:
    000000000000000c
    [27642.584437] R10: 0000000000000000 R11: 0000000000000202 R12:
    0000000000000000
    [27642.585555] R13: 000055cfcd9cd490 R14: 00007ffebd4e45d1 R15:
    000055cfcd9cd2a0
    [27642.586671]  </TASK>
    [27642.587121] ---[ end trace 0000000000000000 ]---
    [27642.587910] BUG: kernel NULL pointer dereference, address:
    00000000000000a0
    
    Fixes: 664f76be38a1 ("net/mlx5: Fix IPsec cleanup over MPV device")
    Signed-off-by: Patrisious Haddad <[email protected]>
    Reviewed-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: Fix deadlock between devlink lock and esw->wq [+ + +]

Author: Cosmin Ratiu <[email protected]>
Date:   Thu Mar 5 10:10:19 2026 +0200

    net/mlx5: Fix deadlock between devlink lock and esw->wq
    
    [ Upstream commit aed763abf0e905b4b8d747d1ba9e172961572f57 ]
    
    esw->work_queue executes esw_functions_changed_event_handler ->
    esw_vfs_changed_event_handler and acquires the devlink lock.
    
    .eswitch_mode_set (acquires devlink lock in devlink_nl_pre_doit) ->
    mlx5_devlink_eswitch_mode_set -> mlx5_eswitch_disable_locked ->
    mlx5_eswitch_event_handler_unregister -> flush_workqueue deadlocks
    when esw_vfs_changed_event_handler executes.
    
    Fix that by no longer flushing the work to avoid the deadlock, and using
    a generation counter to keep track of work relevance. This avoids an old
    handler manipulating an esw that has undergone one or more mode changes:
    - the counter is incremented in mlx5_eswitch_event_handler_unregister.
    - the counter is read and passed to the ephemeral mlx5_host_work struct.
    - the work handler takes the devlink lock and bails out if the current
      generation is different than the one it was scheduled to operate on.
    - mlx5_eswitch_cleanup does the final draining before destroying the wq.
    
    No longer flushing the workqueue has the side effect of maybe no longer
    cancelling pending vport_change_handler work items, but that's ok since
    those are disabled elsewhere:
    - mlx5_eswitch_disable_locked disables the vport eq notifier.
    - mlx5_esw_vport_disable disarms the HW EQ notification and marks
      vport->enabled under state_lock to false to prevent pending vport
      handler from doing anything.
    - mlx5_eswitch_cleanup destroys the workqueue and makes sure all events
      are disabled/finished.
    
    Fixes: f1bc646c9a06 ("net/mlx5: Use devl_ API in mlx5_esw_offloads_devlink_port_register")
    Signed-off-by: Cosmin Ratiu <[email protected]>
    Reviewed-by: Moshe Shemesh <[email protected]>
    Reviewed-by: Dragos Tatulea <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: IFC updates for disabled host PF [+ + +]

Author: Daniel Jurgens <[email protected]>
Date:   Wed Jul 9 15:41:07 2025 +0300

    net/mlx5: IFC updates for disabled host PF
    
    [ Upstream commit cd1746cb6555a2238c4aae9f9d60b637a61bf177 ]
    
    The port 2 host PF can be disabled, this bit reflects that setting.
    
    Signed-off-by: Daniel Jurgens <[email protected]>
    Reviewed-by: William Tu <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Leon Romanovsky <[email protected]>
    Stable-dep-of: aed763abf0e9 ("net/mlx5: Fix deadlock between devlink lock and esw->wq")
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: qos: Restrict RTNL area to avoid a lock cycle [+ + +]

Author: Cosmin Ratiu <[email protected]>
Date:   Mon Mar 16 11:46:01 2026 +0200

    net/mlx5: qos: Restrict RTNL area to avoid a lock cycle
    
    [ Upstream commit b7e3a5d9c0d66b7fb44f63aef3bd734821afa0c8 ]
    
    A lock dependency cycle exists where:
    1. mlx5_ib_roce_init -> mlx5_core_uplink_netdev_event_replay ->
    mlx5_blocking_notifier_call_chain (takes notifier_rwsem) ->
    mlx5e_mdev_notifier_event -> mlx5_netdev_notifier_register ->
    register_netdevice_notifier_dev_net (takes rtnl)
    => notifier_rwsem -> rtnl
    
    2. mlx5e_probe -> _mlx5e_probe ->
    mlx5_core_uplink_netdev_set (takes uplink_netdev_lock) ->
    mlx5_blocking_notifier_call_chain (takes notifier_rwsem)
    => uplink_netdev_lock -> notifier_rwsem
    
    3: devlink_nl_rate_set_doit -> devlink_nl_rate_set ->
    mlx5_esw_devlink_rate_leaf_tx_max_set -> esw_qos_devlink_rate_to_mbps ->
    mlx5_esw_qos_max_link_speed_get (takes rtnl) ->
    mlx5_esw_qos_lag_link_speed_get_locked ->
    mlx5_uplink_netdev_get (takes uplink_netdev_lock)
    => rtnl -> uplink_netdev_lock
    => BOOM! (lock cycle)
    
    Fix that by restricting the rtnl-protected section to just the necessary
    part, the call to netdev_master_upper_dev_get and speed querying, so
    that the last lock dependency is avoided and the cycle doesn't close.
    This is safe because mlx5_uplink_netdev_get uses netdev_hold to keep the
    uplink netdev alive while its master device is queried.
    
    Use this opportunity to rename the ambiguously-named "hold_rtnl_lock"
    argument to "take_rtnl" and remove the "_locked" suffix from
    mlx5_esw_qos_lag_link_speed_get_locked.
    
    Fixes: 6b4be64fd9fe ("net/mlx5e: Harden uplink netdev access against device unbind")
    Signed-off-by: Cosmin Ratiu <[email protected]>
    Reviewed-by: Dragos Tatulea <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: Query to see if host PF is disabled [+ + +]

Author: Daniel Jurgens <[email protected]>
Date:   Wed Aug 13 22:19:55 2025 +0300

    net/mlx5: Query to see if host PF is disabled
    
    [ Upstream commit 9e84de72aef9bcf0e751a0bff3ac91b0cf52366f ]
    
    The host PF can be disabled, query firmware to check if the host PF of
    this function exists.
    
    Signed-off-by: Daniel Jurgens <[email protected]>
    Reviewed-by: William Tu <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: aed763abf0e9 ("net/mlx5: Fix deadlock between devlink lock and esw->wq")
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5e: Fix DMA FIFO desync on error CQE SQ recovery [+ + +]

Author: Gal Pressman <[email protected]>
Date:   Thu Mar 5 16:26:32 2026 +0200

    net/mlx5e: Fix DMA FIFO desync on error CQE SQ recovery
    
    [ Upstream commit 1633111d69053512d099658d4a05fc736fab36b0 ]
    
    In case of a TX error CQE, a recovery flow is triggered,
    mlx5e_reset_txqsq_cc_pc() resets dma_fifo_cc to 0 but not dma_fifo_pc,
    desyncing the DMA FIFO producer and consumer.
    
    After recovery, the producer pushes new DMA entries at the old
    dma_fifo_pc, while the consumer reads from position 0.
    This causes us to unmap stale DMA addresses from before the recovery.
    
    The DMA FIFO is a purely software construct with no HW counterpart.
    At the point of reset, all WQEs have been flushed so dma_fifo_cc is
    already equal to dma_fifo_pc. There is no need to reset either counter,
    similar to how skb_fifo pc/cc are untouched.
    
    Remove the 'dma_fifo_cc = 0' reset.
    
    This fixes the following WARNING:
        WARNING: CPU: 0 PID: 0 at drivers/iommu/dma-iommu.c:1240 iommu_dma_unmap_page+0x79/0x90
        Modules linked in: mlx5_vdpa vringh vdpa bonding mlx5_ib mlx5_vfio_pci ipip mlx5_fwctl tunnel4 mlx5_core ib_ipoib geneve ip6_gre ip_gre gre nf_tables ip6_tunnel rdma_ucm ib_uverbs ib_umad vfio_pci vfio_pci_core act_mirred act_skbedit act_vlan vhost_net vhost tap ip6table_mangle ip6table_nat ip6table_filter ip6_tables iptable_mangle cls_matchall nfnetlink_cttimeout act_gact cls_flower sch_ingress vhost_iotlb iptable_raw tunnel6 vfio_iommu_type1 vfio openvswitch nsh rpcsec_gss_krb5 auth_rpcgss oid_registry xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink iptable_nat nf_nat xt_addrtype br_netfilter overlay zram zsmalloc rpcrdma ib_iser libiscsi scsi_transport_iscsi rdma_cm iw_cm ib_cm ib_core fuse [last unloaded: nf_tables]
        CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.13.0-rc5_for_upstream_min_debug_2024_12_30_21_33 #1
        Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.13.0-0-gf21b5a4aeb02-prebuilt.qemu.org 04/01/2014
        RIP: 0010:iommu_dma_unmap_page+0x79/0x90
        Code: 2b 4d 3b 21 72 26 4d 3b 61 08 73 20 49 89 d8 44 89 f9 5b 4c 89 f2 4c 89 e6 48 89 ef 5d 41 5c 41 5d 41 5e 41 5f e9 c7 ae 9e ff <0f> 0b 5b 5d 41 5c 41 5d 41 5e 41 5f c3 66 2e 0f 1f 84 00 00 00 00
        Call Trace:
         <IRQ>
         ? __warn+0x7d/0x110
         ? iommu_dma_unmap_page+0x79/0x90
         ? report_bug+0x16d/0x180
         ? handle_bug+0x4f/0x90
         ? exc_invalid_op+0x14/0x70
         ? asm_exc_invalid_op+0x16/0x20
         ? iommu_dma_unmap_page+0x79/0x90
         ? iommu_dma_unmap_page+0x2e/0x90
         dma_unmap_page_attrs+0x10d/0x1b0
         mlx5e_tx_wi_dma_unmap+0xbe/0x120 [mlx5_core]
         mlx5e_poll_tx_cq+0x16d/0x690 [mlx5_core]
         mlx5e_napi_poll+0x8b/0xac0 [mlx5_core]
         __napi_poll+0x24/0x190
         net_rx_action+0x32a/0x3b0
         ? mlx5_eq_comp_int+0x7e/0x270 [mlx5_core]
         ? notifier_call_chain+0x35/0xa0
         handle_softirqs+0xc9/0x270
         irq_exit_rcu+0x71/0xd0
         common_interrupt+0x7f/0xa0
         </IRQ>
         <TASK>
         asm_common_interrupt+0x22/0x40
    
    Fixes: db75373c91b0 ("net/mlx5e: Recover Send Queue (SQ) from error state")
    Signed-off-by: Gal Pressman <[email protected]>
    Reviewed-by: Dragos Tatulea <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5e: Fix race condition during IPSec ESN update [+ + +]

Author: Jianbo Liu <[email protected]>
Date:   Mon Mar 16 11:46:03 2026 +0200

    net/mlx5e: Fix race condition during IPSec ESN update
    
    [ Upstream commit beb6e2e5976a128b0cccf10d158124422210c5ef ]
    
    In IPSec full offload mode, the device reports an ESN (Extended
    Sequence Number) wrap event to the driver. The driver validates this
    event by querying the IPSec ASO and checking that the esn_event_arm
    field is 0x0, which indicates an event has occurred. After handling
    the event, the driver must re-arm the context by setting esn_event_arm
    back to 0x1.
    
    A race condition exists in this handling path. After validating the
    event, the driver calls mlx5_accel_esp_modify_xfrm() to update the
    kernel's xfrm state. This function temporarily releases and
    re-acquires the xfrm state lock.
    
    So, need to acknowledge the event first by setting esn_event_arm to
    0x1. This prevents the driver from reprocessing the same ESN update if
    the hardware sends events for other reason. Since the next ESN update
    only occurs after nearly 2^31 packets are received, there's no risk of
    missing an update, as it will happen long after this handling has
    finished.
    
    Processing the event twice causes the ESN high-order bits (esn_msb) to
    be incremented incorrectly. The driver then programs the hardware with
    this invalid ESN state, which leads to anti-replay failures and a
    complete halt of IPSec traffic.
    
    Fix this by re-arming the ESN event immediately after it is validated,
    before calling mlx5_accel_esp_modify_xfrm(). This ensures that any
    spurious, duplicate events are correctly ignored, closing the race
    window.
    
    Fixes: fef06678931f ("net/mlx5e: Fix ESN update kernel panic")
    Signed-off-by: Jianbo Liu <[email protected]>
    Reviewed-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5e: Prevent concurrent access to IPSec ASO context [+ + +]

Author: Jianbo Liu <[email protected]>
Date:   Mon Mar 16 11:46:02 2026 +0200

    net/mlx5e: Prevent concurrent access to IPSec ASO context
    
    [ Upstream commit 99b36850d881e2d65912b2520a1c80d0fcc9429a ]
    
    The query or updating IPSec offload object is through Access ASO WQE.
    The driver uses a single mlx5e_ipsec_aso struct for each PF, which
    contains a shared DMA-mapped context for all ASO operations.
    
    A race condition exists because the ASO spinlock is released before
    the hardware has finished processing WQE. If a second operation is
    initiated immediately after, it overwrites the shared context in the
    DMA area.
    
    When the first operation's completion is processed later, it reads
    this corrupted context, leading to unexpected behavior and incorrect
    results.
    
    This commit fixes the race by introducing a private context within
    each IPSec offload object. The shared ASO context is now copied to
    this private context while the ASO spinlock is held. Subsequent
    processing uses this saved, per-object context, ensuring its integrity
    is maintained.
    
    Fixes: 1ed78fc03307 ("net/mlx5e: Update IPsec soft and hard limits")
    Signed-off-by: Jianbo Liu <[email protected]>
    Reviewed-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/rose: fix NULL pointer dereference in rose_transmit_link on reconnect [+ + +]

Author: Jiayuan Chen <[email protected]>
Date:   Wed Mar 11 15:06:02 2026 +0800

    net/rose: fix NULL pointer dereference in rose_transmit_link on reconnect
    
    [ Upstream commit e1f0a18c9564cdb16523c802e2c6fe5874e3d944 ]
    
    syzkaller reported a bug [1], and the reproducer is available at [2].
    
    ROSE sockets use four sk->sk_state values: TCP_CLOSE, TCP_LISTEN,
    TCP_SYN_SENT, and TCP_ESTABLISHED. rose_connect() already rejects
    calls for TCP_ESTABLISHED (-EISCONN) and TCP_CLOSE with SS_CONNECTING
    (-ECONNREFUSED), but lacks a check for TCP_SYN_SENT.
    
    When rose_connect() is called a second time while the first connection
    attempt is still in progress (TCP_SYN_SENT), it overwrites
    rose->neighbour via rose_get_neigh(). If that returns NULL, the socket
    is left with rose->state == ROSE_STATE_1 but rose->neighbour == NULL.
    When the socket is subsequently closed, rose_release() sees
    ROSE_STATE_1 and calls rose_write_internal() ->
    rose_transmit_link(skb, NULL), causing a NULL pointer dereference.
    
    Per connect(2), a second connect() while a connection is already in
    progress should return -EALREADY. Add this missing check for
    TCP_SYN_SENT to complete the state validation in rose_connect().
    
    [1] https://syzkaller.appspot.com/bug?extid=d00f90e0af54102fb271
    [2] https://gist.github.com/mrpre/9e6779e0d13e2c66779b1653fef80516
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/T/
    Suggested-by: Eric Dumazet <[email protected]>
    Signed-off-by: Jiayuan Chen <[email protected]>
    Reviewed-by: Eric Dumazet <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/sched: act_gate: snapshot parameters with RCU on replace [+ + +]

Author: Paul Moses <[email protected]>
Date:   Thu Mar 12 13:37:37 2026 -0400

    net/sched: act_gate: snapshot parameters with RCU on replace
    
    [ Upstream commit 62413a9c3cb183afb9bb6e94dd68caf4e4145f4c ]
    
    The gate action can be replaced while the hrtimer callback or dump path is
    walking the schedule list.
    
    Convert the parameters to an RCU-protected snapshot and swap updates under
    tcf_lock, freeing the previous snapshot via call_rcu(). When REPLACE omits
    the entry list, preserve the existing schedule so the effective state is
    unchanged.
    
    Fixes: a51c328df310 ("net: qos: introduce a gate control flow action")
    Cc: [email protected]
    Signed-off-by: Paul Moses <[email protected]>
    Tested-by: Vladimir Oltean <[email protected]>
    Acked-by: Jamal Hadi Salim <[email protected]>
    Reviewed-by: Victor Nogueira <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    [ hrtimer_setup() => hrtimer_init() + keep is_tcf_gate() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/sched: teql: Fix double-free in teql_master_xmit [+ + +]

Author: Jamal Hadi Salim <[email protected]>
Date:   Sun Mar 15 11:54:22 2026 -0400

    net/sched: teql: Fix double-free in teql_master_xmit
    
    [ Upstream commit 66360460cab63c248ca5b1070a01c0c29133b960 ]
    
    Whenever a TEQL devices has a lockless Qdisc as root, qdisc_reset should
    be called using the seq_lock to avoid racing with the datapath. Failure
    to do so may cause crashes like the following:
    
    [  238.028993][  T318] BUG: KASAN: double-free in skb_release_data (net/core/skbuff.c:1139)
    [  238.029328][  T318] Free of addr ffff88810c67ec00 by task poc_teql_uaf_ke/318
    [  238.029749][  T318]
    [  238.029900][  T318] CPU: 3 UID: 0 PID: 318 Comm: poc_teql_ke Not tainted 7.0.0-rc3-00149-ge5b31d988a41 #704 PREEMPT(full)
    [  238.029906][  T318] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
    [  238.029910][  T318] Call Trace:
    [  238.029913][  T318]  <TASK>
    [  238.029916][  T318]  dump_stack_lvl (lib/dump_stack.c:122)
    [  238.029928][  T318]  print_report (mm/kasan/report.c:379 mm/kasan/report.c:482)
    [  238.029940][  T318]  ? skb_release_data (net/core/skbuff.c:1139)
    [  238.029944][  T318]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    ...
    [  238.029957][  T318]  ? skb_release_data (net/core/skbuff.c:1139)
    [  238.029969][  T318]  kasan_report_invalid_free (mm/kasan/report.c:221 mm/kasan/report.c:563)
    [  238.029979][  T318]  ? skb_release_data (net/core/skbuff.c:1139)
    [  238.029989][  T318]  check_slab_allocation (mm/kasan/common.c:231)
    [  238.029995][  T318]  kmem_cache_free (mm/slub.c:2637 (discriminator 1) mm/slub.c:6168 (discriminator 1) mm/slub.c:6298 (discriminator 1))
    [  238.030004][  T318]  skb_release_data (net/core/skbuff.c:1139)
    ...
    [  238.030025][  T318]  sk_skb_reason_drop (net/core/skbuff.c:1256)
    [  238.030032][  T318]  pfifo_fast_reset (./include/linux/ptr_ring.h:171 ./include/linux/ptr_ring.h:309 ./include/linux/skb_array.h:98 net/sched/sch_generic.c:827)
    [  238.030039][  T318]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    ...
    [  238.030054][  T318]  qdisc_reset (net/sched/sch_generic.c:1034)
    [  238.030062][  T318]  teql_destroy (./include/linux/spinlock.h:395 net/sched/sch_teql.c:157)
    [  238.030071][  T318]  __qdisc_destroy (./include/net/pkt_sched.h:328 net/sched/sch_generic.c:1077)
    [  238.030077][  T318]  qdisc_graft (net/sched/sch_api.c:1062 net/sched/sch_api.c:1053 net/sched/sch_api.c:1159)
    [  238.030089][  T318]  ? __pfx_qdisc_graft (net/sched/sch_api.c:1091)
    [  238.030095][  T318]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [  238.030102][  T318]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [  238.030106][  T318]  ? srso_alias_return_thunk (arch/x86/lib/retpoline.S:221)
    [  238.030114][  T318]  tc_get_qdisc (net/sched/sch_api.c:1529 net/sched/sch_api.c:1556)
    ...
    [  238.072958][  T318] Allocated by task 303 on cpu 5 at 238.026275s:
    [  238.073392][  T318]  kasan_save_stack (mm/kasan/common.c:58)
    [  238.073884][  T318]  kasan_save_track (mm/kasan/common.c:64 (discriminator 5) mm/kasan/common.c:79 (discriminator 5))
    [  238.074230][  T318]  __kasan_slab_alloc (mm/kasan/common.c:369)
    [  238.074578][  T318]  kmem_cache_alloc_node_noprof (./include/linux/kasan.h:253 mm/slub.c:4542 mm/slub.c:4869 mm/slub.c:4921)
    [  238.076091][  T318]  kmalloc_reserve (net/core/skbuff.c:616 (discriminator 107))
    [  238.076450][  T318]  __alloc_skb (net/core/skbuff.c:713)
    [  238.076834][  T318]  alloc_skb_with_frags (./include/linux/skbuff.h:1383 net/core/skbuff.c:6763)
    [  238.077178][  T318]  sock_alloc_send_pskb (net/core/sock.c:2997)
    [  238.077520][  T318]  packet_sendmsg (net/packet/af_packet.c:2926 net/packet/af_packet.c:3019 net/packet/af_packet.c:3108)
    [  238.081469][  T318]
    [  238.081870][  T318] Freed by task 299 on cpu 1 at 238.028496s:
    [  238.082761][  T318]  kasan_save_stack (mm/kasan/common.c:58)
    [  238.083481][  T318]  kasan_save_track (mm/kasan/common.c:64 (discriminator 5) mm/kasan/common.c:79 (discriminator 5))
    [  238.085348][  T318]  kasan_save_free_info (mm/kasan/generic.c:587 (discriminator 1))
    [  238.085900][  T318]  __kasan_slab_free (mm/kasan/common.c:287)
    [  238.086439][  T318]  kmem_cache_free (mm/slub.c:6168 (discriminator 3) mm/slub.c:6298 (discriminator 3))
    [  238.087007][  T318]  skb_release_data (net/core/skbuff.c:1139)
    [  238.087491][  T318]  consume_skb (net/core/skbuff.c:1451)
    [  238.087757][  T318]  teql_master_xmit (net/sched/sch_teql.c:358)
    [  238.088116][  T318]  dev_hard_start_xmit (./include/linux/netdevice.h:5324 ./include/linux/netdevice.h:5333 net/core/dev.c:3871 net/core/dev.c:3887)
    [  238.088468][  T318]  sch_direct_xmit (net/sched/sch_generic.c:347)
    [  238.088820][  T318]  __qdisc_run (net/sched/sch_generic.c:420 (discriminator 1))
    [  238.089166][  T318]  __dev_queue_xmit (./include/net/sch_generic.h:229 ./include/net/pkt_sched.h:121 ./include/net/pkt_sched.h:117 net/core/dev.c:4196 net/core/dev.c:4802)
    
    Workflow to reproduce:
    1. Initialize a TEQL topology (dummy0 and ifb0 as slaves, teql0 up).
    2. Start multiple sender workers continuously transmitting packets
       through teql0 to drive teql_master_xmit().
    3. In parallel, repeatedly delete and re-add the root qdisc on
       dummy0 and ifb0 via RTNETLINK, forcing frequent teardown and reset activity
       (teql_destroy() / qdisc_reset()).
    4. After running both workloads concurrently for several iterations,
       KASAN reports slab-use-after-free or double-free in the skb free path.
    
    Fix this by moving dev_reset_queue to sch_generic.h and calling it, instead
    of qdisc_reset, in teql_destroy since it handles both the lock and lockless
    cases correctly for root qdiscs.
    
    Fixes: 96009c7d500e ("sched: replace __QDISC_STATE_RUNNING bit with a spin lock")
    Reported-by: Xianrui Dong <[email protected]>
    Tested-by: Xianrui Dong <[email protected]>
    Co-developed-by: Victor Nogueira <[email protected]>
    Signed-off-by: Victor Nogueira <[email protected]>
    Signed-off-by: Jamal Hadi Salim <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/sched: teql: fix NULL pointer dereference in iptunnel_xmit on TEQL slave xmit [+ + +]

Author: Weiming Shi <[email protected]>
Date:   Wed Mar 4 12:42:18 2026 +0800

    net/sched: teql: fix NULL pointer dereference in iptunnel_xmit on TEQL slave xmit
    
    [ Upstream commit 0cc0c2e661af418bbf7074179ea5cfffc0a5c466 ]
    
    teql_master_xmit() calls netdev_start_xmit(skb, slave) to transmit
    through slave devices, but does not update skb->dev to the slave device
    beforehand.
    
    When a gretap tunnel is a TEQL slave, the transmit path reaches
    iptunnel_xmit() which saves dev = skb->dev (still pointing to teql0
    master) and later calls iptunnel_xmit_stats(dev, pkt_len). This
    function does:
    
        get_cpu_ptr(dev->tstats)
    
    Since teql_master_setup() does not set dev->pcpu_stat_type to
    NETDEV_PCPU_STAT_TSTATS, the core network stack never allocates tstats
    for teql0, so dev->tstats is NULL. get_cpu_ptr(NULL) computes
    NULL + __per_cpu_offset[cpu], resulting in a page fault.
    
     BUG: unable to handle page fault for address: ffff8880e6659018
     #PF: supervisor write access in kernel mode
     #PF: error_code(0x0002) - not-present page
     PGD 68bc067 P4D 68bc067 PUD 0
     Oops: Oops: 0002 [#1] SMP KASAN PTI
     RIP: 0010:iptunnel_xmit (./include/net/ip_tunnels.h:664 net/ipv4/ip_tunnel_core.c:89)
     Call Trace:
      <TASK>
      ip_tunnel_xmit (net/ipv4/ip_tunnel.c:847)
      __gre_xmit (net/ipv4/ip_gre.c:478)
      gre_tap_xmit (net/ipv4/ip_gre.c:779)
      teql_master_xmit (net/sched/sch_teql.c:319)
      dev_hard_start_xmit (net/core/dev.c:3887)
      sch_direct_xmit (net/sched/sch_generic.c:347)
      __dev_queue_xmit (net/core/dev.c:4802)
      neigh_direct_output (net/core/neighbour.c:1660)
      ip_finish_output2 (net/ipv4/ip_output.c:237)
      __ip_finish_output.part.0 (net/ipv4/ip_output.c:315)
      ip_mc_output (net/ipv4/ip_output.c:369)
      ip_send_skb (net/ipv4/ip_output.c:1508)
      udp_send_skb (net/ipv4/udp.c:1195)
      udp_sendmsg (net/ipv4/udp.c:1485)
      inet_sendmsg (net/ipv4/af_inet.c:859)
      __sys_sendto (net/socket.c:2206)
    
    Fix this by setting skb->dev = slave before calling
    netdev_start_xmit(), so that tunnel xmit functions see the correct
    slave device with properly allocated tstats.
    
    Fixes: 039f50629b7f ("ip_tunnel: Move stats update to iptunnel_xmit()")
    Reported-by: Xiang Mei <[email protected]>
    Signed-off-by: Weiming Shi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/smc: fix NULL dereference and UAF in smc_tcp_syn_recv_sock() [+ + +]

Author: Jiayuan Chen <[email protected]>
Date:   Thu Mar 12 17:29:07 2026 +0800

    net/smc: fix NULL dereference and UAF in smc_tcp_syn_recv_sock()
    
    [ Upstream commit 6d5e4538364b9ceb1ac2941a4deb86650afb3538 ]
    
    Syzkaller reported a panic in smc_tcp_syn_recv_sock() [1].
    
    smc_tcp_syn_recv_sock() is called in the TCP receive path
    (softirq) via icsk_af_ops->syn_recv_sock on the clcsock (TCP
    listening socket). It reads sk_user_data to get the smc_sock
    pointer. However, when the SMC listen socket is being closed
    concurrently, smc_close_active() sets clcsock->sk_user_data
    to NULL under sk_callback_lock, and then the smc_sock itself
    can be freed via sock_put() in smc_release().
    
    This leads to two issues:
    
    1) NULL pointer dereference: sk_user_data is NULL when
       accessed.
    2) Use-after-free: sk_user_data is read as non-NULL, but the
       smc_sock is freed before its fields (e.g., queued_smc_hs,
       ori_af_ops) are accessed.
    
    The race window looks like this (the syzkaller crash [1]
    triggers via the SYN cookie path: tcp_get_cookie_sock() ->
    smc_tcp_syn_recv_sock(), but the normal tcp_check_req() path
    has the same race):
    
      CPU A (softirq)              CPU B (process ctx)
    
      tcp_v4_rcv()
        TCP_NEW_SYN_RECV:
        sk = req->rsk_listener
        sock_hold(sk)
        /* No lock on listener */
                                   smc_close_active():
                                     write_lock_bh(cb_lock)
                                     sk_user_data = NULL
                                     write_unlock_bh(cb_lock)
                                     ...
                                     smc_clcsock_release()
                                     sock_put(smc->sk) x2
                                       -> smc_sock freed!
        tcp_check_req()
          smc_tcp_syn_recv_sock():
            smc = user_data(sk)
              -> NULL or dangling
            smc->queued_smc_hs
              -> crash!
    
    Note that the clcsock and smc_sock are two independent objects
    with separate refcounts. TCP stack holds a reference on the
    clcsock, which keeps it alive, but this does NOT prevent the
    smc_sock from being freed.
    
    Fix this by using RCU and refcount_inc_not_zero() to safely
    access smc_sock. Since smc_tcp_syn_recv_sock() is called in
    the TCP three-way handshake path, taking read_lock_bh on
    sk_callback_lock is too heavy and would not survive a SYN
    flood attack. Using rcu_read_lock() is much more lightweight.
    
    - Set SOCK_RCU_FREE on the SMC listen socket so that
      smc_sock freeing is deferred until after the RCU grace
      period. This guarantees the memory is still valid when
      accessed inside rcu_read_lock().
    - Use rcu_read_lock() to protect reading sk_user_data.
    - Use refcount_inc_not_zero(&smc->sk.sk_refcnt) to pin the
      smc_sock. If the refcount has already reached zero (close
      path completed), it returns false and we bail out safely.
    
    Note: smc_hs_congested() has a similar lockless read of
    sk_user_data without rcu_read_lock(), but it only checks for
    NULL and accesses the global smc_hs_wq, never dereferencing
    any smc_sock field, so it is not affected.
    
    Reproducer was verified with mdelay injection and smc_run,
    the issue no longer occurs with this patch applied.
    
    [1] https://syzkaller.appspot.com/bug?extid=827ae2bfb3a3529333e9
    
    Fixes: 8270d9c21041 ("net/smc: Limit backlog connections")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/T/
    Suggested-by: Eric Dumazet <[email protected]>
    Reviewed-by: Eric Dumazet <[email protected]>
    Signed-off-by: Jiayuan Chen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/tcp-ao: Fix MAC comparison to be constant-time [+ + +]

Author: Eric Biggers <[email protected]>
Date:   Mon Mar 2 12:36:00 2026 -0800

    net/tcp-ao: Fix MAC comparison to be constant-time
    
    commit 67edfec516d30d3e62925c397be4a1e5185802fc upstream.
    
    To prevent timing attacks, MACs need to be compared in constant
    time.  Use the appropriate helper function for this.
    
    Fixes: 0a3a809089eb ("net/tcp: Verify inbound TCP-AO signed segments")
    Cc: [email protected]
    Signed-off-by: Eric Biggers <[email protected]>
    Reviewed-by: Dmitry Safonov <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/tcp-md5: Fix MAC comparison to be constant-time [+ + +]

Author: Eric Biggers <[email protected]>
Date:   Tue Mar 10 13:16:57 2026 -0700

    net/tcp-md5: Fix MAC comparison to be constant-time
    
    commit 46d0d6f50dab706637f4c18a470aac20a21900d3 upstream.
    
    To prevent timing attacks, MACs need to be compared in constant
    time.  Use the appropriate helper function for this.
    
    Fixes: cfb6eeb4c860 ("[TCP]: MD5 Signature Option (RFC2385) support.")
    Fixes: 658ddaaf6694 ("tcp: md5: RST: getting md5 key from listener")
    Cc: [email protected]
    Signed-off-by: Eric Biggers <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: add a common function to compute features for upper devices [+ + +]

Author: Hangbin Liu <[email protected]>
Date:   Fri Oct 17 03:41:52 2025 +0000

    net: add a common function to compute features for upper devices
    
    [ Upstream commit 28098defc79fe7d29e6bfe4eb6312991f6bdc3d3 ]
    
    Some high level software drivers need to compute features from lower
    devices. But each has their own implementations and may lost some
    feature compute. Let's use one common function to compute features
    for kinds of these devices.
    
    The new helper uses the current bond implementation as the reference
    one, as the latter already handles all the relevant aspects: netdev
    features, TSO limits and dst retention.
    
    Suggested-by: Paolo Abeni <[email protected]>
    Signed-off-by: Hangbin Liu <[email protected]>
    Reviewed-by: Sabrina Dubroca <[email protected]>
    Reviewed-by: Jiri Pirko <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: 950803f72547 ("bonding: fix type confusion in bond_setup_by_slave()")
    Signed-off-by: Sasha Levin <[email protected]>

net: add xmit recursion limit to tunnel xmit functions [+ + +]

Author: Weiming Shi <[email protected]>
Date:   Sat Mar 7 00:01:34 2026 +0800

    net: add xmit recursion limit to tunnel xmit functions
    
    [ Upstream commit 6f1a9140ecda3baba3d945b9a6155af4268aafc4 ]
    
    Tunnel xmit functions (iptunnel_xmit, ip6tunnel_xmit) lack their own
    recursion limit. When a bond device in broadcast mode has GRE tap
    interfaces as slaves, and those GRE tunnels route back through the
    bond, multicast/broadcast traffic triggers infinite recursion between
    bond_xmit_broadcast() and ip_tunnel_xmit()/ip6_tnl_xmit(), causing
    kernel stack overflow.
    
    The existing XMIT_RECURSION_LIMIT (8) in the no-qdisc path is not
    sufficient because tunnel recursion involves route lookups and full IP
    output, consuming much more stack per level. Use a lower limit of 4
    (IP_TUNNEL_RECURSION_LIMIT) to prevent overflow.
    
    Add recursion detection using dev_xmit_recursion helpers directly in
    iptunnel_xmit() and ip6tunnel_xmit() to cover all IPv4/IPv6 tunnel
    paths including UDP encapsulated tunnels (VXLAN, Geneve, etc.).
    
    Move dev_xmit_recursion helpers from net/core/dev.h to public header
    include/linux/netdevice.h so they can be used by tunnel code.
    
     BUG: KASAN: stack-out-of-bounds in blake2s.constprop.0+0xe7/0x160
     Write of size 32 at addr ffff88810033fed0 by task kworker/0:1/11
     Workqueue: mld mld_ifc_work
     Call Trace:
      <TASK>
      __build_flow_key.constprop.0 (net/ipv4/route.c:515)
      ip_rt_update_pmtu (net/ipv4/route.c:1073)
      iptunnel_xmit (net/ipv4/ip_tunnel_core.c:84)
      ip_tunnel_xmit (net/ipv4/ip_tunnel.c:847)
      gre_tap_xmit (net/ipv4/ip_gre.c:779)
      dev_hard_start_xmit (net/core/dev.c:3887)
      sch_direct_xmit (net/sched/sch_generic.c:347)
      __dev_queue_xmit (net/core/dev.c:4802)
      bond_dev_queue_xmit (drivers/net/bonding/bond_main.c:312)
      bond_xmit_broadcast (drivers/net/bonding/bond_main.c:5279)
      bond_start_xmit (drivers/net/bonding/bond_main.c:5530)
      dev_hard_start_xmit (net/core/dev.c:3887)
      __dev_queue_xmit (net/core/dev.c:4841)
      ip_finish_output2 (net/ipv4/ip_output.c:237)
      ip_output (net/ipv4/ip_output.c:438)
      iptunnel_xmit (net/ipv4/ip_tunnel_core.c:86)
      gre_tap_xmit (net/ipv4/ip_gre.c:779)
      dev_hard_start_xmit (net/core/dev.c:3887)
      sch_direct_xmit (net/sched/sch_generic.c:347)
      __dev_queue_xmit (net/core/dev.c:4802)
      bond_dev_queue_xmit (drivers/net/bonding/bond_main.c:312)
      bond_xmit_broadcast (drivers/net/bonding/bond_main.c:5279)
      bond_start_xmit (drivers/net/bonding/bond_main.c:5530)
      dev_hard_start_xmit (net/core/dev.c:3887)
      __dev_queue_xmit (net/core/dev.c:4841)
      ip_finish_output2 (net/ipv4/ip_output.c:237)
      ip_output (net/ipv4/ip_output.c:438)
      iptunnel_xmit (net/ipv4/ip_tunnel_core.c:86)
      ip_tunnel_xmit (net/ipv4/ip_tunnel.c:847)
      gre_tap_xmit (net/ipv4/ip_gre.c:779)
      dev_hard_start_xmit (net/core/dev.c:3887)
      sch_direct_xmit (net/sched/sch_generic.c:347)
      __dev_queue_xmit (net/core/dev.c:4802)
      bond_dev_queue_xmit (drivers/net/bonding/bond_main.c:312)
      bond_xmit_broadcast (drivers/net/bonding/bond_main.c:5279)
      bond_start_xmit (drivers/net/bonding/bond_main.c:5530)
      dev_hard_start_xmit (net/core/dev.c:3887)
      __dev_queue_xmit (net/core/dev.c:4841)
      mld_sendpack
      mld_ifc_work
      process_one_work
      worker_thread
      </TASK>
    
    Fixes: 745e20f1b626 ("net: add a recursion limit in xmit path")
    Reported-by: Xiang Mei <[email protected]>
    Signed-off-by: Weiming Shi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: airoha: fix PSE memory configuration in airoha_fe_pse_ports_init() [+ + +]

Author: Lorenzo Bianconi <[email protected]>
Date:   Tue Oct 1 12:10:25 2024 +0200

    net: airoha: fix PSE memory configuration in airoha_fe_pse_ports_init()
    
    [ Upstream commit 8e38e08f2c560328a873c35aff1a0dbea6a7d084 ]
    
    Align PSE memory configuration to vendor SDK. In particular, increase
    initial value of PSE reserved memory in airoha_fe_pse_ports_init()
    routine by the value used for the second Packet Processor Engine (PPE2)
    and do not overwrite the default value.
    
    Introduced by commit 23020f049327 ("net: airoha: Introduce ethernet support
    for EN7581 SoC")
    
    Signed-off-by: Lorenzo Bianconi <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: d4a533ad249e ("net: airoha: Remove airoha_dev_stop() in airoha_remove()")
    Signed-off-by: Sasha Levin <[email protected]>

net: airoha: Read completion queue data in airoha_qdma_tx_napi_poll() [+ + +]

Author: Lorenzo Bianconi <[email protected]>
Date:   Tue Oct 29 13:17:09 2024 +0100

    net: airoha: Read completion queue data in airoha_qdma_tx_napi_poll()
    
    [ Upstream commit 3affa310de523d63e52ea8e2efb3c476df29e414 ]
    
    In order to avoid any possible race, read completion queue head and
    pending entry in airoha_qdma_tx_napi_poll routine instead of doing it in
    airoha_irq_handler. Remove unused airoha_tx_irq_queue unused fields.
    This is a preliminary patch to add Qdisc offload for airoha_eth driver.
    
    Signed-off-by: Lorenzo Bianconi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: d4a533ad249e ("net: airoha: Remove airoha_dev_stop() in airoha_remove()")
    Signed-off-by: Sasha Levin <[email protected]>

net: airoha: read default PSE reserved pages value before updating [+ + +]

Author: Lorenzo Bianconi <[email protected]>
Date:   Tue Oct 1 12:10:24 2024 +0200

    net: airoha: read default PSE reserved pages value before updating
    
    [ Upstream commit 1f3e7ff4f296af1f4350f457d5bd82bc825e645a ]
    
    Store the default value for the number of PSE reserved pages in orig_val
    at the beginning of airoha_fe_set_pse_oq_rsv routine, before updating it
    with airoha_fe_set_pse_queue_rsv_pages().
    Introduce airoha_fe_get_pse_all_rsv utility routine.
    
    Introduced by commit 23020f049327 ("net: airoha: Introduce ethernet support
    for EN7581 SoC")
    
    Signed-off-by: Lorenzo Bianconi <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: d4a533ad249e ("net: airoha: Remove airoha_dev_stop() in airoha_remove()")
    Signed-off-by: Sasha Levin <[email protected]>

net: airoha: Remove airoha_dev_stop() in airoha_remove() [+ + +]

Author: Lorenzo Bianconi <[email protected]>
Date:   Fri Mar 13 12:27:00 2026 +0100

    net: airoha: Remove airoha_dev_stop() in airoha_remove()
    
    [ Upstream commit d4a533ad249e9fbdc2d0633f2ddd60a5b3a9a4ca ]
    
    Do not run airoha_dev_stop routine explicitly in airoha_remove()
    since ndo_stop() callback is already executed by unregister_netdev() in
    __dev_close_many routine if necessary and, doing so, we will end up causing
    an underflow in the qdma users atomic counters. Rely on networking subsystem
    to stop the device removing the airoha_eth module.
    
    Fixes: 23020f0493270 ("net: airoha: Introduce ethernet support for EN7581 SoC")
    Signed-off-by: Lorenzo Bianconi <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/20260313-airoha-remove-ndo_stop-remove-net-v2-1-67542c3ceeca@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: bcmgenet: fix broken EEE by converting to phylib-managed state [+ + +]

Author: Nicolai Buchwitz <[email protected]>
Date:   Tue Mar 10 06:49:35 2026 +0100

    net: bcmgenet: fix broken EEE by converting to phylib-managed state
    
    [ Upstream commit 908c344d5cfac4160f49715da9efacdf5b6a28bd ]
    
    The bcmgenet EEE implementation is broken in several ways.
    phy_support_eee() is never called, so the PHY never advertises EEE
    and phylib never sets phydev->enable_tx_lpi.  bcmgenet_mac_config()
    checks priv->eee.eee_enabled to decide whether to enable the MAC
    LPI logic, but that field is never initialised to true, so the MAC
    never enters Low Power Idle even when EEE is negotiated - wasting
    the power savings EEE is designed to provide.  The only way to get
    EEE working at all is a manual 'ethtool --set-eee eth0 eee on' after
    every link-up, and even then bcmgenet_get_eee() immediately clobbers
    the reported state because phy_ethtool_get_eee() overwrites
    eee_enabled and tx_lpi_enabled with the uninitialised PHY eee_cfg
    values.  Finally, bcmgenet_mac_config() is only called on link-up,
    so EEE is never disabled in hardware on link-down.
    
    Fix all of this by removing the MAC-side EEE state tracking
    (priv->eee) and aligning with the pattern used by other non-phylink
    MAC drivers such as FEC.
    
    Call phy_support_eee() in bcmgenet_mii_probe() so the PHY advertises
    EEE link modes and phylib tracks negotiation state.  Move the EEE
    hardware control to bcmgenet_mii_setup(), which is called on every
    link event, and drive it directly from phydev->enable_tx_lpi - the
    flag phylib sets when EEE is negotiated and the user has not disabled
    it.  This enables EEE automatically once the link partner agrees and
    disables it cleanly on link-down.
    
    Make bcmgenet_get_eee() and bcmgenet_set_eee() pure passthroughs to
    phy_ethtool_get_eee() and phy_ethtool_set_eee(), with the MAC
    hardware register read/written for tx_lpi_timer.  Drop struct
    ethtool_keee eee from struct bcmgenet_priv.
    
    Fixes: fe0d4fd9285e ("net: phy: Keep track of EEE configuration")
    Link: https://lore.kernel.org/netdev/[email protected]/
    Suggested-by: Andrew Lunn <[email protected]>
    Signed-off-by: Nicolai Buchwitz <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Tested-by: Florian Fainelli <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: bcmgenet: increase WoL poll timeout [+ + +]

Author: Justin Chen <[email protected]>
Date:   Thu Mar 12 12:18:52 2026 -0700

    net: bcmgenet: increase WoL poll timeout
    
    [ Upstream commit 6cfc3bc02b977f2fba5f7268e6504d1931a774f7 ]
    
    Some systems require more than 5ms to get into WoL mode. Increase the
    timeout value to 50ms.
    
    Fixes: c51de7f3976b ("net: bcmgenet: add Wake-on-LAN support code")
    Signed-off-by: Justin Chen <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: bonding: Fix nd_tbl NULL dereference when IPv6 is disabled [+ + +]

Author: Ricardo B. Marlière <[email protected]>
Date:   Sat Mar 7 17:50:54 2026 -0300

    net: bonding: Fix nd_tbl NULL dereference when IPv6 is disabled
    
    [ Upstream commit 30021e969d48e5819d5ae56936c2f34c0f7ce997 ]
    
    When booting with the 'ipv6.disable=1' parameter, the nd_tbl is never
    initialized because inet6_init() exits before ndisc_init() is called
    which initializes it. If bonding ARP/NS validation is enabled, an IPv6
    NS/NA packet received on a slave can reach bond_validate_na(), which
    calls bond_has_this_ip6(). That path calls ipv6_chk_addr() and can
    crash in __ipv6_chk_addr_and_flags().
    
     BUG: kernel NULL pointer dereference, address: 00000000000005d8
     Oops: Oops: 0000 [#1] SMP NOPTI
     RIP: 0010:__ipv6_chk_addr_and_flags+0x69/0x170
     Call Trace:
      <IRQ>
      ipv6_chk_addr+0x1f/0x30
      bond_validate_na+0x12e/0x1d0 [bonding]
      ? __pfx_bond_handle_frame+0x10/0x10 [bonding]
      bond_rcv_validate+0x1a0/0x450 [bonding]
      bond_handle_frame+0x5e/0x290 [bonding]
      ? srso_alias_return_thunk+0x5/0xfbef5
      __netif_receive_skb_core.constprop.0+0x3e8/0xe50
      ? srso_alias_return_thunk+0x5/0xfbef5
      ? update_cfs_rq_load_avg+0x1a/0x240
      ? srso_alias_return_thunk+0x5/0xfbef5
      ? __enqueue_entity+0x5e/0x240
      __netif_receive_skb_one_core+0x39/0xa0
      process_backlog+0x9c/0x150
      __napi_poll+0x30/0x200
      ? srso_alias_return_thunk+0x5/0xfbef5
      net_rx_action+0x338/0x3b0
      handle_softirqs+0xc9/0x2a0
      do_softirq+0x42/0x60
      </IRQ>
      <TASK>
      __local_bh_enable_ip+0x62/0x70
      __dev_queue_xmit+0x2d3/0x1000
      ? srso_alias_return_thunk+0x5/0xfbef5
      ? srso_alias_return_thunk+0x5/0xfbef5
      ? packet_parse_headers+0x10a/0x1a0
      packet_sendmsg+0x10da/0x1700
      ? kick_pool+0x5f/0x140
      ? srso_alias_return_thunk+0x5/0xfbef5
      ? __queue_work+0x12d/0x4f0
      __sys_sendto+0x1f3/0x220
      __x64_sys_sendto+0x24/0x30
      do_syscall_64+0x101/0xf80
      ? exc_page_fault+0x6e/0x170
      ? srso_alias_return_thunk+0x5/0xfbef5
      entry_SYSCALL_64_after_hwframe+0x77/0x7f
      </TASK>
    
    Fix this by checking ipv6_mod_enabled() before dispatching IPv6 packets to
    bond_na_rcv(). If IPv6 is disabled, return early from bond_rcv_validate()
    and avoid the path to ipv6_chk_addr().
    
    Suggested-by: Fernando Fernandez Mancera <[email protected]>
    Fixes: 4e24be018eb9 ("bonding: add new parameter ns_targets")
    Signed-off-by: Ricardo B. Marlière <[email protected]>
    Reviewed-by: Hangbin Liu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: bonding: fix NULL deref in bond_debug_rlb_hash_show [+ + +]

Author: Xiang Mei <[email protected]>
Date:   Mon Mar 16 17:50:34 2026 -0700

    net: bonding: fix NULL deref in bond_debug_rlb_hash_show
    
    [ Upstream commit 605b52497bf89b3b154674deb135da98f916e390 ]
    
    rlb_clear_slave intentionally keeps RLB hash-table entries on
    the rx_hashtbl_used_head list with slave set to NULL when no
    replacement slave is available. However, bond_debug_rlb_hash_show
    visites client_info->slave without checking if it's NULL.
    
    Other used-list iterators in bond_alb.c already handle this NULL-slave
    state safely:
    
    - rlb_update_client returns early on !client_info->slave
    - rlb_req_update_slave_clients, rlb_clear_slave, and rlb_rebalance
    compare slave values before visiting
    - lb_req_update_subnet_clients continues if slave is NULL
    
    The following NULL deref crash can be trigger in
    bond_debug_rlb_hash_show:
    
    [    1.289791] BUG: kernel NULL pointer dereference, address: 0000000000000000
    [    1.292058] RIP: 0010:bond_debug_rlb_hash_show (drivers/net/bonding/bond_debugfs.c:41)
    [    1.293101] RSP: 0018:ffffc900004a7d00 EFLAGS: 00010286
    [    1.293333] RAX: 0000000000000000 RBX: ffff888102b48200 RCX: ffff888102b48204
    [    1.293631] RDX: ffff888102b48200 RSI: ffffffff839daad5 RDI: ffff888102815078
    [    1.293924] RBP: ffff888102815078 R08: ffff888102b4820e R09: 0000000000000000
    [    1.294267] R10: 0000000000000000 R11: 0000000000000000 R12: ffff888100f929c0
    [    1.294564] R13: ffff888100f92a00 R14: 0000000000000001 R15: ffffc900004a7ed8
    [    1.294864] FS:  0000000001395380(0000) GS:ffff888196e75000(0000) knlGS:0000000000000000
    [    1.295239] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [    1.295480] CR2: 0000000000000000 CR3: 0000000102adc004 CR4: 0000000000772ef0
    [    1.295897] Call Trace:
    [    1.296134]  seq_read_iter (fs/seq_file.c:231)
    [    1.296341]  seq_read (fs/seq_file.c:164)
    [    1.296493]  full_proxy_read (fs/debugfs/file.c:378 (discriminator 1))
    [    1.296658]  vfs_read (fs/read_write.c:572)
    [    1.296981]  ksys_read (fs/read_write.c:717)
    [    1.297132]  do_syscall_64 (arch/x86/entry/syscall_64.c:63 (discriminator 1) arch/x86/entry/syscall_64.c:94 (discriminator 1))
    [    1.297325]  entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130)
    
    Add a NULL check and print "(none)" for entries with no assigned slave.
    
    Fixes: caafa84251b88 ("bonding: add the debugfs interface to see RLB hash table")
    Reported-by: Weiming Shi <[email protected]>
    Signed-off-by: Xiang Mei <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: dsa: bcm_sf2: fix missing clk_disable_unprepare() in error paths [+ + +]

Author: Anas Iqbal <[email protected]>
Date:   Wed Mar 18 08:42:12 2026 +0000

    net: dsa: bcm_sf2: fix missing clk_disable_unprepare() in error paths
    
    [ Upstream commit b48731849609cbd8c53785a48976850b443153fd ]
    
    Smatch reports:
    drivers/net/dsa/bcm_sf2.c:997 bcm_sf2_sw_resume() warn:
    'priv->clk' from clk_prepare_enable() not released on lines: 983,990.
    
    The clock enabled by clk_prepare_enable() in bcm_sf2_sw_resume()
    is not released if bcm_sf2_sw_rst() or bcm_sf2_cfp_resume() fails.
    
    Add the missing clk_disable_unprepare() calls in the error paths
    to properly release the clock resource.
    
    Fixes: e9ec5c3bd238 ("net: dsa: bcm_sf2: request and handle clocks")
    Reviewed-by: Jonas Gorski <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Signed-off-by: Anas Iqbal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: dsa: microchip: Fix error path in PTP IRQ setup [+ + +]

Author: Bastien Curutchet (Schneider Electric) <[email protected]>
Date:   Mon Mar 9 14:15:43 2026 +0100

    net: dsa: microchip: Fix error path in PTP IRQ setup
    
    commit 99c8c16a4aad0b37293cae213e15957c573cf79b upstream.
    
    If request_threaded_irq() fails during the PTP message IRQ setup, the
    newly created IRQ mapping is never disposed. Indeed, the
    ksz_ptp_irq_setup()'s error path only frees the mappings that were
    successfully set up.
    
    Dispose the newly created mapping if the associated
    request_threaded_irq() fails at setup.
    
    Cc: [email protected]
    Fixes: d0b8fec8ae505 ("net: dsa: microchip: Fix symetry in ksz_ptp_msg_irq_{setup/free}()")
    Signed-off-by: Bastien Curutchet (Schneider Electric) <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Reviewed-by: Vladimir Oltean <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: dsa: properly keep track of conduit reference [+ + +]

Author: Vladimir Oltean <[email protected]>
Date:   Tue Mar 3 13:51:20 2026 +0800

    net: dsa: properly keep track of conduit reference
    
    [ Upstream commit 06e219f6a706c367c93051f408ac61417643d2f9 ]
    
    Problem description
    -------------------
    
    DSA has a mumbo-jumbo of reference handling of the conduit net device
    and its kobject which, sadly, is just wrong and doesn't make sense.
    
    There are two distinct problems.
    
    1. The OF path, which uses of_find_net_device_by_node(), never releases
       the elevated refcount on the conduit's kobject. Nominally, the OF and
       non-OF paths should result in objects having identical reference
       counts taken, and it is already suspicious that
       dsa_dev_to_net_device() has a put_device() call which is missing in
       dsa_port_parse_of(), but we can actually even verify that an issue
       exists. With CONFIG_DEBUG_KOBJECT_RELEASE=y, if we run this command
       "before" and "after" applying this patch:
    
    (unbind the conduit driver for net device eno2)
    echo 0000:00:00.2 > /sys/bus/pci/drivers/fsl_enetc/unbind
    
    we see these lines in the output diff which appear only with the patch
    applied:
    
    kobject: 'eno2' (ffff002009a3a6b8): kobject_release, parent 0000000000000000 (delayed 1000)
    kobject: '109' (ffff0020099d59a0): kobject_release, parent 0000000000000000 (delayed 1000)
    
    2. After we find the conduit interface one way (OF) or another (non-OF),
       it can get unregistered at any time, and DSA remains with a long-lived,
       but in this case stale, cpu_dp->conduit pointer. Holding the net
       device's underlying kobject isn't actually of much help, it just
       prevents it from being freed (but we never need that kobject
       directly). What helps us to prevent the net device from being
       unregistered is the parallel netdev reference mechanism (dev_hold()
       and dev_put()).
    
    Actually we actually use that netdev tracker mechanism implicitly on
    user ports since commit 2f1e8ea726e9 ("net: dsa: link interfaces with
    the DSA master to get rid of lockdep warnings"), via netdev_upper_dev_link().
    But time still passes at DSA switch probe time between the initial
    of_find_net_device_by_node() code and the user port creation time, time
    during which the conduit could unregister itself and DSA wouldn't know
    about it.
    
    So we have to run of_find_net_device_by_node() under rtnl_lock() to
    prevent that from happening, and release the lock only with the netdev
    tracker having acquired the reference.
    
    Do we need to keep the reference until dsa_unregister_switch() /
    dsa_switch_shutdown()?
    1: Maybe yes. A switch device will still be registered even if all user
       ports failed to probe, see commit 86f8b1c01a0a ("net: dsa: Do not
       make user port errors fatal"), and the cpu_dp->conduit pointers
       remain valid.  I haven't audited all call paths to see whether they
       will actually use the conduit in lack of any user port, but if they
       do, it seems safer to not rely on user ports for that reference.
    2. Definitely yes. We support changing the conduit which a user port is
       associated to, and we can get into a situation where we've moved all
       user ports away from a conduit, thus no longer hold any reference to
       it via the net device tracker. But we shouldn't let it go nonetheless
       - see the next change in relation to dsa_tree_find_first_conduit()
       and LAG conduits which disappear.
       We have to be prepared to return to the physical conduit, so the CPU
       port must explicitly keep another reference to it. This is also to
       say: the user ports and their CPU ports may not always keep a
       reference to the same conduit net device, and both are needed.
    
    As for the conduit's kobject for the /sys/class/net/ entry, we don't
    care about it, we can release it as soon as we hold the net device
    object itself.
    
    History and blame attribution
    -----------------------------
    
    The code has been refactored so many times, it is very difficult to
    follow and properly attribute a blame, but I'll try to make a short
    history which I hope to be correct.
    
    We have two distinct probing paths:
    - one for OF, introduced in 2016 in commit 83c0afaec7b7 ("net: dsa: Add
      new binding implementation")
    - one for non-OF, introduced in 2017 in commit 71e0bbde0d88 ("net: dsa:
      Add support for platform data")
    
    These are both complete rewrites of the original probing paths (which
    used struct dsa_switch_driver and other weird stuff, instead of regular
    devices on their respective buses for register access, like MDIO, SPI,
    I2C etc):
    - one for OF, introduced in 2013 in commit 5e95329b701c ("dsa: add
      device tree bindings to register DSA switches")
    - one for non-OF, introduced in 2008 in commit 91da11f870f0 ("net:
      Distributed Switch Architecture protocol support")
    
    except for tiny bits and pieces like dsa_dev_to_net_device() which were
    seemingly carried over since the original commit, and used to this day.
    
    The point is that the original probing paths received a fix in 2015 in
    the form of commit 679fb46c5785 ("net: dsa: Add missing master netdev
    dev_put() calls"), but the fix never made it into the "new" (dsa2)
    probing paths that can still be traced to today, and the fixed probing
    path was later deleted in 2019 in commit 93e86b3bc842 ("net: dsa: Remove
    legacy probing support").
    
    That is to say, the new probing paths were never quite correct in this
    area.
    
    The existence of the legacy probing support which was deleted in 2019
    explains why dsa_dev_to_net_device() returns a conduit with elevated
    refcount (because it was supposed to be released during
    dsa_remove_dst()). After the removal of the legacy code, the only user
    of dsa_dev_to_net_device() calls dev_put(conduit) immediately after this
    function returns. This pattern makes no sense today, and can only be
    interpreted historically to understand why dev_hold() was there in the
    first place.
    
    Change details
    --------------
    
    Today we have a better netdev tracking infrastructure which we should
    use. Logically netdev_hold() belongs in common code
    (dsa_port_parse_cpu(), where dp->conduit is assigned), but there is a
    tradeoff to be made with the rtnl_lock() section which would become a
    bit too long if we did that - dsa_port_parse_cpu() also calls
    request_module(). So we duplicate a bit of logic in order for the
    callers of dsa_port_parse_cpu() to be the ones responsible of holding
    the conduit reference and releasing it on error. This shortens the
    rtnl_lock() section significantly.
    
    In the dsa_switch_probe() error path, dsa_switch_release_ports() will be
    called in a number of situations, one being where dsa_port_parse_cpu()
    maybe didn't get the chance to run at all (a different port failed
    earlier, etc). So we have to test for the conduit being NULL prior to
    calling netdev_put().
    
    There have still been so many transformations to the code since the
    blamed commits (rename master -> conduit, commit 0650bf52b31f ("net:
    dsa: be compatible with masters which unregister on shutdown")), that it
    only makes sense to fix the code using the best methods available today
    and see how it can be backported to stable later. I suspect the fix
    cannot even be backported to kernels which lack dsa_switch_shutdown(),
    and I suspect this is also maybe why the long-lived conduit reference
    didn't make it into the new DSA probing paths at the time (problems
    during shutdown).
    
    Because dsa_dev_to_net_device() has a single call site and has to be
    changed anyway, the logic was just absorbed into the non-OF
    dsa_port_parse().
    
    Tested on the ocelot/felix switch and on dsa_loop, both on the NXP
    LS1028A with CONFIG_DEBUG_KOBJECT_RELEASE=y.
    
    Reported-by: Ma Ke <[email protected]>
    Closes: https://lore.kernel.org/netdev/[email protected]/
    Fixes: 83c0afaec7b7 ("net: dsa: Add new binding implementation")
    Fixes: 71e0bbde0d88 ("net: dsa: Add support for platform data")
    Reviewed-by: Jonas Gorski <[email protected]>
    Signed-off-by: Vladimir Oltean <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Robert Garcia <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: dsa: realtek: Fix LED group port bit for non-zero LED group [+ + +]

Author: Marek Behún <[email protected]>
Date:   Wed Mar 11 12:12:37 2026 +0100

    net: dsa: realtek: Fix LED group port bit for non-zero LED group
    
    [ Upstream commit e8f0dc024ce55451ebd54bad975134ba802e4fcc ]
    
    The rtl8366rb_led_group_port_mask() function always returns LED port
    bit in LED group 0; the switch statement returns the same thing in all
    non-default cases.
    
    This means that the driver does not currently support configuring LEDs
    in non-zero LED groups.
    
    Fix this.
    
    Fixes: 32d617005475a71e ("net: dsa: realtek: add LED drivers for rtl8366rb")
    Signed-off-by: Marek Behún <[email protected]>
    Reviewed-by: Andrew Lunn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: dsa: realtek: rtl8365mb: remove ifOutDiscards from rx_packets [+ + +]

Author: Mieczyslaw Nalewaj <[email protected]>
Date:   Tue Mar 3 17:25:12 2026 -0300

    net: dsa: realtek: rtl8365mb: remove ifOutDiscards from rx_packets
    
    [ Upstream commit f76a93241d71fbba8425e3967097b498c29264ed ]
    
    rx_packets should report the number of frames successfully received:
    unicast + multicast + broadcast. Subtracting ifOutDiscards (a TX
    counter) is incorrect and can undercount RX packets. RX drops are
    already reported via rx_dropped (e.g. etherStatsDropEvents), so
    there is no need to adjust rx_packets.
    
    This patch removes the subtraction of ifOutDiscards from rx_packets
    in rtl8365mb_stats_update().
    
    Link: https://lore.kernel.org/netdev/[email protected]/
    Fixes: 4af2950c50c8 ("net: dsa: realtek-smi: add rtl8365mb subdriver for RTL8365MB-VC")
    Signed-off-by: Mieczyslaw Nalewaj <[email protected]>
    Signed-off-by: Luiz Angelo Daros de Luca <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Acked-by: Linus Walleij <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: ethernet: arc: emac: quiesce interrupts before requesting IRQ [+ + +]

Author: Fan Wu <[email protected]>
Date:   Mon Mar 9 13:24:09 2026 +0000

    net: ethernet: arc: emac: quiesce interrupts before requesting IRQ
    
    commit 2503d08f8a2de618e5c3a8183b250ff4a2e2d52c upstream.
    
    Normal RX/TX interrupts are enabled later, in arc_emac_open(), so probe
    should not see interrupt delivery in the usual case. However, hardware may
    still present stale or latched interrupt status left by firmware or the
    bootloader.
    
    If probe later unwinds after devm_request_irq() has installed the handler,
    such a stale interrupt can still reach arc_emac_intr() during teardown and
    race with release of the associated net_device.
    
    Avoid that window by putting the device into a known quiescent state before
    requesting the IRQ: disable all EMAC interrupt sources and clear any
    pending EMAC interrupt status bits. This keeps the change hardware-focused
    and minimal, while preventing spurious IRQ delivery from leftover state.
    
    Fixes: e4f2379db6c6 ("ethernet/arc/arc_emac - Add new driver")
    Cc: [email protected]
    Signed-off-by: Fan Wu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: macb: fix uninitialized rx_fs_lock [+ + +]

Author: Fedor Pchelkin <[email protected]>
Date:   Mon Mar 16 13:38:25 2026 +0300

    net: macb: fix uninitialized rx_fs_lock
    
    [ Upstream commit 34b11cc56e4369bc08b1f4c4a04222d75ed596ce ]
    
    If hardware doesn't support RX Flow Filters, rx_fs_lock spinlock is not
    initialized leading to the following assertion splat triggerable via
    set_rxnfc callback.
    
    INFO: trying to register non-static key.
    The code is fine but needs lockdep annotation, or maybe
    you didn't initialize this object before use?
    turning off the locking correctness validator.
    CPU: 1 PID: 949 Comm: syz.0.6 Not tainted 6.1.164+ #113
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.16.1-0-g3208b098f51a-prebuilt.qemu.org 04/01/2014
    Call Trace:
     <TASK>
     __dump_stack lib/dump_stack.c:88 [inline]
     dump_stack_lvl+0x8d/0xba lib/dump_stack.c:106
     assign_lock_key kernel/locking/lockdep.c:974 [inline]
     register_lock_class+0x141b/0x17f0 kernel/locking/lockdep.c:1287
     __lock_acquire+0x74f/0x6c40 kernel/locking/lockdep.c:4928
     lock_acquire kernel/locking/lockdep.c:5662 [inline]
     lock_acquire+0x190/0x4b0 kernel/locking/lockdep.c:5627
     __raw_spin_lock_irqsave include/linux/spinlock_api_smp.h:110 [inline]
     _raw_spin_lock_irqsave+0x33/0x50 kernel/locking/spinlock.c:162
     gem_del_flow_filter drivers/net/ethernet/cadence/macb_main.c:3562 [inline]
     gem_set_rxnfc+0x533/0xac0 drivers/net/ethernet/cadence/macb_main.c:3667
     ethtool_set_rxnfc+0x18c/0x280 net/ethtool/ioctl.c:961
     __dev_ethtool net/ethtool/ioctl.c:2956 [inline]
     dev_ethtool+0x229c/0x6290 net/ethtool/ioctl.c:3095
     dev_ioctl+0x637/0x1070 net/core/dev_ioctl.c:510
     sock_do_ioctl+0x20d/0x2c0 net/socket.c:1215
     sock_ioctl+0x577/0x6d0 net/socket.c:1320
     vfs_ioctl fs/ioctl.c:51 [inline]
     __do_sys_ioctl fs/ioctl.c:870 [inline]
     __se_sys_ioctl fs/ioctl.c:856 [inline]
     __x64_sys_ioctl+0x18c/0x210 fs/ioctl.c:856
     do_syscall_x64 arch/x86/entry/common.c:46 [inline]
     do_syscall_64+0x35/0x80 arch/x86/entry/common.c:76
     entry_SYSCALL_64_after_hwframe+0x6e/0xd8
    
    A more straightforward solution would be to always initialize rx_fs_lock,
    just like rx_fs_list.  However, in this case the driver set_rxnfc callback
    would return with a rather confusing error code, e.g. -EINVAL.  So deny
    set_rxnfc attempts directly if the RX filtering feature is not supported
    by hardware.
    
    Fixes: ae8223de3df5 ("net: macb: Added support for RX filtering")
    Signed-off-by: Fedor Pchelkin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: macb: fix use-after-free access to PTP clock [+ + +]

Author: Fedor Pchelkin <[email protected]>
Date:   Mon Mar 16 13:38:24 2026 +0300

    net: macb: fix use-after-free access to PTP clock
    
    commit 8da13e6d63c1a97f7302d342c89c4a56a55c7015 upstream.
    
    PTP clock is registered on every opening of the interface and destroyed on
    every closing.  However it may be accessed via get_ts_info ethtool call
    which is possible while the interface is just present in the kernel.
    
    BUG: KASAN: use-after-free in ptp_clock_index+0x47/0x50 drivers/ptp/ptp_clock.c:426
    Read of size 4 at addr ffff8880194345cc by task syz.0.6/948
    
    CPU: 1 PID: 948 Comm: syz.0.6 Not tainted 6.1.164+ #109
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.16.1-0-g3208b098f51a-prebuilt.qemu.org 04/01/2014
    Call Trace:
     <TASK>
     __dump_stack lib/dump_stack.c:88 [inline]
     dump_stack_lvl+0x8d/0xba lib/dump_stack.c:106
     print_address_description mm/kasan/report.c:316 [inline]
     print_report+0x17f/0x496 mm/kasan/report.c:420
     kasan_report+0xd9/0x180 mm/kasan/report.c:524
     ptp_clock_index+0x47/0x50 drivers/ptp/ptp_clock.c:426
     gem_get_ts_info+0x138/0x1e0 drivers/net/ethernet/cadence/macb_main.c:3349
     macb_get_ts_info+0x68/0xb0 drivers/net/ethernet/cadence/macb_main.c:3371
     __ethtool_get_ts_info+0x17c/0x260 net/ethtool/common.c:558
     ethtool_get_ts_info net/ethtool/ioctl.c:2367 [inline]
     __dev_ethtool net/ethtool/ioctl.c:3017 [inline]
     dev_ethtool+0x2b05/0x6290 net/ethtool/ioctl.c:3095
     dev_ioctl+0x637/0x1070 net/core/dev_ioctl.c:510
     sock_do_ioctl+0x20d/0x2c0 net/socket.c:1215
     sock_ioctl+0x577/0x6d0 net/socket.c:1320
     vfs_ioctl fs/ioctl.c:51 [inline]
     __do_sys_ioctl fs/ioctl.c:870 [inline]
     __se_sys_ioctl fs/ioctl.c:856 [inline]
     __x64_sys_ioctl+0x18c/0x210 fs/ioctl.c:856
     do_syscall_x64 arch/x86/entry/common.c:46 [inline]
     do_syscall_64+0x35/0x80 arch/x86/entry/common.c:76
     entry_SYSCALL_64_after_hwframe+0x6e/0xd8
     </TASK>
    
    Allocated by task 457:
     kmalloc include/linux/slab.h:563 [inline]
     kzalloc include/linux/slab.h:699 [inline]
     ptp_clock_register+0x144/0x10e0 drivers/ptp/ptp_clock.c:235
     gem_ptp_init+0x46f/0x930 drivers/net/ethernet/cadence/macb_ptp.c:375
     macb_open+0x901/0xd10 drivers/net/ethernet/cadence/macb_main.c:2920
     __dev_open+0x2ce/0x500 net/core/dev.c:1501
     __dev_change_flags+0x56a/0x740 net/core/dev.c:8651
     dev_change_flags+0x92/0x170 net/core/dev.c:8722
     do_setlink+0xaf8/0x3a80 net/core/rtnetlink.c:2833
     __rtnl_newlink+0xbf4/0x1940 net/core/rtnetlink.c:3608
     rtnl_newlink+0x63/0xa0 net/core/rtnetlink.c:3655
     rtnetlink_rcv_msg+0x3c6/0xed0 net/core/rtnetlink.c:6150
     netlink_rcv_skb+0x15d/0x430 net/netlink/af_netlink.c:2511
     netlink_unicast_kernel net/netlink/af_netlink.c:1318 [inline]
     netlink_unicast+0x6d7/0xa30 net/netlink/af_netlink.c:1344
     netlink_sendmsg+0x97e/0xeb0 net/netlink/af_netlink.c:1872
     sock_sendmsg_nosec net/socket.c:718 [inline]
     __sock_sendmsg+0x14b/0x180 net/socket.c:730
     __sys_sendto+0x320/0x3b0 net/socket.c:2152
     __do_sys_sendto net/socket.c:2164 [inline]
     __se_sys_sendto net/socket.c:2160 [inline]
     __x64_sys_sendto+0xdc/0x1b0 net/socket.c:2160
     do_syscall_x64 arch/x86/entry/common.c:46 [inline]
     do_syscall_64+0x35/0x80 arch/x86/entry/common.c:76
     entry_SYSCALL_64_after_hwframe+0x6e/0xd8
    
    Freed by task 938:
     kasan_slab_free include/linux/kasan.h:177 [inline]
     slab_free_hook mm/slub.c:1729 [inline]
     slab_free_freelist_hook mm/slub.c:1755 [inline]
     slab_free mm/slub.c:3687 [inline]
     __kmem_cache_free+0xbc/0x320 mm/slub.c:3700
     device_release+0xa0/0x240 drivers/base/core.c:2507
     kobject_cleanup lib/kobject.c:681 [inline]
     kobject_release lib/kobject.c:712 [inline]
     kref_put include/linux/kref.h:65 [inline]
     kobject_put+0x1cd/0x350 lib/kobject.c:729
     put_device+0x1b/0x30 drivers/base/core.c:3805
     ptp_clock_unregister+0x171/0x270 drivers/ptp/ptp_clock.c:391
     gem_ptp_remove+0x4e/0x1f0 drivers/net/ethernet/cadence/macb_ptp.c:404
     macb_close+0x1c8/0x270 drivers/net/ethernet/cadence/macb_main.c:2966
     __dev_close_many+0x1b9/0x310 net/core/dev.c:1585
     __dev_close net/core/dev.c:1597 [inline]
     __dev_change_flags+0x2bb/0x740 net/core/dev.c:8649
     dev_change_flags+0x92/0x170 net/core/dev.c:8722
     dev_ifsioc+0x151/0xe00 net/core/dev_ioctl.c:326
     dev_ioctl+0x33e/0x1070 net/core/dev_ioctl.c:572
     sock_do_ioctl+0x20d/0x2c0 net/socket.c:1215
     sock_ioctl+0x577/0x6d0 net/socket.c:1320
     vfs_ioctl fs/ioctl.c:51 [inline]
     __do_sys_ioctl fs/ioctl.c:870 [inline]
     __se_sys_ioctl fs/ioctl.c:856 [inline]
     __x64_sys_ioctl+0x18c/0x210 fs/ioctl.c:856
     do_syscall_x64 arch/x86/entry/common.c:46 [inline]
     do_syscall_64+0x35/0x80 arch/x86/entry/common.c:76
     entry_SYSCALL_64_after_hwframe+0x6e/0xd8
    
    Set the PTP clock pointer to NULL after unregistering.
    
    Fixes: c2594d804d5c ("macb: Common code to enable ptp support for MACB/GEM")
    Cc: [email protected]
    Signed-off-by: Fedor Pchelkin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: macb: Introduce gem_init_rx_ring() [+ + +]

Author: Kevin Hao <[email protected]>
Date:   Fri Mar 20 10:14:00 2026 -0400

    net: macb: Introduce gem_init_rx_ring()
    
    [ Upstream commit 1a7124ecd655bcaf1845197fe416aa25cff4c3ea ]
    
    Extract the initialization code for the GEM RX ring into a new function.
    This change will be utilized in a subsequent patch. No functional changes
    are introduced.
    
    Signed-off-by: Kevin Hao <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: 718d0766ce4c ("net: macb: Reinitialize tx/rx queue pointer registers and rx ring during resume")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: macb: Reinitialize tx/rx queue pointer registers and rx ring during resume [+ + +]

Author: Kevin Hao <[email protected]>
Date:   Fri Mar 20 10:14:01 2026 -0400

    net: macb: Reinitialize tx/rx queue pointer registers and rx ring during resume
    
    [ Upstream commit 718d0766ce4c7634ce62fa78b526ea7263487edd ]
    
    On certain platforms, such as AMD Versal boards, the tx/rx queue pointer
    registers are cleared after suspend, and the rx queue pointer register
    is also disabled during suspend if WOL is enabled. Previously, we assumed
    that these registers would be restored by macb_mac_link_up(). However,
    in commit bf9cf80cab81, macb_init_buffers() was moved from
    macb_mac_link_up() to macb_open(). Therefore, we should call
    macb_init_buffers() to reinitialize the tx/rx queue pointer registers
    during resume.
    
    Due to the reset of these two registers, we also need to adjust the
    tx/rx rings accordingly. The tx ring will be handled by
    gem_shuffle_tx_rings() in macb_mac_link_up(), so we only need to
    initialize the rx ring here.
    
    Fixes: bf9cf80cab81 ("net: macb: Fix tx/rx malfunction after phy link down and up")
    Reported-by: Quanyang Wang <[email protected]>
    Signed-off-by: Kevin Hao <[email protected]>
    Tested-by: Quanyang Wang <[email protected]>
    Cc: [email protected]
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: macb: Shuffle the tx ring before enabling tx [+ + +]

Author: Kevin Hao <[email protected]>
Date:   Wed Mar 18 12:11:53 2026 -0400

    net: macb: Shuffle the tx ring before enabling tx
    
    [ Upstream commit 881a0263d502e1a93ebc13a78254e9ad19520232 ]
    
    Quanyang observed that when using an NFS rootfs on an AMD ZynqMp board,
    the rootfs may take an extended time to recover after a suspend.
    Upon investigation, it was determined that the issue originates from a
    problem in the macb driver.
    
    According to the Zynq UltraScale TRM [1], when transmit is disabled,
    the transmit buffer queue pointer resets to point to the address
    specified by the transmit buffer queue base address register.
    
    In the current implementation, the code merely resets `queue->tx_head`
    and `queue->tx_tail` to '0'. This approach presents several issues:
    
    - Packets already queued in the tx ring are silently lost,
      leading to memory leaks since the associated skbs cannot be released.
    
    - Concurrent write access to `queue->tx_head` and `queue->tx_tail` may
      occur from `macb_tx_poll()` or `macb_start_xmit()` when these values
      are reset to '0'.
    
    - The transmission may become stuck on a packet that has already been sent
      out, with its 'TX_USED' bit set, but has not yet been processed. However,
      due to the manipulation of 'queue->tx_head' and 'queue->tx_tail',
      `macb_tx_poll()` incorrectly assumes there are no packets to handle
      because `queue->tx_head == queue->tx_tail`. This issue is only resolved
      when a new packet is placed at this position. This is the root cause of
      the prolonged recovery time observed for the NFS root filesystem.
    
    To resolve this issue, shuffle the tx ring and tx skb array so that
    the first unsent packet is positioned at the start of the tx ring.
    Additionally, ensure that updates to `queue->tx_head` and
    `queue->tx_tail` are properly protected with the appropriate lock.
    
    [1] https://docs.amd.com/v/u/en-US/ug1085-zynq-ultrascale-trm
    
    Fixes: bf9cf80cab81 ("net: macb: Fix tx/rx malfunction after phy link down and up")
    Reported-by: Quanyang Wang <[email protected]>
    Signed-off-by: Kevin Hao <[email protected]>
    Cc: [email protected]
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    [ adapted include block context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: mana: fix use-after-free in mana_hwc_destroy_channel() by reordering teardown [+ + +]

Author: Dipayaan Roy <[email protected]>
Date:   Wed Mar 11 12:22:04 2026 -0700

    net: mana: fix use-after-free in mana_hwc_destroy_channel() by reordering teardown
    
    [ Upstream commit fa103fc8f56954a60699a29215cb713448a39e87 ]
    
    A potential race condition exists in mana_hwc_destroy_channel() where
    hwc->caller_ctx is freed before the HWC's Completion Queue (CQ) and
    Event Queue (EQ) are destroyed. This allows an in-flight CQ interrupt
    handler to dereference freed memory, leading to a use-after-free or
    NULL pointer dereference in mana_hwc_handle_resp().
    
    mana_smc_teardown_hwc() signals the hardware to stop but does not
    synchronize against IRQ handlers already executing on other CPUs. The
    IRQ synchronization only happens in mana_hwc_destroy_cq() via
    mana_gd_destroy_eq() -> mana_gd_deregister_irq(). Since this runs
    after kfree(hwc->caller_ctx), a concurrent mana_hwc_rx_event_handler()
    can dereference freed caller_ctx (and rxq->msg_buf) in
    mana_hwc_handle_resp().
    
    Fix this by reordering teardown to reverse-of-creation order: destroy
    the TX/RX work queues and CQ/EQ before freeing hwc->caller_ctx. This
    ensures all in-flight interrupt handlers complete before the memory they
    access is freed.
    
    Fixes: ca9c54d2d6a5 ("net: mana: Add a driver for Microsoft Azure Network Adapter (MANA)")
    Reviewed-by: Haiyang Zhang <[email protected]>
    Signed-off-by: Dipayaan Roy <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/abHA3AjNtqa1nx9k@linuxonhyperv3.guj3yctzbm1etfxqx2vob5hsef.xx.internal.cloudapp.net
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: mana: Ring doorbell at 4 CQ wraparounds [+ + +]

Author: Long Li <[email protected]>
Date:   Thu Feb 26 11:28:33 2026 -0800

    net: mana: Ring doorbell at 4 CQ wraparounds
    
    commit dabffd08545ffa1d7183bc45e387860984025291 upstream.
    
    MANA hardware requires at least one doorbell ring every 8 wraparounds
    of the CQ. The driver rings the doorbell as a form of flow control to
    inform hardware that CQEs have been consumed.
    
    The NAPI poll functions mana_poll_tx_cq() and mana_poll_rx_cq() can
    poll up to CQE_POLLING_BUFFER (512) completions per call. If the CQ
    has fewer than 512 entries, a single poll call can process more than
    4 wraparounds without ringing the doorbell. The doorbell threshold
    check also uses ">" instead of ">=", delaying the ring by one extra
    CQE beyond 4 wraparounds. Combined, these issues can cause the driver
    to exceed the 8-wraparound hardware limit, leading to missed
    completions and stalled queues.
    
    Fix this by capping the number of CQEs polled per call to 4 wraparounds
    of the CQ in both TX and RX paths. Also change the doorbell threshold
    from ">" to ">=" so the doorbell is rung as soon as 4 wraparounds are
    reached.
    
    Cc: [email protected]
    Fixes: 58a63729c957 ("net: mana: Fix doorbell out of order violation and avoid unnecessary doorbell rings")
    Signed-off-by: Long Li <[email protected]>
    Reviewed-by: Haiyang Zhang <[email protected]>
    Reviewed-by: Vadim Fedorenko <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: mvpp2: guard flow control update with global_tx_fc in buffer switching [+ + +]

Author: Muhammad Hammad Ijaz <[email protected]>
Date:   Mon Mar 16 12:31:01 2026 -0700

    net: mvpp2: guard flow control update with global_tx_fc in buffer switching
    
    [ Upstream commit 8a63baadf08453f66eb582fdb6dd234f72024723 ]
    
    mvpp2_bm_switch_buffers() unconditionally calls
    mvpp2_bm_pool_update_priv_fc() when switching between per-cpu and
    shared buffer pool modes. This function programs CM3 flow control
    registers via mvpp2_cm3_read()/mvpp2_cm3_write(), which dereference
    priv->cm3_base without any NULL check.
    
    When the CM3 SRAM resource is not present in the device tree (the
    third reg entry added by commit 60523583b07c ("dts: marvell: add CM3
    SRAM memory to cp11x ethernet device tree")), priv->cm3_base remains
    NULL and priv->global_tx_fc is false. Any operation that triggers
    mvpp2_bm_switch_buffers(), for example an MTU change that crosses
    the jumbo frame threshold, will crash:
    
      Unable to handle kernel NULL pointer dereference at
      virtual address 0000000000000000
      Mem abort info:
        ESR = 0x0000000096000006
        EC = 0x25: DABT (current EL), IL = 32 bits
      pc : readl+0x0/0x18
      lr : mvpp2_cm3_read.isra.0+0x14/0x20
      Call trace:
       readl+0x0/0x18
       mvpp2_bm_pool_update_fc+0x40/0x12c
       mvpp2_bm_pool_update_priv_fc+0x94/0xd8
       mvpp2_bm_switch_buffers.isra.0+0x80/0x1c0
       mvpp2_change_mtu+0x140/0x380
       __dev_set_mtu+0x1c/0x38
       dev_set_mtu_ext+0x78/0x118
       dev_set_mtu+0x48/0xa8
       dev_ifsioc+0x21c/0x43c
       dev_ioctl+0x2d8/0x42c
       sock_ioctl+0x314/0x378
    
    Every other flow control call site in the driver already guards
    hardware access with either priv->global_tx_fc or port->tx_fc.
    mvpp2_bm_switch_buffers() is the only place that omits this check.
    
    Add the missing priv->global_tx_fc guard to both the disable and
    re-enable calls in mvpp2_bm_switch_buffers(), consistent with the
    rest of the driver.
    
    Fixes: 3a616b92a9d1 ("net: mvpp2: Add TX flow control support for jumbo frames")
    Signed-off-by: Muhammad Hammad Ijaz <[email protected]>
    Reviewed-by: Gunnar Kudrjavets <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: ncsi: fix skb leak in error paths [+ + +]

Author: Jian Zhang <[email protected]>
Date:   Thu Mar 5 14:06:55 2026 +0800

    net: ncsi: fix skb leak in error paths
    
    commit 5c3398a54266541610c8d0a7082e654e9ff3e259 upstream.
    
    Early return paths in NCSI RX and AEN handlers fail to release
    the received skb, resulting in a memory leak.
    
    Specifically, ncsi_aen_handler() returns on invalid AEN packets
    without consuming the skb. Similarly, ncsi_rcv_rsp() exits early
    when failing to resolve the NCSI device, response handler, or
    request, leaving the skb unfreed.
    
    CC: [email protected]
    Fixes: 7a82ecf4cfb8 ("net/ncsi: NCSI AEN packet handler")
    Fixes: 138635cc27c9 ("net/ncsi: NCSI response packet handler")
    Signed-off-by: Jian Zhang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: nexthop: fix percpu use-after-free in remove_nh_grp_entry [+ + +]

Author: Mehul Rao <[email protected]>
Date:   Fri Mar 6 18:38:20 2026 -0500

    net: nexthop: fix percpu use-after-free in remove_nh_grp_entry
    
    commit b2662e7593e94ae09b1cf7ee5f09160a3612bcb2 upstream.
    
    When removing a nexthop from a group, remove_nh_grp_entry() publishes
    the new group via rcu_assign_pointer() then immediately frees the
    removed entry's percpu stats with free_percpu(). However, the
    synchronize_net() grace period in the caller remove_nexthop_from_groups()
    runs after the free. RCU readers that entered before the publish still
    see the old group and can dereference the freed stats via
    nh_grp_entry_stats_inc() -> get_cpu_ptr(nhge->stats), causing a
    use-after-free on percpu memory.
    
    Fix by deferring the free_percpu() until after synchronize_net() in the
    caller. Removed entries are chained via nh_list onto a local deferred
    free list. After the grace period completes and all RCU readers have
    finished, the percpu stats are safely freed.
    
    Fixes: f4676ea74b85 ("net: nexthop: Add nexthop group entry stats")
    Cc: [email protected]
    Signed-off-by: Mehul Rao <[email protected]>
    Reviewed-by: Eric Dumazet <[email protected]>
    Reviewed-by: Ido Schimmel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: phy: register phy led_triggers during probe to avoid AB-BA deadlock [+ + +]

Author: Andrew Lunn <[email protected]>
Date:   Mon Mar 9 09:17:14 2026 -0400

    net: phy: register phy led_triggers during probe to avoid AB-BA deadlock
    
    [ Upstream commit c8dbdc6e380e7e96a51706db3e4b7870d8a9402d ]
    
    There is an AB-BA deadlock when both LEDS_TRIGGER_NETDEV and
    LED_TRIGGER_PHY are enabled:
    
    [ 1362.049207] [<8054e4b8>] led_trigger_register+0x5c/0x1fc             <-- Trying to get lock "triggers_list_lock" via down_write(&triggers_list_lock);
    [ 1362.054536] [<80662830>] phy_led_triggers_register+0xd0/0x234
    [ 1362.060329] [<8065e200>] phy_attach_direct+0x33c/0x40c
    [ 1362.065489] [<80651fc4>] phylink_fwnode_phy_connect+0x15c/0x23c
    [ 1362.071480] [<8066ee18>] mtk_open+0x7c/0xba0
    [ 1362.075849] [<806d714c>] __dev_open+0x280/0x2b0
    [ 1362.080384] [<806d7668>] __dev_change_flags+0x244/0x24c
    [ 1362.085598] [<806d7698>] dev_change_flags+0x28/0x78
    [ 1362.090528] [<807150e4>] dev_ioctl+0x4c0/0x654                       <-- Hold lock "rtnl_mutex" by calling rtnl_lock();
    [ 1362.094985] [<80694360>] sock_ioctl+0x2f4/0x4e0
    [ 1362.099567] [<802e9c4c>] sys_ioctl+0x32c/0xd8c
    [ 1362.104022] [<80014504>] syscall_common+0x34/0x58
    
    Here LED_TRIGGER_PHY is registering LED triggers during phy_attach
    while holding RTNL and then taking triggers_list_lock.
    
    [ 1362.191101] [<806c2640>] register_netdevice_notifier+0x60/0x168      <-- Trying to get lock "rtnl_mutex" via rtnl_lock();
    [ 1362.197073] [<805504ac>] netdev_trig_activate+0x194/0x1e4
    [ 1362.202490] [<8054e28c>] led_trigger_set+0x1d4/0x360                 <-- Hold lock "triggers_list_lock" by down_read(&triggers_list_lock);
    [ 1362.207511] [<8054eb38>] led_trigger_write+0xd8/0x14c
    [ 1362.212566] [<80381d98>] sysfs_kf_bin_write+0x80/0xbc
    [ 1362.217688] [<8037fcd8>] kernfs_fop_write_iter+0x17c/0x28c
    [ 1362.223174] [<802cbd70>] vfs_write+0x21c/0x3c4
    [ 1362.227712] [<802cc0c4>] ksys_write+0x78/0x12c
    [ 1362.232164] [<80014504>] syscall_common+0x34/0x58
    
    Here LEDS_TRIGGER_NETDEV is being enabled on an LED. It first takes
    triggers_list_lock and then RTNL. A classical AB-BA deadlock.
    
    phy_led_triggers_registers() does not require the RTNL, it does not
    make any calls into the network stack which require protection. There
    is also no requirement the PHY has been attached to a MAC, the
    triggers only make use of phydev state. This allows the call to
    phy_led_triggers_registers() to be placed elsewhere. PHY probe() and
    release() don't hold RTNL, so solving the AB-BA deadlock.
    
    Reported-by: Shiji Yang <[email protected]>
    Closes: https://lore.kernel.org/all/OS7PR01MB13602B128BA1AD3FA38B6D1FFBC69A@OS7PR01MB13602.jpnprd01.prod.outlook.com/
    Fixes: 06f502f57d0d ("leds: trigger: Introduce a NETDEV trigger")
    Cc: [email protected]
    Signed-off-by: Andrew Lunn <[email protected]>
    Tested-by: Shiji Yang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    [ adapted condition to preserve existing `!phy_driver_is_genphy_10g(phydev)` guard ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: prevent NULL deref in ip[6]tunnel_xmit() [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Thu Mar 12 04:39:08 2026 +0000

    net: prevent NULL deref in ip[6]tunnel_xmit()
    
    [ Upstream commit c38b8f5f791ecce13ab77e2257f8fd2444ba80f6 ]
    
    Blamed commit missed that both functions can be called with dev == NULL.
    
    Also add unlikely() hints for these conditions that only fuzzers can hit.
    
    Fixes: 6f1a9140ecda ("net: add xmit recursion limit to tunnel xmit functions")
    Signed-off-by: Eric Dumazet <[email protected]>
    CC: Weiming Shi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: sfp: improve Huawei MA5671a fixup [+ + +]

Author: Álvaro Fernández Rojas <[email protected]>
Date:   Fri Mar 6 13:29:55 2026 +0100

    net: sfp: improve Huawei MA5671a fixup
    
    [ Upstream commit 87d126852158467ab87d5cbc36ccfd3f15464a6c ]
    
    With the current sfp_fixup_ignore_tx_fault() fixup we ignore the TX_FAULT
    signal, but we also need to apply sfp_fixup_ignore_los() in order to be
    able to communicate with the module even if the fiber isn't connected for
    configuration purposes.
    This is needed for all the MA5671a firmwares, excluding the FS modded
    firmware.
    
    Fixes: 2069624dac19 ("net: sfp: Add tx-fault workaround for Huawei MA5671A SFP ONT")
    Signed-off-by: Álvaro Fernández Rojas <[email protected]>
    Reviewed-by: Andrew Lunn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: stmmac: remove support for lpi_intr_o [+ + +]

Author: Russell King (Oracle) <[email protected]>
Date:   Fri Mar 6 15:06:20 2026 +0000

    net: stmmac: remove support for lpi_intr_o
    
    commit 14eb64db8ff07b58a35b98375f446d9e20765674 upstream.
    
    The dwmac databook for v3.74a states that lpi_intr_o is a sideband
    signal which should be used to ungate the application clock, and this
    signal is synchronous to the receive clock. The receive clock can run
    at 2.5, 25 or 125MHz depending on the media speed, and can stop under
    the control of the link partner. This means that the time it takes to
    clear is dependent on the negotiated media speed, and thus can be 8,
    40, or 400ns after reading the LPI control and status register.
    
    It has been observed with some aggressive link partners, this clock
    can stop while lpi_intr_o is still asserted, meaning that the signal
    remains asserted for an indefinite period that the local system has
    no direct control over.
    
    The LPI interrupts will still be signalled through the main interrupt
    path in any case, and this path is not dependent on the receive clock.
    
    This, since we do not gate the application clock, and the chances of
    adding clock gating in the future are slim due to the clocks being
    ill-defined, lpi_intr_o serves no useful purpose. Remove the code which
    requests the interrupt, and all associated code.
    
    Reported-by: Ovidiu Panait <[email protected]>
    Tested-by: Ovidiu Panait <[email protected]> # Renesas RZ/V2H board
    Signed-off-by: Russell King (Oracle) <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Ovidiu Panait <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: usb: aqc111: Do not perform PM inside suspend callback [+ + +]

Author: Nikola Z. Ivanov <[email protected]>
Date:   Fri Mar 13 16:16:43 2026 +0200

    net: usb: aqc111: Do not perform PM inside suspend callback
    
    [ Upstream commit 069c8f5aebe4d5224cf62acc7d4b3486091c658a ]
    
    syzbot reports "task hung in rpm_resume"
    
    This is caused by aqc111_suspend calling
    the PM variant of its write_cmd routine.
    
    The simplified call trace looks like this:
    
    rpm_suspend()
      usb_suspend_both() - here udev->dev.power.runtime_status == RPM_SUSPENDING
        aqc111_suspend() - called for the usb device interface
          aqc111_write32_cmd()
            usb_autopm_get_interface()
              pm_runtime_resume_and_get()
                rpm_resume() - here we call rpm_resume() on our parent
                  rpm_resume() - Here we wait for a status change that will never happen.
    
    At this point we block another task which holds
    rtnl_lock and locks up the whole networking stack.
    
    Fix this by replacing the write_cmd calls with their _nopm variants
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=48dc1e8dfc92faf1124c
    Fixes: e58ba4544c77 ("net: usb: aqc111: Add support for wake on LAN by MAGIC packet")
    Signed-off-by: Nikola Z. Ivanov <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: usb: cdc_ncm: add ndpoffset to NDP16 nframes bounds check [+ + +]

Author: Tobi Gaertner <[email protected]>
Date:   Fri Mar 13 22:46:39 2026 -0700

    net: usb: cdc_ncm: add ndpoffset to NDP16 nframes bounds check
    
    [ Upstream commit 2aa8a4fa8d5b7d0e1ebcec100e1a4d80a1f4b21a ]
    
    cdc_ncm_rx_verify_ndp16() validates that the NDP header and its DPE
    entries fit within the skb. The first check correctly accounts for
    ndpoffset:
    
      if ((ndpoffset + sizeof(struct usb_cdc_ncm_ndp16)) > skb_in->len)
    
    but the second check omits it:
    
      if ((sizeof(struct usb_cdc_ncm_ndp16) +
           ret * (sizeof(struct usb_cdc_ncm_dpe16))) > skb_in->len)
    
    This validates the DPE array size against the total skb length as if
    the NDP were at offset 0, rather than at ndpoffset. When the NDP is
    placed near the end of the NTB (large wNdpIndex), the DPE entries can
    extend past the skb data buffer even though the check passes.
    cdc_ncm_rx_fixup() then reads out-of-bounds memory when iterating
    the DPE array.
    
    Add ndpoffset to the nframes bounds check and use struct_size_t() to
    express the NDP-plus-DPE-array size more clearly.
    
    Fixes: ff06ab13a4cc ("net: cdc_ncm: splitting rx_fixup for code reuse")
    Signed-off-by: Tobi Gaertner <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: usb: cdc_ncm: add ndpoffset to NDP32 nframes bounds check [+ + +]

Author: Tobi Gaertner <[email protected]>
Date:   Fri Mar 13 22:46:40 2026 -0700

    net: usb: cdc_ncm: add ndpoffset to NDP32 nframes bounds check
    
    [ Upstream commit 77914255155e68a20aa41175edeecf8121dac391 ]
    
    The same bounds-check bug fixed for NDP16 in the previous patch also
    exists in cdc_ncm_rx_verify_ndp32(). The DPE array size is validated
    against the total skb length without accounting for ndpoffset, allowing
    out-of-bounds reads when the NDP32 is placed near the end of the NTB.
    
    Add ndpoffset to the nframes bounds check and use struct_size_t() to
    express the NDP-plus-DPE-array size more clearly.
    
    Compile-tested only.
    
    Fixes: 0fa81b304a79 ("cdc_ncm: Implement the 32-bit version of NCM Transfer Block")
    Signed-off-by: Tobi Gaertner <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: usb: lan78xx: fix silent drop of packets with checksum errors [+ + +]

Author: Oleksij Rempel <[email protected]>
Date:   Thu Mar 5 15:34:26 2026 +0100

    net: usb: lan78xx: fix silent drop of packets with checksum errors
    
    commit e4f774a0cc955ce762aec91c66915a6e15087ab7 upstream.
    
    Do not drop packets with checksum errors at the USB driver level;
    pass them to the network stack.
    
    Previously, the driver dropped all packets where the 'Receive Error
    Detected' (RED) bit was set, regardless of the specific error type. This
    caused packets with only IP or TCP/UDP checksum errors to be dropped
    before reaching the kernel, preventing the network stack from accounting
    for them or performing software fallback.
    
    Add a mask for hard hardware errors to safely drop genuinely corrupt
    frames, while allowing checksum-errored frames to pass with their
    ip_summed field explicitly set to CHECKSUM_NONE.
    
    Fixes: 55d7de9de6c3 ("Microchip's LAN7800 family USB 2/3 to 10/100/1000 Ethernet device driver")
    Cc: [email protected]
    Signed-off-by: Oleksij Rempel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: usb: lan78xx: fix TX byte statistics for small packets [+ + +]

Author: Oleksij Rempel <[email protected]>
Date:   Thu Mar 5 15:34:27 2026 +0100

    net: usb: lan78xx: fix TX byte statistics for small packets
    
    commit 50988747c30df47b73b787f234f746027cb7ec6c upstream.
    
    Account for hardware auto-padding in TX byte counters to reflect actual
    wire traffic.
    
    The LAN7850 hardware automatically pads undersized frames to the minimum
    Ethernet frame length (ETH_ZLEN, 60 bytes). However, the driver tracks
    the network statistics based on the unpadded socket buffer length. This
    results in the tx_bytes counter under-reporting the actual physical
    bytes placed on the Ethernet wire for small packets (like short ARP or
    ICMP requests).
    
    Use max_t() to ensure the transmission statistics accurately account for
    the hardware-generated padding.
    
    Fixes: d383216a7efe ("lan78xx: Introduce Tx URB processing improvements")
    Cc: [email protected]
    Signed-off-by: Oleksij Rempel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: usb: lan78xx: skip LTM configuration for LAN7850 [+ + +]

Author: Oleksij Rempel <[email protected]>
Date:   Thu Mar 5 15:34:28 2026 +0100

    net: usb: lan78xx: skip LTM configuration for LAN7850
    
    commit d9cc0e440f0664f6f3e2c26e39ab9dd5f3badba7 upstream.
    
    Do not configure Latency Tolerance Messaging (LTM) on USB 2.0 hardware.
    
    The LAN7850 is a High-Speed (USB 2.0) only device and does not support
    SuperSpeed features like LTM. Currently, the driver unconditionally
    attempts to configure LTM registers during initialization. On the
    LAN7850, these registers do not exist, resulting in writes to invalid
    or undocumented memory space.
    
    This issue was identified during a port to the regmap API with strict
    register validation enabled. While no functional issues or crashes have
    been observed from these invalid writes, bypassing LTM initialization
    on the LAN7850 ensures the driver strictly adheres to the hardware's
    valid register map.
    
    Fixes: 55d7de9de6c3 ("Microchip's LAN7800 family USB 2/3 to 10/100/1000 Ethernet device driver")
    Cc: [email protected]
    Signed-off-by: Oleksij Rempel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

netfilter: bpf: defer hook memory release until rcu readers are done [+ + +]

Author: Florian Westphal <[email protected]>
Date:   Tue Mar 17 12:23:08 2026 +0100

    netfilter: bpf: defer hook memory release until rcu readers are done
    
    [ Upstream commit 24f90fa3994b992d1a09003a3db2599330a5232a ]
    
    Yiming Qian reports UaF when concurrent process is dumping hooks via
    nfnetlink_hooks:
    
    BUG: KASAN: slab-use-after-free in nfnl_hook_dump_one.isra.0+0xe71/0x10f0
    Read of size 8 at addr ffff888003edbf88 by task poc/79
    Call Trace:
     <TASK>
     nfnl_hook_dump_one.isra.0+0xe71/0x10f0
     netlink_dump+0x554/0x12b0
     nfnl_hook_get+0x176/0x230
     [..]
    
    Defer release until after concurrent readers have completed.
    
    Reported-by: Yiming Qian <[email protected]>
    Fixes: 84601d6ee68a ("bpf: add bpf_link support for BPF_NETFILTER programs")
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: ctnetlink: fix use-after-free in ctnetlink_dump_exp_ct() [+ + +]

Author: Hyunwoo Kim <[email protected]>
Date:   Sun Mar 8 02:21:37 2026 +0900

    netfilter: ctnetlink: fix use-after-free in ctnetlink_dump_exp_ct()
    
    [ Upstream commit 5cb81eeda909dbb2def209dd10636b51549a3f8a ]
    
    ctnetlink_dump_exp_ct() stores a conntrack pointer in cb->data for the
    netlink dump callback ctnetlink_exp_ct_dump_table(), but drops the
    conntrack reference immediately after netlink_dump_start().  When the
    dump spans multiple rounds, the second recvmsg() triggers the dump
    callback which dereferences the now-freed conntrack via nfct_help(ct),
    leading to a use-after-free on ct->ext.
    
    The bug is that the netlink_dump_control has no .start or .done
    callbacks to manage the conntrack reference across dump rounds.  Other
    dump functions in the same file (e.g. ctnetlink_get_conntrack) properly
    use .start/.done callbacks for this purpose.
    
    Fix this by adding .start and .done callbacks that hold and release the
    conntrack reference for the duration of the dump, and move the
    nfct_help() call after the cb->args[0] early-return check in the dump
    callback to avoid dereferencing ct->ext unnecessarily.
    
     BUG: KASAN: slab-use-after-free in ctnetlink_exp_ct_dump_table+0x4f/0x2e0
     Read of size 8 at addr ffff88810597ebf0 by task ctnetlink_poc/133
    
     CPU: 1 UID: 0 PID: 133 Comm: ctnetlink_poc Not tainted 7.0.0-rc2+ #3 PREEMPTLAZY
     Call Trace:
      <TASK>
      ctnetlink_exp_ct_dump_table+0x4f/0x2e0
      netlink_dump+0x333/0x880
      netlink_recvmsg+0x3e2/0x4b0
      ? aa_sk_perm+0x184/0x450
      sock_recvmsg+0xde/0xf0
    
     Allocated by task 133:
      kmem_cache_alloc_noprof+0x134/0x440
      __nf_conntrack_alloc+0xa8/0x2b0
      ctnetlink_create_conntrack+0xa1/0x900
      ctnetlink_new_conntrack+0x3cf/0x7d0
      nfnetlink_rcv_msg+0x48e/0x510
      netlink_rcv_skb+0xc9/0x1f0
      nfnetlink_rcv+0xdb/0x220
      netlink_unicast+0x3ec/0x590
      netlink_sendmsg+0x397/0x690
      __sys_sendmsg+0xf4/0x180
    
     Freed by task 0:
      slab_free_after_rcu_debug+0xad/0x1e0
      rcu_core+0x5c3/0x9c0
    
    Fixes: e844a928431f ("netfilter: ctnetlink: allow to dump expectation per master conntrack")
    Signed-off-by: Hyunwoo Kim <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: ctnetlink: remove refcounting in expectation dumpers [+ + +]

Author: Florian Westphal <[email protected]>
Date:   Fri Aug 1 17:25:09 2025 +0200

    netfilter: ctnetlink: remove refcounting in expectation dumpers
    
    [ Upstream commit 1492e3dcb2be3aa46d1963da96aa9593e4e4db5a ]
    
    Same pattern as previous patch: do not keep the expectation object
    alive via refcount, only store a cookie value and then use that
    as the skip hint for dump resumption.
    
    AFAICS this has the same issue as the one resolved in the conntrack
    dumper, when we do
      if (!refcount_inc_not_zero(&exp->use))
    
    to increment the refcount, there is a chance that exp == last, which
    causes a double-increment of the refcount and subsequent memory leak.
    
    Fixes: cf6994c2b981 ("[NETFILTER]: nf_conntrack_netlink: sync expectation dumping with conntrack table dumping")
    Fixes: e844a928431f ("netfilter: ctnetlink: allow to dump expectation per master conntrack")
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Stable-dep-of: 5cb81eeda909 ("netfilter: ctnetlink: fix use-after-free in ctnetlink_dump_exp_ct()")
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_conntrack_h323: check for zero length in DecodeQ931() [+ + +]

Author: Jenny Guanni Qu <[email protected]>
Date:   Thu Mar 12 14:49:50 2026 +0000

    netfilter: nf_conntrack_h323: check for zero length in DecodeQ931()
    
    [ Upstream commit f173d0f4c0f689173f8cdac79991043a4a89bf66 ]
    
    In DecodeQ931(), the UserUserIE code path reads a 16-bit length from
    the packet, then decrements it by 1 to skip the protocol discriminator
    byte before passing it to DecodeH323_UserInformation(). If the encoded
    length is 0, the decrement wraps to -1, which is then passed as a
    large value to the decoder, leading to an out-of-bounds read.
    
    Add a check to ensure len is positive after the decrement.
    
    Fixes: 5e35941d9901 ("[NETFILTER]: Add H.323 conntrack/NAT helper")
    Reported-by: Klaudia Kloc <[email protected]>
    Reported-by: Dawid Moczadło <[email protected]>
    Tested-by: Jenny Guanni Qu <[email protected]>
    Signed-off-by: Jenny Guanni Qu <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_conntrack_h323: fix OOB read in decode_int() CONS case [+ + +]

Author: Jenny Guanni Qu <[email protected]>
Date:   Thu Mar 12 02:29:32 2026 +0000

    netfilter: nf_conntrack_h323: fix OOB read in decode_int() CONS case
    
    [ Upstream commit 1e3a3593162c96e8a8de48b1e14f60c3b57fca8a ]
    
    In decode_int(), the CONS case calls get_bits(bs, 2) to read a length
    value, then calls get_uint(bs, len) without checking that len bytes
    remain in the buffer. The existing boundary check only validates the
    2 bits for get_bits(), not the subsequent 1-4 bytes that get_uint()
    reads. This allows a malformed H.323/RAS packet to cause a 1-4 byte
    slab-out-of-bounds read.
    
    Add a boundary check for len bytes after get_bits() and before
    get_uint().
    
    Fixes: 5e35941d9901 ("[NETFILTER]: Add H.323 conntrack/NAT helper")
    Reported-by: Klaudia Kloc <[email protected]>
    Reported-by: Dawid Moczadło <[email protected]>
    Signed-off-by: Jenny Guanni Qu <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_conntrack_sip: fix Content-Length u32 truncation in sip_help_tcp() [+ + +]

Author: Lukas Johannes Möller <[email protected]>
Date:   Tue Mar 10 21:49:01 2026 +0000

    netfilter: nf_conntrack_sip: fix Content-Length u32 truncation in sip_help_tcp()
    
    [ Upstream commit fbce58e719a17aa215c724473fd5baaa4a8dc57c ]
    
    sip_help_tcp() parses the SIP Content-Length header with
    simple_strtoul(), which returns unsigned long, but stores the result in
    unsigned int clen.  On 64-bit systems, values exceeding UINT_MAX are
    silently truncated before computing the SIP message boundary.
    
    For example, Content-Length 4294967328 (2^32 + 32) is truncated to 32,
    causing the parser to miscalculate where the current message ends.  The
    loop then treats trailing data in the TCP segment as a second SIP
    message and processes it through the SDP parser.
    
    Fix this by changing clen to unsigned long to match the return type of
    simple_strtoul(), and reject Content-Length values that exceed the
    remaining TCP payload length.
    
    Fixes: f5b321bd37fb ("netfilter: nf_conntrack_sip: add TCP support")
    Signed-off-by: Lukas Johannes Möller <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_tables: always walk all pending catchall elements [+ + +]

Author: Florian Westphal <[email protected]>
Date:   Thu Mar 5 21:32:00 2026 +0100

    netfilter: nf_tables: always walk all pending catchall elements
    
    [ Upstream commit 7cb9a23d7ae40a702577d3d8bacb7026f04ac2a9 ]
    
    During transaction processing we might have more than one catchall element:
    1 live catchall element and 1 pending element that is coming as part of the
    new batch.
    
    If the map holding the catchall elements is also going away, its
    required to toggle all catchall elements and not just the first viable
    candidate.
    
    Otherwise, we get:
     WARNING: ./include/net/netfilter/nf_tables.h:1281 at nft_data_release+0xb7/0xe0 [nf_tables], CPU#2: nft/1404
     RIP: 0010:nft_data_release+0xb7/0xe0 [nf_tables]
     [..]
     __nft_set_elem_destroy+0x106/0x380 [nf_tables]
     nf_tables_abort_release+0x348/0x8d0 [nf_tables]
     nf_tables_abort+0xcf2/0x3ac0 [nf_tables]
     nfnetlink_rcv_batch+0x9c9/0x20e0 [..]
    
    Fixes: 628bd3e49cba ("netfilter: nf_tables: drop map element references from preparation phase")
    Reported-by: Yiming Qian <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_tables: release flowtable after rcu grace period on error [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Tue Mar 17 20:00:26 2026 +0100

    netfilter: nf_tables: release flowtable after rcu grace period on error
    
    [ Upstream commit d73f4b53aaaea4c95f245e491aa5eeb8a21874ce ]
    
    Call synchronize_rcu() after unregistering the hooks from error path,
    since a hook that already refers to this flowtable can be already
    registered, exposing this flowtable to packet path and nfnetlink_hook
    control plane.
    
    This error path is rare, it should only happen by reaching the maximum
    number hooks or by failing to set up to hardware offload, just call
    synchronize_rcu().
    
    There is a check for already used device hooks by different flowtable
    that could result in EEXIST at this late stage. The hook parser can be
    updated to perform this check earlier to this error path really becomes
    rarely exercised.
    
    Uncovered by KASAN reported as use-after-free from nfnetlink_hook path
    when dumping hooks.
    
    Fixes: 3b49e2e94e6e ("netfilter: nf_tables: add flow table netlink frontend")
    Reported-by: Yiming Qian <[email protected]>
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nfnetlink_cthelper: fix OOB read in nfnl_cthelper_dump_table() [+ + +]

Author: Hyunwoo Kim <[email protected]>
Date:   Sun Mar 8 02:23:34 2026 +0900

    netfilter: nfnetlink_cthelper: fix OOB read in nfnl_cthelper_dump_table()
    
    [ Upstream commit 6dcee8496d53165b2d8a5909b3050b62ae71fe89 ]
    
    nfnl_cthelper_dump_table() has a 'goto restart' that jumps to a label
    inside the for loop body.  When the "last" helper saved in cb->args[1]
    is deleted between dump rounds, every entry fails the (cur != last)
    check, so cb->args[1] is never cleared.  The for loop finishes with
    cb->args[0] == nf_ct_helper_hsize, and the 'goto restart' jumps back
    into the loop body bypassing the bounds check, causing an 8-byte
    out-of-bounds read on nf_ct_helper_hash[nf_ct_helper_hsize].
    
    The 'goto restart' block was meant to re-traverse the current bucket
    when "last" is no longer found, but it was placed after the for loop
    instead of inside it.  Move the block into the for loop body so that
    the restart only occurs while cb->args[0] is still within bounds.
    
     BUG: KASAN: slab-out-of-bounds in nfnl_cthelper_dump_table+0x9f/0x1b0
     Read of size 8 at addr ffff888104ca3000 by task poc_cthelper/131
     Call Trace:
      nfnl_cthelper_dump_table+0x9f/0x1b0
      netlink_dump+0x333/0x880
      netlink_recvmsg+0x3e2/0x4b0
      sock_recvmsg+0xde/0xf0
      __sys_recvfrom+0x150/0x200
      __x64_sys_recvfrom+0x76/0x90
      do_syscall_64+0xc3/0x6e0
    
     Allocated by task 1:
      __kvmalloc_node_noprof+0x21b/0x700
      nf_ct_alloc_hashtable+0x65/0xd0
      nf_conntrack_helper_init+0x21/0x60
      nf_conntrack_init_start+0x18d/0x300
      nf_conntrack_standalone_init+0x12/0xc0
    
    Fixes: 12f7a505331e ("netfilter: add user-space connection tracking helper infrastructure")
    Signed-off-by: Hyunwoo Kim <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nfnetlink_queue: fix entry leak in bridge verdict error path [+ + +]

Author: Hyunwoo Kim <[email protected]>
Date:   Sun Mar 8 02:24:06 2026 +0900

    netfilter: nfnetlink_queue: fix entry leak in bridge verdict error path
    
    [ Upstream commit f1ba83755d81c6fc66ac7acd723d238f974091e9 ]
    
    nfqnl_recv_verdict() calls find_dequeue_entry() to remove the queue
    entry from the queue data structures, taking ownership of the entry.
    For PF_BRIDGE packets, it then calls nfqa_parse_bridge() to parse VLAN
    attributes.  If nfqa_parse_bridge() returns an error (e.g. NFQA_VLAN
    present but NFQA_VLAN_TCI missing), the function returns immediately
    without freeing the dequeued entry or its sk_buff.
    
    This leaks the nf_queue_entry, its associated sk_buff, and all held
    references (net_device refcounts, struct net refcount).  Repeated
    triggering exhausts kernel memory.
    
    Fix this by dropping the entry via nfqnl_reinject() with NF_DROP verdict
    on the error path, consistent with other error handling in this file.
    
    Fixes: 8d45ff22f1b4 ("netfilter: bridge: nf queue verdict to use NFQA_VLAN and NFQA_L2HDR")
    Reviewed-by: David Dull <[email protected]>
    Signed-off-by: Hyunwoo Kim <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nft_ct: drop pending enqueued packets on removal [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Thu Mar 12 13:48:47 2026 +0100

    netfilter: nft_ct: drop pending enqueued packets on removal
    
    [ Upstream commit 36eae0956f659e48d5366d9b083d9417f3263ddc ]
    
    Packets sitting in nfqueue might hold a reference to:
    
    - templates that specify the conntrack zone, because a percpu area is
      used and module removal is possible.
    - conntrack timeout policies and helper, where object removal leave
      a stale reference.
    
    Since these objects can just go away, drop enqueued packets to avoid
    stale reference to them.
    
    If there is a need for finer grain removal, this logic can be revisited
    to make selective packet drop upon dependencies.
    
    Fixes: 7e0b2b57f01d ("netfilter: nft_ct: add ct timeout support")
    Reported-by: Yiming Qian <[email protected]>
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nft_set_pipapo: fix stack out-of-bounds read in pipapo_drop() [+ + +]

Author: Jenny Guanni Qu <[email protected]>
Date:   Fri Mar 6 19:12:38 2026 +0000

    netfilter: nft_set_pipapo: fix stack out-of-bounds read in pipapo_drop()
    
    [ Upstream commit d6d8cd2db236a9dd13dbc2d05843b3445cc964b5 ]
    
    pipapo_drop() passes rulemap[i + 1].n to pipapo_unmap() as the
    to_offset argument on every iteration, including the last one where
    i == m->field_count - 1. This reads one element past the end of the
    stack-allocated rulemap array (declared as rulemap[NFT_PIPAPO_MAX_FIELDS]
    with NFT_PIPAPO_MAX_FIELDS == 16).
    
    Although pipapo_unmap() returns early when is_last is true without
    using the to_offset value, the argument is evaluated at the call site
    before the function body executes, making this a genuine out-of-bounds
    stack read confirmed by KASAN:
    
      BUG: KASAN: stack-out-of-bounds in pipapo_drop+0x50c/0x57c [nf_tables]
      Read of size 4 at addr ffff8000810e71a4
    
      This frame has 1 object:
       [32, 160) 'rulemap'
    
      The buggy address is at offset 164 -- exactly 4 bytes past the end
      of the rulemap array.
    
    Pass 0 instead of rulemap[i + 1].n on the last iteration to avoid
    the out-of-bounds read.
    
    Fixes: 3c4287f62044 ("nf_tables: Add set type for arbitrary concatenation of ranges")
    Signed-off-by: Jenny Guanni Qu <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: x_tables: guard option walkers against 1-byte tail reads [+ + +]

Author: David Dull <[email protected]>
Date:   Sat Mar 7 20:26:21 2026 +0200

    netfilter: x_tables: guard option walkers against 1-byte tail reads
    
    [ Upstream commit cfe770220ac2dbd3e104c6b45094037455da81d4 ]
    
    When the last byte of options is a non-single-byte option kind, walkers
    that advance with i += op[i + 1] ? : 1 can read op[i + 1] past the end
    of the option area.
    
    Add an explicit i == optlen - 1 check before dereferencing op[i + 1]
    in xt_tcpudp and xt_dccp option walkers.
    
    Fixes: 2e4e6a17af35 ("[NETFILTER] x_tables: Abstraction layer for {ip,ip6,arp}_tables")
    Signed-off-by: David Dull <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: xt_CT: drop pending enqueued packets on template removal [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Thu Mar 12 13:48:48 2026 +0100

    netfilter: xt_CT: drop pending enqueued packets on template removal
    
    [ Upstream commit f62a218a946b19bb59abdd5361da85fa4606b96b ]
    
    Templates refer to objects that can go away while packets are sitting in
    nfqueue refer to:
    
    - helper, this can be an issue on module removal.
    - timeout policy, nfnetlink_cttimeout might remove it.
    
    The use of templates with zone and event cache filter are safe, since
    this just copies values.
    
    Flush these enqueued packets in case the template rule gets removed.
    
    Fixes: 24de58f46516 ("netfilter: xt_CT: allow to attach timeout policy + glue code")
    Reported-by: Yiming Qian <[email protected]>
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: xt_IDLETIMER: reject rev0 reuse of ALARM timer labels [+ + +]

Author: Yuan Tan <[email protected]>
Date:   Mon Mar 9 03:41:46 2026 -0700

    netfilter: xt_IDLETIMER: reject rev0 reuse of ALARM timer labels
    
    [ Upstream commit 329f0b9b48ee6ab59d1ab72fef55fe8c6463a6cf ]
    
    IDLETIMER revision 0 rules reuse existing timers by label and always call
    mod_timer() on timer->timer.
    
    If the label was created first by revision 1 with XT_IDLETIMER_ALARM,
    the object uses alarm timer semantics and timer->timer is never initialized.
    Reusing that object from revision 0 causes mod_timer() on an uninitialized
    timer_list, triggering debugobjects warnings and possible panic when
    panic_on_warn=1.
    
    Fix this by rejecting revision 0 rule insertion when an existing timer with
    the same label is of ALARM type.
    
    Fixes: 68983a354a65 ("netfilter: xtables: Add snapshot of hardidletimer target")
    Co-developed-by: Yifan Wu <[email protected]>
    Signed-off-by: Yifan Wu <[email protected]>
    Co-developed-by: Juefei Pu <[email protected]>
    Signed-off-by: Juefei Pu <[email protected]>
    Signed-off-by: Yuan Tan <[email protected]>
    Signed-off-by: Xin Liu <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: xt_time: use unsigned int for monthday bit shift [+ + +]

Author: Jenny Guanni Qu <[email protected]>
Date:   Thu Mar 12 14:59:49 2026 +0000

    netfilter: xt_time: use unsigned int for monthday bit shift
    
    [ Upstream commit 00050ec08cecfda447e1209b388086d76addda3a ]
    
    The monthday field can be up to 31, and shifting a signed integer 1
    by 31 positions (1 << 31) is undefined behavior in C, as the result
    overflows a 32-bit signed int. Use 1U to ensure well-defined behavior
    for all valid monthday values.
    
    Change the weekday shift to 1U as well for consistency.
    
    Fixes: ee4411a1b1e0 ("[NETFILTER]: x_tables: add xt_time match")
    Reported-by: Klaudia Kloc <[email protected]>
    Reported-by: Dawid Moczadło <[email protected]>
    Tested-by: Jenny Guanni Qu <[email protected]>
    Signed-off-by: Jenny Guanni Qu <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nf_tables: nft_dynset: fix possible stateful expression memleak in error path [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Thu Mar 12 12:38:59 2026 +0100

    nf_tables: nft_dynset: fix possible stateful expression memleak in error path
    
    [ Upstream commit 0548a13b5a145b16e4da0628b5936baf35f51b43 ]
    
    If cloning the second stateful expression in the element via GFP_ATOMIC
    fails, then the first stateful expression remains in place without being
    released.
    
       unreferenced object (percpu) 0x607b97e9cab8 (size 16):
         comm "softirq", pid 0, jiffies 4294931867
         hex dump (first 16 bytes on cpu 3):
           00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
         backtrace (crc 0):
           pcpu_alloc_noprof+0x453/0xd80
           nft_counter_clone+0x9c/0x190 [nf_tables]
           nft_expr_clone+0x8f/0x1b0 [nf_tables]
           nft_dynset_new+0x2cb/0x5f0 [nf_tables]
           nft_rhash_update+0x236/0x11c0 [nf_tables]
           nft_dynset_eval+0x11f/0x670 [nf_tables]
           nft_do_chain+0x253/0x1700 [nf_tables]
           nft_do_chain_ipv4+0x18d/0x270 [nf_tables]
           nf_hook_slow+0xaa/0x1e0
           ip_local_deliver+0x209/0x330
    
    Fixes: 563125a73ac3 ("netfilter: nftables: generalize set extension to support for several expressions")
    Reported-by: Gurpreet Shergill <[email protected]>
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

NFC: nxp-nci: allow GPIOs to sleep [+ + +]

Author: Ian Ray <[email protected]>
Date:   Tue Mar 17 10:53:36 2026 +0200

    NFC: nxp-nci: allow GPIOs to sleep
    
    commit 55dc632ab2ac2889b15995a9eef56c753d48ebc7 upstream.
    
    Allow the firmware and enable GPIOs to sleep.
    
    This fixes a `WARN_ON' and allows the driver to operate GPIOs which are
    connected to I2C GPIO expanders.
    
    -- >8 --
    kernel: WARNING: CPU: 3 PID: 2636 at drivers/gpio/gpiolib.c:3880 gpiod_set_value+0x88/0x98
    -- >8 --
    
    Fixes: 43201767b44c ("NFC: nxp-nci: Convert to use GPIO descriptor")
    Cc: [email protected]
    Signed-off-by: Ian Ray <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nfnetlink_osf: validate individual option lengths in fingerprints [+ + +]

Author: Weiming Shi <[email protected]>
Date:   Thu Mar 19 15:32:44 2026 +0800

    nfnetlink_osf: validate individual option lengths in fingerprints
    
    [ Upstream commit dbdfaae9609629a9569362e3b8f33d0a20fd783c ]
    
    nfnl_osf_add_callback() validates opt_num bounds and string
    NUL-termination but does not check individual option length fields.
    A zero-length option causes nf_osf_match_one() to enter the option
    matching loop even when foptsize sums to zero, which matches packets
    with no TCP options where ctx->optp is NULL:
    
     Oops: general protection fault
     KASAN: null-ptr-deref in range [0x0000000000000000-0x0000000000000007]
     RIP: 0010:nf_osf_match_one (net/netfilter/nfnetlink_osf.c:98)
     Call Trace:
      nf_osf_match (net/netfilter/nfnetlink_osf.c:227)
      xt_osf_match_packet (net/netfilter/xt_osf.c:32)
      ipt_do_table (net/ipv4/netfilter/ip_tables.c:293)
      nf_hook_slow (net/netfilter/core.c:623)
      ip_local_deliver (net/ipv4/ip_input.c:262)
      ip_rcv (net/ipv4/ip_input.c:573)
    
    Additionally, an MSS option (kind=2) with length < 4 causes
    out-of-bounds reads when nf_osf_match_one() unconditionally accesses
    optp[2] and optp[3] for MSS value extraction.  While RFC 9293
    section 3.2 specifies that the MSS option is always exactly 4
    bytes (Kind=2, Length=4), the check uses "< 4" rather than
    "!= 4" because lengths greater than 4 do not cause memory
    safety issues -- the buffer is guaranteed to be at least
    foptsize bytes by the ctx->optsize == foptsize check.
    
    Reject fingerprints where any option has zero length, or where an MSS
    option has length less than 4, at add time rather than trusting these
    values in the packet matching hot path.
    
    Fixes: 11eeef41d5f6 ("netfilter: passive OS fingerprint xtables match")
    Reported-by: Xiang Mei <[email protected]>
    Signed-off-by: Weiming Shi <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nfs: return EISDIR on nfs3_proc_create if d_alias is a dir [+ + +]

Author: Roberto Bergantinos Corpas <[email protected]>
Date:   Thu Feb 19 13:04:40 2026 +0100

    nfs: return EISDIR on nfs3_proc_create if d_alias is a dir
    
    [ Upstream commit 410666a298c34ebd57256fde6b24c96bd23059a2 ]
    
    If we found an alias through nfs3_do_create/nfs_add_or_obtain
    /d_splice_alias which happens to be a dir dentry, we don't return
    any error, and simply forget about this alias, but the original
    dentry we were adding and passed as parameter remains negative.
    
    This later causes an oops on nfs_atomic_open_v23/finish_open since we
    supply a negative dentry to do_dentry_open.
    
    This has been observed running lustre-racer, where dirs and files are
    created/removed concurrently with the same name and O_EXCL is not
    used to open files (frequent file redirection).
    
    While d_splice_alias typically returns a directory alias or NULL, we
    explicitly check d_is_dir() to ensure that we don't attempt to perform
    file operations (like finish_open) on a directory inode, which triggers
    the observed oops.
    
    Fixes: 7c6c5249f061 ("NFS: add atomic_open for NFSv3 to handle O_TRUNC correctly.")
    Reviewed-by: Olga Kornievskaia <[email protected]>
    Reviewed-by: Scott Mayhew <[email protected]>
    Signed-off-by: Roberto Bergantinos Corpas <[email protected]>
    Signed-off-by: Anna Schumaker <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nfsd: Fix cred ref leak in nfsd_nl_listener_set_doit(). [+ + +]

Author: Kuniyuki Iwashima <[email protected]>
Date:   Sat Jan 24 04:18:41 2026 +0000

    nfsd: Fix cred ref leak in nfsd_nl_listener_set_doit().
    
    commit 92978c83bb4eef55d02a6c990c01c423131eefa7 upstream.
    
    nfsd_nl_listener_set_doit() uses get_current_cred() without
    put_cred().
    
    As we can see from other callers, svc_xprt_create_from_sa()
    does not require the extra refcount.
    
    nfsd_nl_listener_set_doit() is always in the process context,
    sendmsg(), and current->cred does not go away.
    
    Let's use current_cred() in nfsd_nl_listener_set_doit().
    
    Fixes: 16a471177496 ("NFSD: add listener-{set,get} netlink command")
    Cc: [email protected]
    Signed-off-by: Kuniyuki Iwashima <[email protected]>
    Reviewed-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nfsd: fix heap overflow in NFSv4.0 LOCK replay cache [+ + +]

Author: Jeff Layton <[email protected]>
Date:   Tue Feb 24 11:33:35 2026 -0500

    nfsd: fix heap overflow in NFSv4.0 LOCK replay cache
    
    commit 5133b61aaf437e5f25b1b396b14242a6bb0508e2 upstream.
    
    The NFSv4.0 replay cache uses a fixed 112-byte inline buffer
    (rp_ibuf[NFSD4_REPLAY_ISIZE]) to store encoded operation responses.
    This size was calculated based on OPEN responses and does not account
    for LOCK denied responses, which include the conflicting lock owner as
    a variable-length field up to 1024 bytes (NFS4_OPAQUE_LIMIT).
    
    When a LOCK operation is denied due to a conflict with an existing lock
    that has a large owner, nfsd4_encode_operation() copies the full encoded
    response into the undersized replay buffer via read_bytes_from_xdr_buf()
    with no bounds check. This results in a slab-out-of-bounds write of up
    to 944 bytes past the end of the buffer, corrupting adjacent heap memory.
    
    This can be triggered remotely by an unauthenticated attacker with two
    cooperating NFSv4.0 clients: one sets a lock with a large owner string,
    then the other requests a conflicting lock to provoke the denial.
    
    We could fix this by increasing NFSD4_REPLAY_ISIZE to allow for a full
    opaque, but that would increase the size of every stateowner, when most
    lockowners are not that large.
    
    Instead, fix this by checking the encoded response length against
    NFSD4_REPLAY_ISIZE before copying into the replay buffer. If the
    response is too large, set rp_buflen to 0 to skip caching the replay
    payload. The status is still cached, and the client already received the
    correct response on the original request.
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Reported-by: Nicholas Carlini <[email protected]>
    Tested-by: Nicholas Carlini <[email protected]>
    Signed-off-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

NFSD: Hold net reference for the lifetime of /proc/fs/nfs/exports fd [+ + +]

Author: Chuck Lever <[email protected]>
Date:   Thu Feb 19 16:50:17 2026 -0500

    NFSD: Hold net reference for the lifetime of /proc/fs/nfs/exports fd
    
    commit e7fcf179b82d3a3730fd8615da01b087cc654d0b upstream.
    
    The /proc/fs/nfs/exports proc entry is created at module init
    and persists for the module's lifetime. exports_proc_open()
    captures the caller's current network namespace and stores
    its svc_export_cache in seq->private, but takes no reference
    on the namespace. If the namespace is subsequently torn down
    (e.g. container destruction after the opener does setns() to a
    different namespace), nfsd_net_exit() calls nfsd_export_shutdown()
    which frees the cache. Subsequent reads on the still-open fd
    dereference the freed cache_detail, walking a freed hash table.
    
    Hold a reference on the struct net for the lifetime of the open
    file descriptor. This prevents nfsd_net_exit() from running --
    and thus prevents nfsd_export_shutdown() from freeing the cache
    -- while any exports fd is open. cache_detail already stores
    its net pointer (cd->net, set by cache_create_net()), so
    exports_release() can retrieve it without additional per-file
    storage.
    
    Reported-by: Misbah Anjum N <[email protected]>
    Closes: https://lore.kernel.org/linux-nfs/[email protected]/
    Fixes: 96d851c4d28d ("nfsd: use proper net while reading "exports" file")
    Cc: [email protected]
    Reviewed-by: Jeff Layton <[email protected]>
    Reviewed-by: NeilBrown <[email protected]>
    Tested-by: Olga Kornievskaia <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nouveau/dpcd: return EBUSY for aux xfer if the device is asleep [+ + +]

Author: Dave Airlie <[email protected]>
Date:   Tue Feb 24 13:17:50 2026 +1000

    nouveau/dpcd: return EBUSY for aux xfer if the device is asleep
    
    commit 8f3c6f08ababad2e3bdd239728cf66a9949446b4 upstream.
    
    If we have runtime suspended, and userspace wants to use /dev/drm_dp_*
    then just tell it the device is busy instead of crashing in the GSP
    code.
    
    WARNING: CPU: 2 PID: 565741 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/rm/r535/rpc.c:164 r535_gsp_msgq_wait+0x9a/0xb0 [nouveau]
    CPU: 2 UID: 0 PID: 565741 Comm: fwupd Not tainted 6.18.10-200.fc43.x86_64 #1 PREEMPT(lazy)
    Hardware name: LENOVO 20QTS0PQ00/20QTS0PQ00, BIOS N2OET65W (1.52 ) 08/05/2024
    RIP: 0010:r535_gsp_msgq_wait+0x9a/0xb0 [nouveau]
    
    This is a simple fix to get backported. We should probably engineer a
    proper power domain solution to wake up devices and keep them awake
    while fw updates are happening.
    
    Cc: [email protected]
    Fixes: 8894f4919bc4 ("drm/nouveau: register a drm_dp_aux channel for each dp connector")
    Reviewed-by: Lyude Paul <[email protected]>
    Signed-off-by: Dave Airlie <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Danilo Krummrich <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nsfs: tighten permission checks for ns iteration ioctls [+ + +]

Author: Christian Brauner <[email protected]>
Date:   Tue Mar 17 19:37:44 2026 -0400

    nsfs: tighten permission checks for ns iteration ioctls
    
    [ Upstream commit e6b899f08066e744f89df16ceb782e06868bd148 ]
    
    Even privileged services should not necessarily be able to see other
    privileged service's namespaces so they can't leak information to each
    other. Use may_see_all_namespaces() helper that centralizes this policy
    until the nstree adapts.
    
    Link: https://patch.msgid.link/[email protected]
    Fixes: a1d220d9dafa ("nsfs: iterate through mount namespaces")
    Reviewed-by: Jeff Layton <[email protected]>
    Cc: [email protected] # v6.12+
    Signed-off-by: Christian Brauner <[email protected]>
    [ Different file names ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nvdimm/bus: Fix potential use after free in asynchronous initialization [+ + +]

Author: Ira Weiny <[email protected]>
Date:   Fri Mar 6 12:33:05 2026 -0600

    nvdimm/bus: Fix potential use after free in asynchronous initialization
    
    commit a8aec14230322ed8f1e8042b6d656c1631d41163 upstream.
    
    Dingisoul with KASAN reports a use after free if device_add() fails in
    nd_async_device_register().
    
    Commit b6eae0f61db2 ("libnvdimm: Hold reference on parent while
    scheduling async init") correctly added a reference on the parent device
    to be held until asynchronous initialization was complete.  However, if
    device_add() results in an allocation failure the ref count of the
    device drops to 0 prior to the parent pointer being accessed.  Thus
    resulting in use after free.
    
    The bug bot AI correctly identified the fix.  Save a reference to the
    parent pointer to be used to drop the parent reference regardless of the
    outcome of device_add().
    
    Reported-by: Dingisoul <[email protected]>
    Closes: http://lore.kernel.org/[email protected]
    Fixes: b6eae0f61db2 ("libnvdimm: Hold reference on parent while scheduling async init")
    Cc: [email protected]
    Reviewed-by: Dave Jiang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Ira Weiny <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nvme-pci: Fix race bug in nvme_poll_irqdisable() [+ + +]

Author: Sungwoo Kim <[email protected]>
Date:   Sat Mar 7 14:46:36 2026 -0500

    nvme-pci: Fix race bug in nvme_poll_irqdisable()
    
    [ Upstream commit fc71f409b22ca831a9f87a2712eaa09ef2bb4a5e ]
    
    In the following scenario, pdev can be disabled between (1) and (3) by
    (2). This sets pdev->msix_enabled = 0. Then, pci_irq_vector() will
    return MSI-X IRQ(>15) for (1) whereas return INTx IRQ(<=15) for (2).
    This causes IRQ warning because it tries to enable INTx IRQ that has
    never been disabled before.
    
    To fix this, save IRQ number into a local variable and ensure
    disable_irq() and enable_irq() operate on the same IRQ number.  Even if
    pci_free_irq_vectors() frees the IRQ concurrently, disable_irq() and
    enable_irq() on a stale IRQ number is still valid and safe, and the
    depth accounting reamins balanced.
    
    task 1:
    nvme_poll_irqdisable()
      disable_irq(pci_irq_vector(pdev, nvmeq->cq_vector)) ...(1)
      enable_irq(pci_irq_vector(pdev, nvmeq->cq_vector))  ...(3)
    
    task 2:
    nvme_reset_work()
      nvme_dev_disable()
        pdev->msix_enable = 0;  ...(2)
    
    crash log:
    
    ------------[ cut here ]------------
    Unbalanced enable for IRQ 10
    WARNING: kernel/irq/manage.c:753 at __enable_irq+0x102/0x190 kernel/irq/manage.c:753, CPU#1: kworker/1:0H/26
    Modules linked in:
    CPU: 1 UID: 0 PID: 26 Comm: kworker/1:0H Not tainted 6.19.0-dirty #9 PREEMPT(voluntary)
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.3-0-ga6ed6b701f0a-prebuilt.qemu.org 04/01/2014
    Workqueue: kblockd blk_mq_timeout_work
    RIP: 0010:__enable_irq+0x107/0x190 kernel/irq/manage.c:753
    Code: ff df 48 89 fa 48 c1 ea 03 0f b6 14 02 48 89 f8 83 e0 07 83 c0 03 38 d0 7c 04 84 d2 75 79 48 8d 3d 2e 7a 3f 05 41 8b 74 24 2c <67> 48 0f b9 3a e8 ef b9 21 00 5b 41 5c 5d e9 46 54 66 03 e8 e1 b9
    RSP: 0018:ffffc900001bf550 EFLAGS: 00010046
    RAX: 0000000000000007 RBX: 0000000000000000 RCX: ffffffffb20c0e90
    RDX: 0000000000000000 RSI: 000000000000000a RDI: ffffffffb74b88f0
    RBP: ffffc900001bf560 R08: ffff88800197cf00 R09: 0000000000000001
    R10: 0000000000000003 R11: 0000000000000003 R12: ffff8880012a6000
    R13: 1ffff92000037eae R14: 000000000000000a R15: 0000000000000293
    FS:  0000000000000000(0000) GS:ffff8880b49f7000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 0000555da4a25fa8 CR3: 00000000208e8000 CR4: 00000000000006f0
    Call Trace:
     <TASK>
     enable_irq+0x121/0x1e0 kernel/irq/manage.c:797
     nvme_poll_irqdisable+0x162/0x1c0 drivers/nvme/host/pci.c:1494
     nvme_timeout+0x965/0x14b0 drivers/nvme/host/pci.c:1744
     blk_mq_rq_timed_out block/blk-mq.c:1653 [inline]
     blk_mq_handle_expired+0x227/0x2d0 block/blk-mq.c:1721
     bt_iter+0x2fc/0x3a0 block/blk-mq-tag.c:292
     __sbitmap_for_each_set include/linux/sbitmap.h:269 [inline]
     sbitmap_for_each_set include/linux/sbitmap.h:290 [inline]
     bt_for_each block/blk-mq-tag.c:324 [inline]
     blk_mq_queue_tag_busy_iter+0x969/0x1e80 block/blk-mq-tag.c:536
     blk_mq_timeout_work+0x627/0x870 block/blk-mq.c:1763
     process_one_work+0x956/0x1aa0 kernel/workqueue.c:3257
     process_scheduled_works kernel/workqueue.c:3340 [inline]
     worker_thread+0x65c/0xe60 kernel/workqueue.c:3421
     kthread+0x41a/0x930 kernel/kthread.c:463
     ret_from_fork+0x6f8/0x8c0 arch/x86/kernel/process.c:158
     ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
     </TASK>
    irq event stamp: 74478
    hardirqs last  enabled at (74477): [<ffffffffb5720a9c>] __raw_spin_unlock_irq include/linux/spinlock_api_smp.h:159 [inline]
    hardirqs last  enabled at (74477): [<ffffffffb5720a9c>] _raw_spin_unlock_irq+0x2c/0x60 kernel/locking/spinlock.c:202
    hardirqs last disabled at (74478): [<ffffffffb57207b5>] __raw_spin_lock_irqsave include/linux/spinlock_api_smp.h:108 [inline]
    hardirqs last disabled at (74478): [<ffffffffb57207b5>] _raw_spin_lock_irqsave+0x85/0xa0 kernel/locking/spinlock.c:162
    softirqs last  enabled at (74304): [<ffffffffb1e9466c>] __do_softirq kernel/softirq.c:656 [inline]
    softirqs last  enabled at (74304): [<ffffffffb1e9466c>] invoke_softirq kernel/softirq.c:496 [inline]
    softirqs last  enabled at (74304): [<ffffffffb1e9466c>] __irq_exit_rcu+0xdc/0x120 kernel/softirq.c:723
    softirqs last disabled at (74287): [<ffffffffb1e9466c>] __do_softirq kernel/softirq.c:656 [inline]
    softirqs last disabled at (74287): [<ffffffffb1e9466c>] invoke_softirq kernel/softirq.c:496 [inline]
    softirqs last disabled at (74287): [<ffffffffb1e9466c>] __irq_exit_rcu+0xdc/0x120 kernel/softirq.c:723
    ---[ end trace 0000000000000000 ]---
    
    Fixes: fa059b856a59 (nvme-pci: Simplify nvme_poll_irqdisable)
    Acked-by: Chao Shi <[email protected]>
    Acked-by: Weidong Zhu <[email protected]>
    Acked-by: Dave Tian <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Sungwoo Kim <[email protected]>
    Signed-off-by: Keith Busch <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nvme-pci: Fix slab-out-of-bounds in nvme_dbbuf_set [+ + +]

Author: Sungwoo Kim <[email protected]>
Date:   Sun Mar 8 14:20:59 2026 -0400

    nvme-pci: Fix slab-out-of-bounds in nvme_dbbuf_set
    
    [ Upstream commit b4e78f1427c7d6859229ae9616df54e1fc05a516 ]
    
    dev->online_queues is a count incremented in nvme_init_queue. Thus,
    valid indices are 0 through dev->online_queues − 1.
    
    This patch fixes the loop condition to ensure the index stays within the
    valid range. Index 0 is excluded because it is the admin queue.
    
    KASAN splat:
    
    ==================================================================
    BUG: KASAN: slab-out-of-bounds in nvme_dbbuf_free drivers/nvme/host/pci.c:377 [inline]
    BUG: KASAN: slab-out-of-bounds in nvme_dbbuf_set+0x39c/0x400 drivers/nvme/host/pci.c:404
    Read of size 2 at addr ffff88800592a574 by task kworker/u8:5/74
    
    CPU: 0 UID: 0 PID: 74 Comm: kworker/u8:5 Not tainted 6.19.0-dirty #10 PREEMPT(voluntary)
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.3-0-ga6ed6b701f0a-prebuilt.qemu.org 04/01/2014
    Workqueue: nvme-reset-wq nvme_reset_work
    Call Trace:
     <TASK>
     __dump_stack lib/dump_stack.c:94 [inline]
     dump_stack_lvl+0xea/0x150 lib/dump_stack.c:120
     print_address_description mm/kasan/report.c:378 [inline]
     print_report+0xce/0x5d0 mm/kasan/report.c:482
     kasan_report+0xdc/0x110 mm/kasan/report.c:595
     __asan_report_load2_noabort+0x18/0x20 mm/kasan/report_generic.c:379
     nvme_dbbuf_free drivers/nvme/host/pci.c:377 [inline]
     nvme_dbbuf_set+0x39c/0x400 drivers/nvme/host/pci.c:404
     nvme_reset_work+0x36b/0x8c0 drivers/nvme/host/pci.c:3252
     process_one_work+0x956/0x1aa0 kernel/workqueue.c:3257
     process_scheduled_works kernel/workqueue.c:3340 [inline]
     worker_thread+0x65c/0xe60 kernel/workqueue.c:3421
     kthread+0x41a/0x930 kernel/kthread.c:463
     ret_from_fork+0x6f8/0x8c0 arch/x86/kernel/process.c:158
     ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
     </TASK>
    
    Allocated by task 34 on cpu 1 at 4.241550s:
     kasan_save_stack+0x2c/0x60 mm/kasan/common.c:57
     kasan_save_track+0x1c/0x70 mm/kasan/common.c:78
     kasan_save_alloc_info+0x3c/0x50 mm/kasan/generic.c:570
     poison_kmalloc_redzone mm/kasan/common.c:398 [inline]
     __kasan_kmalloc+0xb5/0xc0 mm/kasan/common.c:415
     kasan_kmalloc include/linux/kasan.h:263 [inline]
     __do_kmalloc_node mm/slub.c:5657 [inline]
     __kmalloc_node_noprof+0x2bf/0x8d0 mm/slub.c:5663
     kmalloc_array_node_noprof include/linux/slab.h:1075 [inline]
     nvme_pci_alloc_dev drivers/nvme/host/pci.c:3479 [inline]
     nvme_probe+0x2f1/0x1820 drivers/nvme/host/pci.c:3534
     local_pci_probe+0xef/0x1c0 drivers/pci/pci-driver.c:324
     pci_call_probe drivers/pci/pci-driver.c:392 [inline]
     __pci_device_probe drivers/pci/pci-driver.c:417 [inline]
     pci_device_probe+0x743/0x920 drivers/pci/pci-driver.c:451
     call_driver_probe drivers/base/dd.c:583 [inline]
     really_probe+0x29b/0xb70 drivers/base/dd.c:661
     __driver_probe_device+0x3b0/0x4a0 drivers/base/dd.c:803
     driver_probe_device+0x56/0x1f0 drivers/base/dd.c:833
     __driver_attach_async_helper+0x155/0x340 drivers/base/dd.c:1159
     async_run_entry_fn+0xa6/0x4b0 kernel/async.c:129
     process_one_work+0x956/0x1aa0 kernel/workqueue.c:3257
     process_scheduled_works kernel/workqueue.c:3340 [inline]
     worker_thread+0x65c/0xe60 kernel/workqueue.c:3421
     kthread+0x41a/0x930 kernel/kthread.c:463
     ret_from_fork+0x6f8/0x8c0 arch/x86/kernel/process.c:158
     ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:246
    
    The buggy address belongs to the object at ffff88800592a000
     which belongs to the cache kmalloc-2k of size 2048
    The buggy address is located 244 bytes to the right of
     allocated 1152-byte region [ffff88800592a000, ffff88800592a480)
    
    The buggy address belongs to the physical page:
    page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x5928
    head: order:3 mapcount:0 entire_mapcount:0 nr_pages_mapped:0 pincount:0
    anon flags: 0xfffffc0000040(head|node=0|zone=1|lastcpupid=0x1fffff)
    page_type: f5(slab)
    raw: 000fffffc0000040 ffff888001042000 0000000000000000 dead000000000001
    raw: 0000000000000000 0000000000080008 00000000f5000000 0000000000000000
    head: 000fffffc0000040 ffff888001042000 0000000000000000 dead000000000001
    head: 0000000000000000 0000000000080008 00000000f5000000 0000000000000000
    head: 000fffffc0000003 ffffea0000164a01 00000000ffffffff 00000000ffffffff
    head: ffffffffffffffff 0000000000000000 00000000ffffffff 0000000000000008
    page dumped because: kasan: bad access detected
    
    Memory state around the buggy address:
     ffff88800592a400: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
     ffff88800592a480: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    >ffff88800592a500: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
                                                                 ^
     ffff88800592a580: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
     ffff88800592a600: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    ==================================================================
    
    Fixes: 0f0d2c876c96 (nvme: free sq/cq dbbuf pointers when dbbuf set fails)
    Acked-by: Chao Shi <[email protected]>
    Acked-by: Weidong Zhu <[email protected]>
    Acked-by: Dave Tian <[email protected]>
    Signed-off-by: Sungwoo Kim <[email protected]>
    Signed-off-by: Keith Busch <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Octeontx2-af: Add proper checks for fwdata [+ + +]

Author: Hariprasad Kelam <[email protected]>
Date:   Fri Feb 27 16:34:51 2026 +0800

    Octeontx2-af: Add proper checks for fwdata
    
    [ Upstream commit 4a3dba48188208e4f66822800e042686784d29d1 ]
    
    firmware populates MAC address, link modes (supported, advertised)
    and EEPROM data in shared firmware structure which kernel access
    via MAC block(CGX/RPM).
    
    Accessing fwdata, on boards booted with out MAC block leading to
    kernel panics.
    
    Internal error: Oops: 0000000096000005 [#1]  SMP
    [   10.460721] Modules linked in:
    [   10.463779] CPU: 0 UID: 0 PID: 174 Comm: kworker/0:3 Not tainted 6.19.0-rc5-00154-g76ec646abdf7-dirty #3 PREEMPT
    [   10.474045] Hardware name: Marvell OcteonTX CN98XX board (DT)
    [   10.479793] Workqueue: events work_for_cpu_fn
    [   10.484159] pstate: 80400009 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
    [   10.491124] pc : rvu_sdp_init+0x18/0x114
    [   10.495051] lr : rvu_probe+0xe58/0x1d18
    
    Fixes: 997814491cee ("Octeontx2-af: Fetch MAC channel info from firmware")
    Fixes: 5f21226b79fd ("Octeontx2-pf: ethtool: support multi advertise mode")
    Signed-off-by: Hariprasad Kelam <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Rajani Kantha <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

octeontx2-af: devlink: fix NIX RAS reporter recovery condition [+ + +]

Author: Alok Tiwari <[email protected]>
Date:   Tue Mar 10 11:48:16 2026 -0700

    octeontx2-af: devlink: fix NIX RAS reporter recovery condition
    
    [ Upstream commit dc26ca99b835e21e76a58b1463b84adb0ca34f58 ]
    
    The NIX RAS health reporter recovery routine checks nix_af_rvu_int to
    decide whether to re-enable NIX_AF_RAS interrupts. This is the RVU
    interrupt status field and is unrelated to RAS events, so the recovery
    flow may incorrectly skip re-enabling NIX_AF_RAS interrupts.
    
    Check nix_af_rvu_ras instead before writing NIX_AF_RAS_ENA_W1S.
    
    Fixes: 5ed66306eab6 ("octeontx2-af: Add devlink health reporters for NIX")
    Signed-off-by: Alok Tiwari <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

octeontx2-af: devlink: fix NIX RAS reporter to use RAS interrupt status [+ + +]

Author: Alok Tiwari <[email protected]>
Date:   Tue Mar 10 11:48:17 2026 -0700

    octeontx2-af: devlink: fix NIX RAS reporter to use RAS interrupt status
    
    [ Upstream commit 87f7dff3ec75b91def0024ebaaf732457f47a63b ]
    
    The NIX RAS health report path uses nix_af_rvu_err when handling the
    NIX_AF_RVU_RAS case, so the report prints the ERR interrupt status rather
    than the RAS interrupt status.
    
    Use nix_af_rvu_ras for the NIX_AF_RVU_RAS report.
    
    Fixes: 5ed66306eab6 ("octeontx2-af: Add devlink health reporters for NIX")
    Signed-off-by: Alok Tiwari <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

parisc: Check kernel mapping earlier at bootup [+ + +]

Author: Helge Deller <[email protected]>
Date:   Tue Mar 3 23:36:11 2026 +0100

    parisc: Check kernel mapping earlier at bootup
    
    commit 17c144f1104bfc29a3ce3f7d0931a1bfb7a3558c upstream.
    
    The check if the initial mapping is sufficient needs to happen much
    earlier during bootup. Move this test directly to the start_parisc()
    function and use native PDC iodc functions to print the warning, because
    panic() and printk() are not functional yet.
    
    This fixes boot when enabling various KALLSYSMS options which need
    much more space.
    
    Signed-off-by: Helge Deller <[email protected]>
    Cc: <[email protected]> # v6.0+
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

parisc: Fix initial page table creation for boot [+ + +]

Author: Helge Deller <[email protected]>
Date:   Wed Mar 4 22:24:18 2026 +0100

    parisc: Fix initial page table creation for boot
    
    commit 8475d8fe21ec9c7eb2faca555fbc5b68cf0d2597 upstream.
    
    The KERNEL_INITIAL_ORDER value defines the initial size (usually 32 or
    64 MB) of the page table during bootup. Up until now the whole area was
    initialized with PTE entries, but there was no check if we filled too
    many entries.  Change the code to fill up with so many entries that the
    "_end" symbol can be reached by the kernel, but not more entries than
    actually fit into the initial PTE tables.
    
    Signed-off-by: Helge Deller <[email protected]>
    Cc: <[email protected]> # v6.0+
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

parisc: Flush correct cache in cacheflush() syscall [+ + +]

Author: Helge Deller <[email protected]>
Date:   Mon Mar 9 15:16:37 2026 +0100

    parisc: Flush correct cache in cacheflush() syscall
    
    commit 2c98a8fbd6aa647414c6248dacf254ebe91c79ad upstream.
    
    The assembly flush instructions were swapped for I- and D-cache flags:
    
    SYSCALL_DEFINE3(cacheflush, ...)
    {
            if (cache & DCACHE) {
                            "fic ...\n"
            }
            if (cache & ICACHE && error == 0) {
                            "fdc ...\n"
            }
    
    Fix it by using fdc for DCACHE, and fic for ICACHE flushing.
    
    Reported-by: Felix Lechner <[email protected]>
    Fixes: c6d96328fecd ("parisc: Add cacheflush() syscall")
    Cc: <[email protected]> # v6.5+
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

parisc: Increase initial mapping to 64 MB with KALLSYMS [+ + +]

Author: Helge Deller <[email protected]>
Date:   Tue Mar 3 23:36:10 2026 +0100

    parisc: Increase initial mapping to 64 MB with KALLSYMS
    
    commit 8e732934fb81282be41602550e7e07baf265e972 upstream.
    
    The 32MB initial kernel mapping can become too small when CONFIG_KALLSYMS
    is used. Increase the mapping to 64 MB in this case.
    
    Signed-off-by: Helge Deller <[email protected]>
    Cc: <[email protected]> # v6.0+
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

perf annotate: Fix hashmap__new() error checking [+ + +]

Author: Chen Ni <[email protected]>
Date:   Fri Mar 6 11:56:48 2026 +0800

    perf annotate: Fix hashmap__new() error checking
    
    [ Upstream commit bf29cb3641b80bac759c3332b02e0b270e16bf94 ]
    
    The hashmap__new() function never returns NULL, it returns error
    pointers. Fix the error checking to match.
    
    Additionally, set src->samples to NULL to prevent any later code from
    accidentally using the error pointer.
    
    Fixes: d3e7cad6f36d9e80 ("perf annotate: Add a hashmap for symbol histogram")
    Reviewed-by: Ian Rogers <[email protected]>
    Signed-off-by: Chen Ni <[email protected]>
    Cc: Adrian Hunter <[email protected]>
    Cc: Alexander Shishkin <[email protected]>
    Cc: Ingo Molnar <[email protected]>
    Cc: James Clark <[email protected]>
    Cc: Jiri Olsa <[email protected]>
    Cc: Mark Rutland <[email protected]>
    Cc: Namhyung Kim <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: Tianyou Li <[email protected]>
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf disasm: Fix off-by-one bug in outside check [+ + +]

Author: Peter Collingbourne <[email protected]>
Date:   Wed Mar 4 11:06:12 2026 -0800

    perf disasm: Fix off-by-one bug in outside check
    
    [ Upstream commit b3ce769203a99d6f3c6d6269ec09232a8c5da422 ]
    
    If a branch target points to one past the end of a function, the branch
    should be treated as a branch to another function.
    
    This can happen e.g. with a tail call to a function that is laid out
    immediately after the caller.
    
    Fixes: 751b1783da784299 ("perf annotate: Mark jumps to outher functions with the call arrow")
    Reviewed-by: Ian Rogers <[email protected]>
    Signed-off-by: Peter Collingbourne <[email protected]>
    Cc: Adrian Hunter <[email protected]>
    Cc: Alexander Shishkin <[email protected]>
    Cc: Bill Wendling <[email protected]>
    Cc: Ingo Molnar <[email protected]>
    Cc: James Clark <[email protected]>
    Cc: Jiri Olsa <[email protected]>
    Cc: Justin Stitt <[email protected]>
    Cc: Mark Rutland <[email protected]>
    Cc: Namhyung Kim <[email protected]>
    Cc: Nathan Chancellor <[email protected]>
    Cc: Nick Desaulniers <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Link: https://linux-review.googlesource.com/id/Ide471112e82d68177e0faf08ca411d9fcf0a7bdf
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf ftrace: Fix hashmap__new() error checking [+ + +]

Author: Chen Ni <[email protected]>
Date:   Fri Mar 6 12:10:52 2026 +0800

    perf ftrace: Fix hashmap__new() error checking
    
    [ Upstream commit be34705aa527872e5ce83927b7bc9307ba8095ca ]
    
    The hashmap__new() function never returns NULL, it returns error
    pointers. Fix the error checking to match.
    
    Additionally, set ftrace->profile_hash to NULL on error, and return the
    exact error code from hashmap__new().
    
    Fixes: 0f223813edd051a5 ("perf ftrace: Add 'profile' command")
    Suggested-by: Ian Rogers <[email protected]>
    Signed-off-by: Chen Ni <[email protected]>
    Cc: Adrian Hunter <[email protected]>
    Cc: Alexander Shishkin <[email protected]>
    Cc: Ingo Molnar <[email protected]>
    Cc: James Clark <[email protected]>
    Cc: Jiri Olsa <[email protected]>
    Cc: Mark Rutland <[email protected]>
    Cc: Namhyung Kim <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf/x86/intel/uncore: Add per-scheduler IMC CAS count events [+ + +]

Author: Zide Chen <[email protected]>
Date:   Mon Mar 9 06:44:06 2026 -0400

    perf/x86/intel/uncore: Add per-scheduler IMC CAS count events
    
    [ Upstream commit 6a8a48644c4b804123e59dbfc5d6cd29a0194046 ]
    
    IMC on SPR and EMR does not support sub-channels.  In contrast, CPUs
    that use gnr_uncores[] (e.g. Granite Rapids and Sierra Forest)
    implement two command schedulers (SCH0/SCH1) per memory channel,
    providing logically independent command and data paths.
    
    Do not reuse the spr_uncore_imc[] configuration for these CPUs.
    Instead, introduce a dedicated gnr_uncore_imc[] with per-scheduler
    events, so userspace can monitor SCH0 and SCH1 independently.
    
    On these CPUs, replace cas_count_{read,write} with
    cas_count_{read,write}_sch{0,1}.  This may break existing userspace
    that relies on cas_count_{read,write}, prompting it to switch to the
    per-scheduler events, as the legacy event reports only partial
    traffic (SCH0).
    
    Fixes: 632c4bf6d007 ("perf/x86/intel/uncore: Support Granite Rapids")
    Fixes: cb4a6ccf3583 ("perf/x86/intel/uncore: Support Sierra Forest and Grand Ridge")
    Reported-by: Reinette Chatre <[email protected]>
    Signed-off-by: Zide Chen <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Reviewed-by: Dapeng Mi <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

perf/x86/intel/uncore: Support more units on Granite Rapids [+ + +]

Author: Kan Liang <[email protected]>
Date:   Mon Mar 9 06:44:05 2026 -0400

    perf/x86/intel/uncore: Support more units on Granite Rapids
    
    [ Upstream commit 6d642735cdb6cdb814d2b6c81652caa53ce04842 ]
    
    The same CXL PMONs support is also avaiable on GNR. Apply
    spr_uncore_cxlcm and spr_uncore_cxldp to GNR as well.
    
    The other units were broken on early HW samples, so they were ignored in
    the early enabling patch. The issue has been fixed and verified on the
    later production HW. Add UPI, B2UPI, B2HOT, PCIEX16 and PCIEX8 for GNR.
    
    Signed-off-by: Kan Liang <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Tested-by: Eric Hu <[email protected]>
    Link: https://lkml.kernel.org/r/[email protected]
    Stable-dep-of: 6a8a48644c4b ("perf/x86/intel/uncore: Add per-scheduler IMC CAS count events")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

perf/x86/intel: Add missing branch counters constraint apply [+ + +]

Author: Dapeng Mi <[email protected]>
Date:   Mon Mar 23 09:19:20 2026 -0400

    perf/x86/intel: Add missing branch counters constraint apply
    
    [ Upstream commit 1d07bbd7ea36ea0b8dfa8068dbe67eb3a32d9590 ]
    
    When running the command:
    'perf record -e "{instructions,instructions:p}" -j any,counter sleep 1',
    a "shift-out-of-bounds" warning is reported on CWF.
    
      UBSAN: shift-out-of-bounds in /kbuild/src/consumer/arch/x86/events/intel/lbr.c:970:15
      shift exponent 64 is too large for 64-bit type 'long long unsigned int'
      ......
      intel_pmu_lbr_counters_reorder.isra.0.cold+0x2a/0xa7
      intel_pmu_lbr_save_brstack+0xc0/0x4c0
      setup_arch_pebs_sample_data+0x114b/0x2400
    
    The warning occurs because the second "instructions:p" event, which
    involves branch counters sampling, is incorrectly programmed to fixed
    counter 0 instead of the general-purpose (GP) counters 0-3 that support
    branch counters sampling. Currently only GP counters 0-3 support branch
    counters sampling on CWF, any event involving branch counters sampling
    should be programed on GP counters 0-3. Since the counter index of fixed
    counter 0 is 32, it leads to the "src" value in below code is right
    shifted 64 bits and trigger the "shift-out-of-bounds" warning.
    
    cnt = (src >> (order[j] * LBR_INFO_BR_CNTR_BITS)) & LBR_INFO_BR_CNTR_MASK;
    
    The root cause is the loss of the branch counters constraint for the
    new event in the branch counters sampling event group. Since it isn't
    yet part of the sibling list. This results in the second
    "instructions:p" event being programmed on fixed counter 0 incorrectly
    instead of the appropriate GP counters 0-3.
    
    To address this, we apply the missing branch counters constraint for
    the last event in the group. Additionally, we introduce a new function,
    `intel_set_branch_counter_constr()`, to apply the branch counters
    constraint and avoid code duplication.
    
    Fixes: 33744916196b ("perf/x86/intel: Support branch counters logging")
    Reported-by: Xudong Hao <[email protected]>
    Signed-off-by: Dapeng Mi <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Cc: [email protected]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

platform/x86/amd/pmc: Add support for Van Gogh SoC [+ + +]

Author: Antheas Kapenekakis <[email protected]>
Date:   Tue Mar 3 12:15:12 2026 +0800

    platform/x86/amd/pmc: Add support for Van Gogh SoC
    
    [ Upstream commit db4a3f0fbedb0398f77b9047e8b8bb2b49f355bb ]
    
    The ROG Xbox Ally (non-X) SoC features a similar architecture to the
    Steam Deck. While the Steam Deck supports S3 (s2idle causes a crash),
    this support was dropped by the Xbox Ally which only S0ix suspend.
    
    Since the handler is missing here, this causes the device to not suspend
    and the AMD GPU driver to crash while trying to resume afterwards due to
    a power hang.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4659
    Signed-off-by: Antheas Kapenekakis <[email protected]>
    Reviewed-by: Mario Limonciello (AMD) <[email protected]>
    Acked-by: Shyam Sundar S K <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    [ Adjust context ]
    Signed-off-by: Alva Lan <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

platform/x86: hp-bioscfg: Support allocations of larger data [+ + +]

Author: Mario Limonciello <[email protected]>
Date:   Mon Mar 9 07:31:54 2026 -0400

    platform/x86: hp-bioscfg: Support allocations of larger data
    
    [ Upstream commit 916727cfdb72cd01fef3fa6746e648f8cb70e713 ]
    
    Some systems have much larger amounts of enumeration attributes
    than have been previously encountered. This can lead to page allocation
    failures when using kcalloc().  Switch over to using kvcalloc() to
    allow larger allocations.
    
    Fixes: 6b2770bfd6f92 ("platform/x86: hp-bioscfg: enum-attributes")
    Cc: [email protected]
    Reported-by: Paul Kerry <[email protected]>
    Tested-by: Paul Kerry <[email protected]>
    Closes: https://bugs.debian.org/1127612
    Signed-off-by: Mario Limonciello <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    [ kcalloc() => kvcalloc() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

PM: runtime: Fix a race condition related to device removal [+ + +]

Author: Bart Van Assche <[email protected]>
Date:   Thu Mar 12 11:27:20 2026 -0700

    PM: runtime: Fix a race condition related to device removal
    
    [ Upstream commit 29ab768277617452d88c0607c9299cdc63b6e9ff ]
    
    The following code in pm_runtime_work() may dereference the dev->parent
    pointer after the parent device has been freed:
    
            /* Maybe the parent is now able to suspend. */
            if (parent && !parent->power.ignore_children) {
                    spin_unlock(&dev->power.lock);
    
                    spin_lock(&parent->power.lock);
                    rpm_idle(parent, RPM_ASYNC);
                    spin_unlock(&parent->power.lock);
    
                    spin_lock(&dev->power.lock);
            }
    
    Fix this by inserting a flush_work() call in pm_runtime_remove().
    
    Without this patch blktest block/001 triggers the following complaint
    sporadically:
    
    BUG: KASAN: slab-use-after-free in lock_acquire+0x70/0x160
    Read of size 1 at addr ffff88812bef7198 by task kworker/u553:1/3081
    Workqueue: pm pm_runtime_work
    Call Trace:
     <TASK>
     dump_stack_lvl+0x61/0x80
     print_address_description.constprop.0+0x8b/0x310
     print_report+0xfd/0x1d7
     kasan_report+0xd8/0x1d0
     __kasan_check_byte+0x42/0x60
     lock_acquire.part.0+0x38/0x230
     lock_acquire+0x70/0x160
     _raw_spin_lock+0x36/0x50
     rpm_suspend+0xc6a/0xfe0
     rpm_idle+0x578/0x770
     pm_runtime_work+0xee/0x120
     process_one_work+0xde3/0x1410
     worker_thread+0x5eb/0xfe0
     kthread+0x37b/0x480
     ret_from_fork+0x6cb/0x920
     ret_from_fork_asm+0x11/0x20
     </TASK>
    
    Allocated by task 4314:
     kasan_save_stack+0x2a/0x50
     kasan_save_track+0x18/0x40
     kasan_save_alloc_info+0x3d/0x50
     __kasan_kmalloc+0xa0/0xb0
     __kmalloc_noprof+0x311/0x990
     scsi_alloc_target+0x122/0xb60 [scsi_mod]
     __scsi_scan_target+0x101/0x460 [scsi_mod]
     scsi_scan_channel+0x179/0x1c0 [scsi_mod]
     scsi_scan_host_selected+0x259/0x2d0 [scsi_mod]
     store_scan+0x2d2/0x390 [scsi_mod]
     dev_attr_store+0x43/0x80
     sysfs_kf_write+0xde/0x140
     kernfs_fop_write_iter+0x3ef/0x670
     vfs_write+0x506/0x1470
     ksys_write+0xfd/0x230
     __x64_sys_write+0x76/0xc0
     x64_sys_call+0x213/0x1810
     do_syscall_64+0xee/0xfc0
     entry_SYSCALL_64_after_hwframe+0x4b/0x53
    
    Freed by task 4314:
     kasan_save_stack+0x2a/0x50
     kasan_save_track+0x18/0x40
     kasan_save_free_info+0x3f/0x50
     __kasan_slab_free+0x67/0x80
     kfree+0x225/0x6c0
     scsi_target_dev_release+0x3d/0x60 [scsi_mod]
     device_release+0xa3/0x220
     kobject_cleanup+0x105/0x3a0
     kobject_put+0x72/0xd0
     put_device+0x17/0x20
     scsi_device_dev_release+0xacf/0x12c0 [scsi_mod]
     device_release+0xa3/0x220
     kobject_cleanup+0x105/0x3a0
     kobject_put+0x72/0xd0
     put_device+0x17/0x20
     scsi_device_put+0x7f/0xc0 [scsi_mod]
     sdev_store_delete+0xa5/0x120 [scsi_mod]
     dev_attr_store+0x43/0x80
     sysfs_kf_write+0xde/0x140
     kernfs_fop_write_iter+0x3ef/0x670
     vfs_write+0x506/0x1470
     ksys_write+0xfd/0x230
     __x64_sys_write+0x76/0xc0
     x64_sys_call+0x213/0x1810
    
    Reported-by: Ming Lei <[email protected]>
    Closes: https://lore.kernel.org/all/ZxdNvLNI8QaOfD2d@fedora/
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/
    Fixes: 5e928f77a09a ("PM: Introduce core framework for run-time PM of I/O devices (rev. 17)")
    Signed-off-by: Bart Van Assche <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

pmdomain: bcm: bcm2835-power: Fix broken reset status read [+ + +]

Author: Maíra Canal <[email protected]>
Date:   Thu Feb 12 11:49:44 2026 -0300

    pmdomain: bcm: bcm2835-power: Fix broken reset status read
    
    commit 550bae2c0931dbb664a61b08c21cf156f0a5362a upstream.
    
    bcm2835_reset_status() has a misplaced parenthesis on every PM_READ()
    call. Since PM_READ(reg) expands to readl(power->base + (reg)), the
    expression:
    
        PM_READ(PM_GRAFX & PM_V3DRSTN)
    
    computes the bitwise AND of the register offset PM_GRAFX with the
    bitmask PM_V3DRSTN before using the result as a register offset, reading
    from the wrong MMIO address instead of the intended PM_GRAFX register.
    The same issue affects the PM_IMAGE cases.
    
    Fix by moving the closing parenthesis so PM_READ() receives only the
    register offset, and the bitmask is applied to the value returned by
    the read.
    
    Fixes: 670c672608a1 ("soc: bcm: bcm2835-pm: Add support for power domains under a new binding.")
    Signed-off-by: Maíra Canal <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Reviewed-by: Stefan Wahren <[email protected]>
    Cc: [email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

pmdomain: bcm: bcm2835-power: Increase ASB control timeout [+ + +]

Author: Maíra Canal <[email protected]>
Date:   Tue Mar 17 19:41:49 2026 -0300

    pmdomain: bcm: bcm2835-power: Increase ASB control timeout
    
    commit b826d2c0b0ecb844c84431ba6b502e744f5d919a upstream.
    
    The bcm2835_asb_control() function uses a tight polling loop to wait
    for the ASB bridge to acknowledge a request. During intensive workloads,
    this handshake intermittently fails for V3D's master ASB on BCM2711,
    resulting in "Failed to disable ASB master for v3d" errors during
    runtime PM suspend. As a consequence, the failed power-off leaves V3D in
    a broken state, leading to bus faults or system hangs on later accesses.
    
    As the timeout is insufficient in some scenarios, increase the polling
    timeout from 1us to 5us, which is still negligible in the context of a
    power domain transition. Also, replace the open-coded ktime_get_ns()/
    cpu_relax() polling loop with readl_poll_timeout_atomic().
    
    Cc: [email protected]
    Fixes: 670c672608a1 ("soc: bcm: bcm2835-pm: Add support for power domains under a new binding.")
    Signed-off-by: Maíra Canal <[email protected]>
    Reviewed-by: Stefan Wahren <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc/crash: adjust the elfcorehdr size [+ + +]

Author: Sourabh Jain <[email protected]>
Date:   Fri Feb 27 22:48:01 2026 +0530

    powerpc/crash: adjust the elfcorehdr size
    
    [ Upstream commit 04e707cb77c272cb0bb2e2e3c5c7f844d804a089 ]
    
    With crash hotplug support enabled, additional memory is allocated to
    the elfcorehdr kexec segment to accommodate resources added during
    memory hotplug events. However, the kdump FDT is not updated with the
    same size, which can result in elfcorehdr corruption in the kdump
    kernel.
    
    Update elf_headers_sz (the kimage member representing the size of the
    elfcorehdr kexec segment) to reflect the total memory allocated for the
    elfcorehdr segment instead of the elfcorehdr buffer size at the time of
    kdump load. This allows of_kexec_alloc_and_setup_fdt() to reserve the
    full elfcorehdr memory in the kdump FDT and prevents elfcorehdr
    corruption.
    
    Fixes: 849599b702ef8 ("powerpc/crash: add crash memory hotplug support")
    Reviewed-by: Hari Bathini <[email protected]>
    Signed-off-by: Sourabh Jain <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

powerpc/kexec/core: use big-endian types for crash variables [+ + +]

Author: Sourabh Jain <[email protected]>
Date:   Wed Dec 24 20:42:57 2025 +0530

    powerpc/kexec/core: use big-endian types for crash variables
    
    [ Upstream commit 20197b967a6a29dab81495f25a988515bda84cfe ]
    
    Use explicit word-sized big-endian types for kexec and crash related
    variables. This makes the endianness unambiguous and avoids type
    mismatches that trigger sparse warnings.
    
    The change addresses sparse warnings like below (seen on both 32-bit
    and 64-bit builds):
    
    CHECK   ../arch/powerpc/kexec/core.c
    sparse:    expected unsigned int static [addressable] [toplevel] [usertype] crashk_base
    sparse:    got restricted __be32 [usertype]
    sparse: warning: incorrect type in assignment (different base types)
    sparse:    expected unsigned int static [addressable] [toplevel] [usertype] crashk_size
    sparse:    got restricted __be32 [usertype]
    sparse: warning: incorrect type in assignment (different base types)
    sparse:    expected unsigned long long static [addressable] [toplevel] mem_limit
    sparse:    got restricted __be32 [usertype]
    sparse: warning: incorrect type in assignment (different base types)
    sparse:    expected unsigned int static [addressable] [toplevel] [usertype] kernel_end
    sparse:    got restricted __be32 [usertype]
    
    No functional change intended.
    
    Fixes: ea961a828fe7 ("powerpc: Fix endian issues in kexec and crash dump code")
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Signed-off-by: Sourabh Jain <[email protected]>
    Tested-by: Venkat Rao Bagalkote <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

powerpc/uaccess: Fix inline assembly for clang build on PPC32 [+ + +]

Author: Christophe Leroy (CS GROUP) <[email protected]>
Date:   Tue Feb 3 08:30:41 2026 +0100

    powerpc/uaccess: Fix inline assembly for clang build on PPC32
    
    [ Upstream commit 0ee95a1d458630272d0415d0ffa9424fcb606c90 ]
    
    Test robot reports the following error with clang-16.0.6:
    
       In file included from kernel/rseq.c:75:
       include/linux/rseq_entry.h:141:3: error: invalid operand for instruction
                       unsafe_get_user(offset, &ucs->post_commit_offset, efault);
                       ^
       include/linux/uaccess.h:608:2: note: expanded from macro 'unsafe_get_user'
               arch_unsafe_get_user(x, ptr, local_label);      \
               ^
       arch/powerpc/include/asm/uaccess.h:518:2: note: expanded from macro 'arch_unsafe_get_user'
               __get_user_size_goto(__gu_val, __gu_addr, sizeof(*(p)), e); \
               ^
       arch/powerpc/include/asm/uaccess.h:284:2: note: expanded from macro '__get_user_size_goto'
               __get_user_size_allowed(x, ptr, size, __gus_retval);    \
               ^
       arch/powerpc/include/asm/uaccess.h:275:10: note: expanded from macro '__get_user_size_allowed'
               case 8: __get_user_asm2(x, (u64 __user *)ptr, retval);  break;  \
                       ^
       arch/powerpc/include/asm/uaccess.h:258:4: note: expanded from macro '__get_user_asm2'
                       "       li %1+1,0\n"                    \
                        ^
       <inline asm>:7:5: note: instantiated into assembly here
               li 31+1,0
                  ^
       1 error generated.
    
    On PPC32, for 64 bits vars a pair of registers is used. Usually the
    lower register in the pair is the high part and the higher register is
    the low part. GCC uses r3/r4 ... r11/r12 ... r14/r15 ... r30/r31
    
    In older kernel code inline assembly was using %1 and %1+1 to represent
    64 bits values. However here it looks like clang uses r31 as high part,
    allthough r32 doesn't exist hence the error.
    
    Allthoug %1+1 should work, most places now use %L1 instead of %1+1, so
    let's do the same here.
    
    With that change, the build doesn't fail anymore and a disassembly shows
    clang uses r17/r18 and r31/r14 pair when GCC would have used r16/r17 and
    r30/r31:
    
            Disassembly of section .fixup:
    
            00000000 <.fixup>:
               0:   38 a0 ff f2     li      r5,-14
               4:   3a 20 00 00     li      r17,0
               8:   3a 40 00 00     li      r18,0
               c:   48 00 00 00     b       c <.fixup+0xc>
                                    c: R_PPC_REL24  .text+0xbc
              10:   38 a0 ff f2     li      r5,-14
              14:   3b e0 00 00     li      r31,0
              18:   39 c0 00 00     li      r14,0
              1c:   48 00 00 00     b       1c <.fixup+0x1c>
                                    1c: R_PPC_REL24 .text+0x144
    
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Fixes: c20beffeec3c ("powerpc/uaccess: Use flexible addressing with __put_user()/__get_user()")
    Signed-off-by: Christophe Leroy (CS GROUP) <[email protected]>
    Acked-by: Nathan Chancellor <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/8ca3a657a650e497a96bfe7acde2f637dadab344.1770103646.git.chleroy@kernel.org
    Signed-off-by: Sasha Levin <[email protected]>

powerpc64/bpf: fix kfunc call support [+ + +]

Author: Hari Bathini <[email protected]>
Date:   Thu Mar 19 08:51:38 2026 -0400

    powerpc64/bpf: fix kfunc call support
    
    [ Upstream commit 01b6ac72729610ae732ca2a66e3a642e23f6cd60 ]
    
    Commit 61688a82e047 ("powerpc/bpf: enable kfunc call") inadvertently
    enabled kfunc call support for 32-bit powerpc but that support will
    not be possible until ABI mismatch between 32-bit powerpc and eBPF is
    handled in 32-bit powerpc JIT code. Till then, advertise support only
    for 64-bit powerpc. Also, in powerpc ABI, caller needs to extend the
    arguments properly based on signedness. The JIT code is responsible
    for handling this explicitly for kfunc calls as verifier can't handle
    this for each architecture-specific ABI needs. But this was not taken
    care of while kfunc call support was enabled for powerpc. Fix it by
    handling this with bpf_jit_find_kfunc_model() and using zero_extend()
    & sign_extend() helper functions.
    
    Fixes: 61688a82e047 ("powerpc/bpf: enable kfunc call")
    Cc: [email protected]
    Signed-off-by: Hari Bathini <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc64/bpf: Fold bpf_jit_emit_func_call_hlp() into bpf_jit_emit_func_call_rel() [+ + +]

Author: Naveen N Rao <[email protected]>
Date:   Thu Mar 19 08:51:37 2026 -0400

    powerpc64/bpf: Fold bpf_jit_emit_func_call_hlp() into bpf_jit_emit_func_call_rel()
    
    [ Upstream commit 9670f6d2097c4f97e15c67920dfddc664d7ee91c ]
    
    Commit 61688a82e047 ("powerpc/bpf: enable kfunc call") enhanced
    bpf_jit_emit_func_call_hlp() to handle calls out to module region, where
    bpf progs are generated. The only difference now between
    bpf_jit_emit_func_call_hlp() and bpf_jit_emit_func_call_rel() is in
    handling of the initial pass where target function address is not known.
    Fold that logic into bpf_jit_emit_func_call_hlp() and rename it to
    bpf_jit_emit_func_call_rel() to simplify bpf function call JIT code.
    
    We don't actually need to load/restore TOC across a call out to a
    different kernel helper or to a different bpf program since they all
    work with the kernel TOC. We only need to do it if we have to call out
    to a module function. So, guard TOC load/restore with appropriate
    conditions.
    
    Signed-off-by: Naveen N Rao <[email protected]>
    Signed-off-by: Michael Ellerman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Stable-dep-of: 01b6ac727296 ("powerpc64/bpf: fix kfunc call support")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc: 83xx: km83xx: Fix keymile vendor prefix [+ + +]

Author: J. Neuschäfer <[email protected]>
Date:   Tue Mar 3 16:31:42 2026 +0100

    powerpc: 83xx: km83xx: Fix keymile vendor prefix
    
    [ Upstream commit 691417ffe7821721e0a28bd25ad8c0dc0d4ae4ad ]
    
    When kmeter.c was refactored into km83xx.c in 2011, the "keymile" vendor
    prefix was changed to upper-case "Keymile". The devicetree at
    arch/powerpc/boot/dts/kmeter1.dts never underwent the same change,
    suggesting that this was simply a mistake.
    
    Fixes: 93e2b95c81042d ("powerpc/83xx: rename and update kmeter1")
    Signed-off-by: J. Neuschäfer <[email protected]>
    Reviewed-by: Heiko Schocher <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

qmi_wwan: allow max_mtu above hard_mtu to control rx_urb_size [+ + +]

Author: Laurent Vivier <[email protected]>
Date:   Wed Mar 4 14:43:38 2026 +0100

    qmi_wwan: allow max_mtu above hard_mtu to control rx_urb_size
    
    commit 55f854dd5bdd8e19b936a00ef1f8d776ac32c7b0 upstream.
    
    Commit c7159e960f14 ("usbnet: limit max_mtu based on device's hard_mtu")
    capped net->max_mtu to the device's hard_mtu in usbnet_probe(). While
    this correctly prevents oversized packets on standard USB network
    devices, it breaks the qmi_wwan driver.
    
    qmi_wwan relies on userspace (e.g. ModemManager) setting a large MTU on
    the wwan0 interface to configure rx_urb_size via usbnet_change_mtu().
    QMI modems negotiate USB transfer sizes of 16,383 or 32,767 bytes, and
    the USB receive buffers must be sized accordingly. With max_mtu capped
    to hard_mtu (~1500 bytes), userspace can no longer raise the MTU, the
    receive buffers remain small, and download speeds drop from >300 Mbps
    to ~0.8 Mbps.
    
    Introduce a FLAG_NOMAXMTU driver flag that allows individual usbnet
    drivers to opt out of the max_mtu cap. Set this flag in qmi_wwan's
    driver_info structures to restore the previous behavior for QMI devices,
    while keeping the safety fix in place for all other usbnet drivers.
    
    Fixes: c7159e960f14 ("usbnet: limit max_mtu based on device's hard_mtu")
    Cc: [email protected]
    Link: https://lore.kernel.org/lkml/CAPh3n803k8JcBPV5qEzUB-oKzWkAs-D5CU7z=Vd_nLRCr5ZqQg@mail.gmail.com/
    Reported-by: Koen Vandeputte <[email protected]>
    Tested-by: Daniele Palmas <[email protected]>
    Signed-off-by: Laurent Vivier <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

regulator: pca9450: Correct interrupt type [+ + +]

Author: Peng Fan <[email protected]>
Date:   Tue Mar 10 12:25:52 2026 +0800

    regulator: pca9450: Correct interrupt type
    
    [ Upstream commit 5d0efaf47ee90ac60efae790acee3a3ed99ebf80 ]
    
    Kernel warning on i.MX8MP-EVK when doing module test:
    irq: type mismatch, failed to map hwirq-3 for gpio@30200000!
    
    Per PCA945[X] specification: The IRQ_B pin is pulled low when any unmasked
    interrupt bit status is changed and it is released high once application
    processor read INT1 register.
    
    So the interrupt should be configured as IRQF_TRIGGER_LOW, not
    IRQF_TRIGGER_FALLING.
    
    Fixes: 0935ff5f1f0a4 ("regulator: pca9450: add pca9450 pmic driver")
    Signed-off-by: Peng Fan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

remoteproc: mediatek: Unprepare SCP clock during system suspend [+ + +]

Author: Tzung-Bi Shih <[email protected]>
Date:   Fri Feb 6 03:30:33 2026 +0000

    remoteproc: mediatek: Unprepare SCP clock during system suspend
    
    [ Upstream commit 35c3f72a2d55dbf52f28f4ecae51c76be1acf545 ]
    
    Prior to commit d935187cfb27 ("remoteproc: mediatek: Break lock
    dependency to prepare_lock"), `scp->clk` was prepared and enabled only
    when it needs to communicate with the SCP.  The commit d935187cfb27
    moved the prepare operation to remoteproc's prepare(), keeping the clock
    prepared as long as the SCP is running.
    
    The power consumption due to the prolonged clock preparation can be
    negligible when the system is running, as SCP is designed to be a very
    power efficient processor.
    
    However, the clock remains prepared even when the system enters system
    suspend.  This prevents the underlying clock controller (and potentially
    the parent PLLs) from shutting down, which increases power consumption
    and may block the system from entering deep sleep states.
    
    Add suspend and resume callbacks.  Unprepare the clock in suspend() if
    it was active and re-prepare it in resume() to ensure the clock is
    properly disabled during system suspend, while maintaining the "always
    prepared" semantics while the system is active.  The driver doesn't
    implement .attach() callback, hence it only checks for RPROC_RUNNING.
    
    Fixes: d935187cfb27 ("remoteproc: mediatek: Break lock dependency to prepare_lock")
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Signed-off-by: Tzung-Bi Shih <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Mathieu Poirier <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

remoteproc: sysmon: Correct subsys_name_len type in QMI request [+ + +]

Author: Bjorn Andersson <[email protected]>
Date:   Fri Feb 20 15:11:48 2026 -0600

    remoteproc: sysmon: Correct subsys_name_len type in QMI request
    
    [ Upstream commit da994db94e60f9a9411108ddf4d1836147ad4c9c ]
    
    The QMI message encoder has up until recently read a single byte (as
    elem_size == 1), but with the introduction of big endian support it's
    become apparent that this field is expected to be a full u32 -
    regardless of the size of the length in the encoded message (which is
    what elem_size specifies).
    
    The result is that the encoder now reads past the length byte and
    rejects the unreasonably large length formed when including the
    following 3 bytes from the subsys_name array.
    
    Fix this by changing to the expected type.
    
    Fixes: 1fb82ee806d1 ("remoteproc: qcom: Introduce sysmon")
    Signed-off-by: Bjorn Andersson <[email protected]>
    Reviewed-by: Chris Lew <[email protected]>
    Link: https://lore.kernel.org/r/20260220-qmi-encode-invalid-length-v2-1-5674be35ab29@oss.qualcomm.com
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Revert "arm64: dts: qcom: sdm845-oneplus: Mark l14a regulator as boot-on" [+ + +]

Author: Sasha Levin <[email protected]>
Date:   Sun Mar 15 03:16:41 2026 -0400

    Revert "arm64: dts: qcom: sdm845-oneplus: Mark l14a regulator as boot-on"
    
    This reverts commit a7037c3eb0130a6167138e69178895b22758d7f3.
    
    The backport applied regulator-boot-on to vreg_l12a_1p8 (ldo12) instead
    of vreg_l14a_1p88 (ldo14) due to identical surrounding context lines.
    
    Reported-by: Marco Mattiolo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Revert "tcpm: allow looking for role_sw device in the main node" [+ + +]

Author: Xu Yang <[email protected]>
Date:   Mon Mar 9 15:43:12 2026 +0800

    Revert "tcpm: allow looking for role_sw device in the main node"
    
    commit 6b275bfaa16be3fb1689fa6794e445ecd127a1b4 upstream.
    
    This reverts commit 1366cd228b0c67b60a2c0c26ef37fe9f7cfedb7f.
    
    The fwnode_usb_role_switch_get() returns NULL only if no connection is
    found, returns ERR_PTR(-EPROBE_DEFER) if connection is found but deferred
    probe is needed, or a valid pointer of usb_role_switch.
    
    When switching from a NULL check to IS_ERR_OR_NULL(), usb_role_switch_get()
    returns NULL and overwrites the ERR_PTR(-EPROBE_DEFER) returned by
    fwnode_usb_role_switch_get(). This causes the deferred probe indication to
    be lost, preventing the USB role switch from ever being retrieved.
    
    Fixes: 1366cd228b0c ("tcpm: allow looking for role_sw device in the main node")
    Cc: stable <[email protected]>
    Signed-off-by: Xu Yang <[email protected]>
    Tested-by: Arnaud Ferraris <[email protected]>
    Reviewed-by: Heikki Krogerus <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ring-buffer: Fix to update per-subbuf entries of persistent ring buffer [+ + +]

Author: Masami Hiramatsu (Google) <[email protected]>
Date:   Thu Mar 19 18:12:19 2026 +0900

    ring-buffer: Fix to update per-subbuf entries of persistent ring buffer
    
    commit f35dbac6942171dc4ce9398d1d216a59224590a9 upstream.
    
    Since the validation loop in rb_meta_validate_events() updates the same
    cpu_buffer->head_page->entries, the other subbuf entries are not updated.
    Fix to use head_page to update the entries field, since it is the cursor
    in this loop.
    
    Cc: [email protected]
    Cc: Mathieu Desnoyers <[email protected]>
    Cc: Ian Rogers <[email protected]>
    Fixes: 5f3b6e839f3c ("ring-buffer: Validate boot range memory events")
    Link: https://patch.msgid.link/177391153882.193994.17158784065013676533.stgit@mhiramat.tok.corp.google.com
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

rust: kbuild: allow `unused_features` [+ + +]

Author: Miguel Ojeda <[email protected]>
Date:   Thu Mar 12 12:10:14 2026 +0100

    rust: kbuild: allow `unused_features`
    
    commit 592c61f3bfceaa29f8275696bd67c3dfad7ef72e upstream.
    
    Starting with the upcoming Rust 1.96.0 (to be released 2026-05-28),
    `rustc` introduces the new lint `unused_features` [1], which warns [2]:
    
        warning: feature `used_with_arg` is declared but not used
         --> <crate attribute>:1:93
          |
        1 | #![feature(asm_const,asm_goto,arbitrary_self_types,lint_reasons,offset_of_nested,raw_ref_op,used_with_arg)]
          |                                                                                             ^^^^^^^^^^^^^
          |
          = note: `#[warn(unused_features)]` (part of `#[warn(unused)]`) on by default
    
    The original goal of using `-Zcrate-attr` automatically was that there
    is a consistent set of features enabled and managed globally for all
    Rust kernel code (modulo exceptions like the `rust/` crated).
    
    While we could require crates to enable features manually (even if we
    still keep the `-Zallow-features=` list, i.e. removing the `-Zcrate-attr`
    list), it is not really worth making all developers worry about it just
    for a new lint.
    
    The features are expected to eventually become stable anyway (most already
    did), and thus having to remove features in every file that may use them
    is not worth it either.
    
    Thus just allow the new lint globally.
    
    The lint actually existed for a long time, which is why `rustc` does
    not complain about an unknown lint in the stable versions we support,
    but it was "disabled" years ago [3], and now it was made to work again.
    
    For extra context, the new implementation of the lint has already been
    improved to avoid linting about features that became stable thanks to
    Benno's report and the ensuing discussion [4] [5], but while that helps,
    it is still the case that we may have features enabled that are not used
    for one reason or another in a particular crate.
    
    Cc: [email protected] # Needed in 6.12.y and later (Rust is pinned in older LTSs).
    Link: https://github.com/rust-lang/rust/pull/152164 [1]
    Link: https://github.com/Rust-for-Linux/pin-init/pull/114 [2]
    Link: https://github.com/rust-lang/rust/issues/44232 [3]
    Link: https://github.com/rust-lang/rust/issues/153523 [4]
    Link: https://github.com/rust-lang/rust/pull/153610 [5]
    Reviewed-by: Benno Lossin <[email protected]>
    Reviewed-by: Gary Guo <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Miguel Ojeda <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

rxrpc: Fix recvmsg() unconditional requeue [+ + +]

Author: David Howells <[email protected]>
Date:   Thu Feb 26 16:23:46 2026 +0800

    rxrpc: Fix recvmsg() unconditional requeue
    
    [ Upstream commit 2c28769a51deb6022d7fbd499987e237a01dd63a ]
    
    If rxrpc_recvmsg() fails because MSG_DONTWAIT was specified but the call at
    the front of the recvmsg queue already has its mutex locked, it requeues
    the call - whether or not the call is already queued.  The call may be on
    the queue because MSG_PEEK was also passed and so the call was not dequeued
    or because the I/O thread requeued it.
    
    The unconditional requeue may then corrupt the recvmsg queue, leading to
    things like UAFs or refcount underruns.
    
    Fix this by only requeuing the call if it isn't already on the queue - and
    moving it to the front if it is already queued.  If we don't queue it, we
    have to put the ref we obtained by dequeuing it.
    
    Also, MSG_PEEK doesn't dequeue the call so shouldn't call
    rxrpc_notify_socket() for the call if we didn't use up all the data on the
    queue, so fix that also.
    
    Fixes: 540b1c48c37a ("rxrpc: Fix deadlock between call creation and sendmsg/recvmsg")
    Reported-by: Faith <[email protected]>
    Reported-by: Pumpkin Chang <[email protected]>
    Signed-off-by: David Howells <[email protected]>
    Acked-by: Marc Dionne <[email protected]>
    cc: Nir Ohfeld <[email protected]>
    cc: Willy Tarreau <[email protected]>
    cc: Simon Horman <[email protected]>
    cc: [email protected]
    cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    [Use spin_unlock instead of spin_unlock_irq to maintain context consistency.]
    Signed-off-by: Robert Garcia <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/dasd: Copy detected format information to secondary device [+ + +]

Author: Stefan Haberland <[email protected]>
Date:   Tue Mar 10 15:23:30 2026 +0100

    s390/dasd: Copy detected format information to secondary device
    
    commit 4c527c7e030672efd788d0806d7a68972a7ba3c1 upstream.
    
    During online processing for a DASD device an IO operation is started to
    determine the format of the device. CDL format contains specifically
    sized blocks at the beginning of the disk.
    
    For a PPRC secondary device no real IO operation is possible therefore
    this IO request can not be started and this step is skipped for online
    processing of secondary devices. This is generally fine since the
    secondary is a copy of the primary device.
    
    In case of an additional partition detection that is run after a swap
    operation the format information is needed to properly drive partition
    detection IO.
    
    Currently the information is not passed leading to IO errors during
    partition detection and a wrongly detected partition table which in turn
    might lead to data corruption on the disk with the wrong partition table.
    
    Fix by passing the format information from primary to secondary device.
    
    Fixes: 413862caad6f ("s390/dasd: add copy pair swap capability")
    Cc: [email protected] #6.1
    Reviewed-by: Jan Hoeppner <[email protected]>
    Acked-by: Eduard Shishkin <[email protected]>
    Signed-off-by: Stefan Haberland <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/dasd: Move quiesce state with pprc swap [+ + +]

Author: Stefan Haberland <[email protected]>
Date:   Tue Mar 10 15:23:29 2026 +0100

    s390/dasd: Move quiesce state with pprc swap
    
    commit 40e9cd4ae8ec43b107ed2bff422a8fa39dcf4e4b upstream.
    
    Quiesce and resume is a mechanism to suspend operations on DASD devices.
    In the context of a controlled copy pair swap operation, the quiesce
    operation is usually issued before the actual swap and a resume
    afterwards.
    
    During the swap operation, the underlying device is exchanged. Therefore,
    the quiesce flag must be moved to the secondary device to ensure a
    consistent quiesce state after the swap.
    
    The secondary device itself cannot be suspended separately because there
    is no separate block device representation for it.
    
    Fixes: 413862caad6f ("s390/dasd: add copy pair swap capability")
    Cc: [email protected] #6.1
    Reviewed-by: Jan Hoeppner <[email protected]>
    Signed-off-by: Stefan Haberland <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/pfault: Fix virtual vs physical address confusion [+ + +]

Author: Alexander Gordeev <[email protected]>
Date:   Tue Feb 24 07:41:07 2026 +0100

    s390/pfault: Fix virtual vs physical address confusion
    
    commit d879ac6756b662a085a743e76023c768c3241579 upstream.
    
    When Linux is running as guest, runs a user space process and the
    user space process accesses a page that the host has paged out,
    the guest gets a pfault interrupt and schedules a different process.
    Without this mechanism the host would have to suspend the whole
    virtual CPU until the page has been paged in.
    
    To setup the pfault interrupt the real address of parameter list
    should be passed to DIAGNOSE 0x258, but a virtual address is passed
    instead.
    
    That has a performance impact, since the pfault setup never succeeds,
    the interrupt is never delivered to a guest and the whole virtual CPU
    is suspended as result.
    
    Cc: [email protected]
    Fixes: c98d2ecae08f ("s390/mm: Uncouple physical vs virtual address spaces")
    Reported-by: Claudio Imbrenda <[email protected]>
    Reviewed-by: Heiko Carstens <[email protected]>
    Signed-off-by: Alexander Gordeev <[email protected]>
    Signed-off-by: Vasily Gorbik <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/stackleak: Fix __stackleak_poison() inline assembly constraint [+ + +]

Author: Heiko Carstens <[email protected]>
Date:   Mon Mar 2 14:35:00 2026 +0100

    s390/stackleak: Fix __stackleak_poison() inline assembly constraint
    
    commit 674c5ff0f440a051ebf299d29a4c013133d81a65 upstream.
    
    The __stackleak_poison() inline assembly comes with a "count" operand where
    the "d" constraint is used. "count" is used with the exrl instruction and
    "d" means that the compiler may allocate any register from 0 to 15.
    
    If the compiler would allocate register 0 then the exrl instruction would
    not or the value of "count" into the executed instruction - resulting in a
    stackframe which is only partially poisoned.
    
    Use the correct "a" constraint, which excludes register 0 from register
    allocation.
    
    Fixes: 2a405f6bb3a5 ("s390/stackleak: provide fast __stackleak_poison() implementation")
    Cc: [email protected]
    Signed-off-by: Heiko Carstens <[email protected]>
    Reviewed-by: Vasily Gorbik <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Vasily Gorbik <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/xor: Fix xor_xc_2() inline assembly constraints [+ + +]

Author: Heiko Carstens <[email protected]>
Date:   Mon Mar 2 14:34:58 2026 +0100

    s390/xor: Fix xor_xc_2() inline assembly constraints
    
    commit f775276edc0c505dc0f782773796c189f31a1123 upstream.
    
    The inline assembly constraints for xor_xc_2() are incorrect. "bytes",
    "p1", and "p2" are input operands, while all three of them are modified
    within the inline assembly. Given that the function consists only of this
    inline assembly it seems unlikely that this may cause any problems, however
    fix this in any case.
    
    Fixes: 2cfc5f9ce7f5 ("s390/xor: optimized xor routing using the XC instruction")
    Cc: [email protected]
    Signed-off-by: Heiko Carstens <[email protected]>
    Reviewed-by: Vasily Gorbik <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Vasily Gorbik <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/zcrypt: Enable AUTOSEL_DOM for CCA serialnr sysfs attribute [+ + +]

Author: Harald Freudenberger <[email protected]>
Date:   Thu Mar 19 06:50:31 2026 -0400

    s390/zcrypt: Enable AUTOSEL_DOM for CCA serialnr sysfs attribute
    
    [ Upstream commit 598bbefa8032cc58b564a81d1ad68bd815c8dc0f ]
    
    The serialnr sysfs attribute for CCA cards when queried always
    used the default domain for sending the request down to the card.
    If for any reason exactly this default domain is disabled then
    the attribute code fails to retrieve the CCA info and the sysfs
    entry shows an empty string. Works as designed but the serial
    number is a card attribute and thus it does not matter which
    domain is used for the query. So if there are other domains on
    this card available, these could be used.
    
    So extend the code to use AUTOSEL_DOM for the domain value to
    address any online domain within the card for querying the cca
    info and thus show the serialnr as long as there is one domain
    usable regardless of the default domain setting.
    
    Fixes: 8f291ebf3270 ("s390/zcrypt: enable card/domain autoselect on ep11 cprbs")
    Suggested-by: Ingo Franzki <[email protected]>
    Signed-off-by: Harald Freudenberger <[email protected]>
    Reviewed-by: Ingo Franzki <[email protected]>
    Cc: [email protected]
    Signed-off-by: Vasily Gorbik <[email protected]>
    [ preserved zc->online as the fourth argument to cca_get_info() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched/fair: Fix zero_vruntime tracking [+ + +]

Author: Peter Zijlstra <[email protected]>
Date:   Mon Feb 9 15:28:16 2026 +0100

    sched/fair: Fix zero_vruntime tracking
    
    commit b3d99f43c72b56cf7a104a364e7fb34b0702828b upstream.
    
    It turns out that zero_vruntime tracking is broken when there is but a single
    task running. Current update paths are through __{en,de}queue_entity(), and
    when there is but a single task, pick_next_task() will always return that one
    task, and put_prev_set_next_task() will end up in neither function.
    
    This can cause entity_key() to grow indefinitely large and cause overflows,
    leading to much pain and suffering.
    
    Furtermore, doing update_zero_vruntime() from __{de,en}queue_entity(), which
    are called from {set_next,put_prev}_entity() has problems because:
    
     - set_next_entity() calls __dequeue_entity() before it does cfs_rq->curr = se.
       This means the avg_vruntime() will see the removal but not current, missing
       the entity for accounting.
    
     - put_prev_entity() calls __enqueue_entity() before it does cfs_rq->curr =
       NULL. This means the avg_vruntime() will see the addition *and* current,
       leading to double accounting.
    
    Both cases are incorrect/inconsistent.
    
    Noting that avg_vruntime is already called on each {en,de}queue, remove the
    explicit avg_vruntime() calls (which removes an extra 64bit division for each
    {en,de}queue) and have avg_vruntime() update zero_vruntime itself.
    
    Additionally, have the tick call avg_vruntime() -- discarding the result, but
    for the side-effect of updating zero_vruntime.
    
    While there, optimize avg_vruntime() by noting that the average of one value is
    rather trivial to compute.
    
    Test case:
      # taskset -c -p 1 $$
      # taskset -c 2 bash -c 'while :; do :; done&'
      # cat /sys/kernel/debug/sched/debug | awk '/^cpu#/ {P=0} /^cpu#2,/ {P=1} {if (P) print $0}' | grep -e zero_vruntime -e "^>"
    
    PRE:
        .zero_vruntime                 : 31316.407903
      >R            bash   487     50787.345112   E       50789.145972           2.800000     50780.298364        16     120         0.000000         0.000000         0.000000        /
        .zero_vruntime                 : 382548.253179
      >R            bash   487    427275.204288   E      427276.003584           2.800000    427268.157540        23     120         0.000000         0.000000         0.000000        /
    
    POST:
        .zero_vruntime                 : 17259.709467
      >R            bash   526     17259.709467   E       17262.509467           2.800000     16915.031624         9     120         0.000000         0.000000         0.000000        /
        .zero_vruntime                 : 18702.723356
      >R            bash   526     18702.723356   E       18705.523356           2.800000     18358.045513         9     120         0.000000         0.000000         0.000000        /
    
    Fixes: 79f3f9bedd14 ("sched/eevdf: Fix min_vruntime vs avg_vruntime")
    Reported-by: K Prateek Nayak <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Tested-by: K Prateek Nayak <[email protected]>
    Tested-by: Shubhang Kaushik <[email protected]>
    Link: https://patch.msgid.link/20260219080624.438854780%40infradead.org
    Tested-by: Eric Hagberg <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched: idle: Consolidate the handling of two special cases [+ + +]

Author: Rafael J. Wysocki <[email protected]>
Date:   Fri Mar 13 13:25:41 2026 +0100

    sched: idle: Consolidate the handling of two special cases
    
    [ Upstream commit f4c31b07b136839e0fb3026f8a5b6543e3b14d2f ]
    
    There are two special cases in the idle loop that are handled
    inconsistently even though they are analogous.
    
    The first one is when a cpuidle driver is absent and the default CPU
    idle time power management implemented by the architecture code is used.
    In that case, the scheduler tick is stopped every time before invoking
    default_idle_call().
    
    The second one is when a cpuidle driver is present, but there is only
    one idle state in its table.  In that case, the scheduler tick is never
    stopped at all.
    
    Since each of these approaches has its drawbacks, reconcile them with
    the help of one simple heuristic.  Namely, stop the tick if the CPU has
    been woken up by it in the previous iteration of the idle loop, or let
    it tick otherwise.
    
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Reviewed-by: Christian Loehle <[email protected]>
    Reviewed-by: Frederic Weisbecker <[email protected]>
    Reviewed-by: Qais Yousef <[email protected]>
    Reviewed-by: Aboorva Devarajan <[email protected]>
    Fixes: ed98c3491998 ("sched: idle: Do not stop the tick before cpuidle_idle_call()")
    [ rjw: Added Fixes tag, changelog edits ]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

sched: idle: Make skipping governor callbacks more consistent [+ + +]

Author: Rafael J. Wysocki <[email protected]>
Date:   Sat Mar 7 17:12:05 2026 +0100

    sched: idle: Make skipping governor callbacks more consistent
    
    [ Upstream commit d557640e4ce589a24dca5ca7ce3b9680f471325f ]
    
    If the cpuidle governor .select() callback is skipped because there
    is only one idle state in the cpuidle driver, the .reflect() callback
    should be skipped as well, at least for consistency (if not for
    correctness), so do it.
    
    Fixes: e5c9ffc6ae1b ("cpuidle: Skip governor when only one idle state is available")
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Reviewed-by: Christian Loehle <[email protected]>
    Reviewed-by: Aboorva Devarajan <[email protected]>
    Reviewed-by: Frederic Weisbecker <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

sched_ext: Disable preemption between scx_claim_exit() and kicking helper work [+ + +]

Author: Tejun Heo <[email protected]>
Date:   Tue Mar 17 19:49:26 2026 -0400

    sched_ext: Disable preemption between scx_claim_exit() and kicking helper work
    
    [ Upstream commit 83236b2e43dba00bee5b82eb5758816b1a674f6a ]
    
    scx_claim_exit() atomically sets exit_kind, which prevents scx_error() from
    triggering further error handling. After claiming exit, the caller must kick
    the helper kthread work which initiates bypass mode and teardown.
    
    If the calling task gets preempted between claiming exit and kicking the
    helper work, and the BPF scheduler fails to schedule it back (since error
    handling is now disabled), the helper work is never queued, bypass mode
    never activates, tasks stop being dispatched, and the system wedges.
    
    Disable preemption across scx_claim_exit() and the subsequent work kicking
    in all callers - scx_disable() and scx_vexit(). Add
    lockdep_assert_preemption_disabled() to scx_claim_exit() to enforce the
    requirement.
    
    Fixes: f0e1a0643a59 ("sched_ext: Implement BPF extensible scheduler class")
    Cc: [email protected] # v6.12+
    Reviewed-by: Andrea Righi <[email protected]>
    Signed-off-by: Tejun Heo <[email protected]>
    [ adapted per-scheduler struct (sch->exit_kind, scx_disable, scx_vexit) to global variables (scx_exit_kind, scx_ops_disable, scx_ops_exit_kind) ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched_ext: Fix starvation of scx_enable() under fair-class saturation [+ + +]

Author: Tejun Heo <[email protected]>
Date:   Tue Mar 17 20:10:24 2026 -0400

    sched_ext: Fix starvation of scx_enable() under fair-class saturation
    
    [ Upstream commit b06ccbabe2506fd70b9167a644978b049150224a ]
    
    During scx_enable(), the READY -> ENABLED task switching loop changes the
    calling thread's sched_class from fair to ext. Since fair has higher
    priority than ext, saturating fair-class workloads can indefinitely starve
    the enable thread, hanging the system. This was introduced when the enable
    path switched from preempt_disable() to scx_bypass() which doesn't protect
    against fair-class starvation. Note that the original preempt_disable()
    protection wasn't complete either - in partial switch modes, the calling
    thread could still be starved after preempt_enable() as it may have been
    switched to ext class.
    
    Fix it by offloading the enable body to a dedicated system-wide RT
    (SCHED_FIFO) kthread which cannot be starved by either fair or ext class
    tasks. scx_enable() lazily creates the kthread on first use and passes the
    ops pointer through a struct scx_enable_cmd containing the kthread_work,
    then synchronously waits for completion.
    
    The workfn runs on a different kthread from sch->helper (which runs
    disable_work), so it can safely flush disable_work on the error path
    without deadlock.
    
    Fixes: 8c2090c504e9 ("sched_ext: Initialize in bypass mode")
    Cc: [email protected] # v6.12+
    Signed-off-by: Tejun Heo <[email protected]>
    [ adapted per-scheduler scx_sched struct references to globals ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched_ext: Remove redundant css_put() in scx_cgroup_init() [+ + +]

Author: Cheng-Yang Chou <[email protected]>
Date:   Tue Mar 3 22:35:30 2026 +0800

    sched_ext: Remove redundant css_put() in scx_cgroup_init()
    
    commit 1336b579f6079fb8520be03624fcd9ba443c930b upstream.
    
    The iterator css_for_each_descendant_pre() walks the cgroup hierarchy
    under cgroup_lock(). It does not increment the reference counts on
    yielded css structs.
    
    According to the cgroup documentation, css_put() should only be used
    to release a reference obtained via css_get() or css_tryget_online().
    Since the iterator does not use either of these to acquire a reference,
    calling css_put() in the error path of scx_cgroup_init() causes a
    refcount underflow.
    
    Remove the unbalanced css_put() to prevent a potential Use-After-Free
    (UAF) vulnerability.
    
    Fixes: 819513666966 ("sched_ext: Add cgroup support")
    Cc: [email protected] # v6.12+
    Signed-off-by: Cheng-Yang Chou <[email protected]>
    Reviewed-by: Andrea Righi <[email protected]>
    Signed-off-by: Tejun Heo <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: core: Fix error handling for scsi_alloc_sdev() [+ + +]

Author: Junxiao Bi <[email protected]>
Date:   Wed Mar 4 08:46:03 2026 -0800

    scsi: core: Fix error handling for scsi_alloc_sdev()
    
    commit 4ce7ada40c008fa21b7e52ab9d04e8746e2e9325 upstream.
    
    After scsi_sysfs_device_initialize() was called, error paths must call
    __scsi_remove_device().
    
    Fixes: 1ac22c8eae81 ("scsi: core: Fix refcount leak for tagset_refcnt")
    Cc: [email protected]
    Signed-off-by: Junxiao Bi <[email protected]>
    Reviewed-by: John Garry <[email protected]>
    Reviewed-by: Bart Van Assche <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: hisi_sas: Add time interval between two H2D FIS following soft reset spec [+ + +]

Author: Xingui Yang <[email protected]>
Date:   Tue Oct 8 10:18:19 2024 +0800

    scsi: hisi_sas: Add time interval between two H2D FIS following soft reset spec
    
    [ Upstream commit 3c62791322e42d1afd65acfdb5b3a371bde21ede ]
    
    Spec says at least 5us between two H2D FIS when do soft reset, but be
    generous and sleep for about 1ms.
    
    Signed-off-by: Xingui Yang <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Reviewed-by: Yihang Li <[email protected]>
    Signed-off-by: Martin K. Petersen <[email protected]>
    Stable-dep-of: 8ddc0c269165 ("scsi: hisi_sas: Fix NULL pointer exception during user_scan()")
    Signed-off-by: Sasha Levin <[email protected]>

scsi: hisi_sas: Fix NULL pointer exception during user_scan() [+ + +]

Author: Xingui Yang <[email protected]>
Date:   Thu Mar 5 14:40:39 2026 +0800

    scsi: hisi_sas: Fix NULL pointer exception during user_scan()
    
    [ Upstream commit 8ddc0c26916574395447ebf4cff684314f6873a9 ]
    
    user_scan() invokes updated sas_user_scan() for channel 0, and if
    successful, iteratively scans remaining channels (1 to shost->max_channel)
    via scsi_scan_host_selected() in commit 37c4e72b0651 ("scsi: Fix
    sas_user_scan() to handle wildcard and multi-channel scans"). However,
    hisi_sas supports only one channel, and the current value of max_channel is
    1. sas_user_scan() for channel 1 will trigger the following NULL pointer
    exception:
    
    [  441.554662] Unable to handle kernel NULL pointer dereference at virtual address 00000000000008b0
    [  441.554699] Mem abort info:
    [  441.554710]   ESR = 0x0000000096000004
    [  441.554718]   EC = 0x25: DABT (current EL), IL = 32 bits
    [  441.554723]   SET = 0, FnV = 0
    [  441.554726]   EA = 0, S1PTW = 0
    [  441.554730]   FSC = 0x04: level 0 translation fault
    [  441.554735] Data abort info:
    [  441.554737]   ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
    [  441.554742]   CM = 0, WnR = 0, TnD = 0, TagAccess = 0
    [  441.554747]   GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
    [  441.554752] user pgtable: 4k pages, 48-bit VAs, pgdp=00000828377a6000
    [  441.554757] [00000000000008b0] pgd=0000000000000000, p4d=0000000000000000
    [  441.554769] Internal error: Oops: 0000000096000004 [#1]  SMP
    [  441.629589] Modules linked in: arm_spe_pmu arm_smmuv3_pmu tpm_tis_spi hisi_uncore_sllc_pmu hisi_uncore_pa_pmu hisi_uncore_l3c_pmu hisi_uncore_hha_pmu hisi_uncore_ddrc_pmu hisi_uncore_cpa_pmu hns3_pmu hisi_ptt hisi_pcie_pmu tpm_tis_core spidev spi_hisi_sfc_v3xx hisi_uncore_pmu spi_dw_mmio fuse hclge hclge_common hisi_sec2 hisi_hpre hisi_zip hisi_qm hns3 hisi_sas_v3_hw sm3_ce sbsa_gwdt hnae3 hisi_sas_main uacce hisi_dma i2c_hisi dm_mirror dm_region_hash dm_log dm_mod
    [  441.670819] CPU: 46 UID: 0 PID: 6994 Comm: bash Kdump: loaded Not tainted 7.0.0-rc2+ #84 PREEMPT
    [  441.691327] pstate: 81400009 (Nzcv daif +PAN -UAO -TCO +DIT -SSBS BTYPE=--)
    [  441.698277] pc : sas_find_dev_by_rphy+0x44/0x118
    [  441.702896] lr : sas_find_dev_by_rphy+0x3c/0x118
    [  441.707502] sp : ffff80009abbba40
    [  441.710805] x29: ffff80009abbba40 x28: ffff082819a40008 x27: ffff082810c37c08
    [  441.717930] x26: ffff082810c37c28 x25: ffff082819a40290 x24: ffff082810c37c00
    [  441.725054] x23: 0000000000000000 x22: 0000000000000001 x21: ffff082819a40000
    [  441.732179] x20: ffff082819a40290 x19: 0000000000000000 x18: 0000000000000020
    [  441.739304] x17: 0000000000000000 x16: ffffb5dad6bda690 x15: 00000000ffffffff
    [  441.746428] x14: ffff082814c3b26c x13: 00000000ffffffff x12: ffff082814c3b26a
    [  441.753553] x11: 00000000000000c0 x10: 000000000000003a x9 : ffffb5dad5ea94f4
    [  441.760678] x8 : 000000000000003a x7 : ffff80009abbbab0 x6 : 0000000000000030
    [  441.767802] x5 : 0000000000000000 x4 : 0000000000000000 x3 : 0000000000000000
    [  441.774926] x2 : ffff08280f35a300 x1 : ffffb5dad7127180 x0 : 0000000000000000
    [  441.782053] Call trace:
    [  441.784488]  sas_find_dev_by_rphy+0x44/0x118 (P)
    [  441.789095]  sas_target_alloc+0x24/0xb0
    [  441.792920]  scsi_alloc_target+0x290/0x330
    [  441.797010]  __scsi_scan_target+0x88/0x258
    [  441.801096]  scsi_scan_channel+0x74/0xb8
    [  441.805008]  scsi_scan_host_selected+0x170/0x188
    [  441.809615]  sas_user_scan+0xfc/0x148
    [  441.813267]  store_scan+0x10c/0x180
    [  441.816743]  dev_attr_store+0x20/0x40
    [  441.820398]  sysfs_kf_write+0x84/0xa8
    [  441.824054]  kernfs_fop_write_iter+0x130/0x1c8
    [  441.828487]  vfs_write+0x2c0/0x370
    [  441.831880]  ksys_write+0x74/0x118
    [  441.835271]  __arm64_sys_write+0x24/0x38
    [  441.839182]  invoke_syscall+0x50/0x120
    [  441.842919]  el0_svc_common.constprop.0+0xc8/0xf0
    [  441.847611]  do_el0_svc+0x24/0x38
    [  441.850913]  el0_svc+0x38/0x158
    [  441.854043]  el0t_64_sync_handler+0xa0/0xe8
    [  441.858214]  el0t_64_sync+0x1ac/0x1b0
    [  441.861865] Code: aa1303e0 97ff70a8 34ffff80 d10a4273 (f9445a75)
    [  441.867946] ---[ end trace 0000000000000000 ]---
    
    Therefore, set max_channel to 0.
    
    Fixes: e21fe3a52692 ("scsi: hisi_sas: add initialisation for v3 pci-based controller")
    Signed-off-by: Xingui Yang <[email protected]>
    Signed-off-by: Yihang Li <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: hisi_sas: Use macro instead of magic number [+ + +]

Author: Yihang Li <[email protected]>
Date:   Mon Apr 14 16:08:42 2025 +0800

    scsi: hisi_sas: Use macro instead of magic number
    
    [ Upstream commit 4ca7fe99fc8485fcd04b367f37dc7a48f1355419 ]
    
    The hisi_sas driver has a large number of magic numbers which makes for
    unfriendly code reading. Use macro definitions instead.
    
    Signed-off-by: Yihang Li <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Stable-dep-of: 8ddc0c269165 ("scsi: hisi_sas: Fix NULL pointer exception during user_scan()")
    Signed-off-by: Sasha Levin <[email protected]>

scsi: mpi3mr: Add NULL checks when resetting request and reply queues [+ + +]

Author: Ranjan Kumar <[email protected]>
Date:   Thu Feb 12 12:30:26 2026 +0530

    scsi: mpi3mr: Add NULL checks when resetting request and reply queues
    
    [ Upstream commit fa96392ebebc8fade2b878acb14cce0f71016503 ]
    
    The driver encountered a crash during resource cleanup when the reply and
    request queues were NULL due to freed memory.  This issue occurred when the
    creation of reply or request queues failed, and the driver freed the memory
    first, but attempted to mem set the content of the freed memory, leading to
    a system crash.
    
    Add NULL pointer checks for reply and request queues before accessing the
    reply/request memory during cleanup
    
    Signed-off-by: Ranjan Kumar <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: ses: Fix devices attaching to different hosts [+ + +]

Author: Tomas Henzl <[email protected]>
Date:   Tue Feb 10 20:18:50 2026 +0100

    scsi: ses: Fix devices attaching to different hosts
    
    [ Upstream commit 70ca8caa96ce473647054f5c7b9dab5423902402 ]
    
    On a multipath SAS system some devices don't end up with correct symlinks
    from the SCSI device to its enclosure. Some devices even have enclosure
    links pointing to enclosures attached to different SCSI hosts.
    
    ses_match_to_enclosure() calls enclosure_for_each_device() which iterates
    over all enclosures on the system, not just enclosures attached to the
    current SCSI host.
    
    Replace the iteration with a direct call to ses_enclosure_find_by_addr().
    
    Reviewed-by: David Jeffery <[email protected]>
    Signed-off-by: Tomas Henzl <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: storvsc: Fix scheduling while atomic on PREEMPT_RT [+ + +]

Author: Jan Kiszka <[email protected]>
Date:   Thu Jan 29 15:30:39 2026 +0100

    scsi: storvsc: Fix scheduling while atomic on PREEMPT_RT
    
    [ Upstream commit 57297736c08233987e5d29ce6584c6ca2a831b12 ]
    
    This resolves the follow splat and lock-up when running with PREEMPT_RT
    enabled on Hyper-V:
    
    [  415.140818] BUG: scheduling while atomic: stress-ng-iomix/1048/0x00000002
    [  415.140822] INFO: lockdep is turned off.
    [  415.140823] Modules linked in: intel_rapl_msr intel_rapl_common intel_uncore_frequency_common intel_pmc_core pmt_telemetry pmt_discovery pmt_class intel_pmc_ssram_telemetry intel_vsec ghash_clmulni_intel aesni_intel rapl binfmt_misc nls_ascii nls_cp437 vfat fat snd_pcm hyperv_drm snd_timer drm_client_lib drm_shmem_helper snd sg soundcore drm_kms_helper pcspkr hv_balloon hv_utils evdev joydev drm configfs efi_pstore nfnetlink vsock_loopback vmw_vsock_virtio_transport_common hv_sock vmw_vsock_vmci_transport vsock vmw_vmci efivarfs autofs4 ext4 crc16 mbcache jbd2 sr_mod sd_mod cdrom hv_storvsc serio_raw hid_generic scsi_transport_fc hid_hyperv scsi_mod hid hv_netvsc hyperv_keyboard scsi_common
    [  415.140846] Preemption disabled at:
    [  415.140847] [<ffffffffc0656171>] storvsc_queuecommand+0x2e1/0xbe0 [hv_storvsc]
    [  415.140854] CPU: 8 UID: 0 PID: 1048 Comm: stress-ng-iomix Not tainted 6.19.0-rc7 #30 PREEMPT_{RT,(full)}
    [  415.140856] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS Hyper-V UEFI Release v4.1 09/04/2024
    [  415.140857] Call Trace:
    [  415.140861]  <TASK>
    [  415.140861]  ? storvsc_queuecommand+0x2e1/0xbe0 [hv_storvsc]
    [  415.140863]  dump_stack_lvl+0x91/0xb0
    [  415.140870]  __schedule_bug+0x9c/0xc0
    [  415.140875]  __schedule+0xdf6/0x1300
    [  415.140877]  ? rtlock_slowlock_locked+0x56c/0x1980
    [  415.140879]  ? rcu_is_watching+0x12/0x60
    [  415.140883]  schedule_rtlock+0x21/0x40
    [  415.140885]  rtlock_slowlock_locked+0x502/0x1980
    [  415.140891]  rt_spin_lock+0x89/0x1e0
    [  415.140893]  hv_ringbuffer_write+0x87/0x2a0
    [  415.140899]  vmbus_sendpacket_mpb_desc+0xb6/0xe0
    [  415.140900]  ? rcu_is_watching+0x12/0x60
    [  415.140902]  storvsc_queuecommand+0x669/0xbe0 [hv_storvsc]
    [  415.140904]  ? HARDIRQ_verbose+0x10/0x10
    [  415.140908]  ? __rq_qos_issue+0x28/0x40
    [  415.140911]  scsi_queue_rq+0x760/0xd80 [scsi_mod]
    [  415.140926]  __blk_mq_issue_directly+0x4a/0xc0
    [  415.140928]  blk_mq_issue_direct+0x87/0x2b0
    [  415.140931]  blk_mq_dispatch_queue_requests+0x120/0x440
    [  415.140933]  blk_mq_flush_plug_list+0x7a/0x1a0
    [  415.140935]  __blk_flush_plug+0xf4/0x150
    [  415.140940]  __submit_bio+0x2b2/0x5c0
    [  415.140944]  ? submit_bio_noacct_nocheck+0x272/0x360
    [  415.140946]  submit_bio_noacct_nocheck+0x272/0x360
    [  415.140951]  ext4_read_bh_lock+0x3e/0x60 [ext4]
    [  415.140995]  ext4_block_write_begin+0x396/0x650 [ext4]
    [  415.141018]  ? __pfx_ext4_da_get_block_prep+0x10/0x10 [ext4]
    [  415.141038]  ext4_da_write_begin+0x1c4/0x350 [ext4]
    [  415.141060]  generic_perform_write+0x14e/0x2c0
    [  415.141065]  ext4_buffered_write_iter+0x6b/0x120 [ext4]
    [  415.141083]  vfs_write+0x2ca/0x570
    [  415.141087]  ksys_write+0x76/0xf0
    [  415.141089]  do_syscall_64+0x99/0x1490
    [  415.141093]  ? rcu_is_watching+0x12/0x60
    [  415.141095]  ? finish_task_switch.isra.0+0xdf/0x3d0
    [  415.141097]  ? rcu_is_watching+0x12/0x60
    [  415.141098]  ? lock_release+0x1f0/0x2a0
    [  415.141100]  ? rcu_is_watching+0x12/0x60
    [  415.141101]  ? finish_task_switch.isra.0+0xe4/0x3d0
    [  415.141103]  ? rcu_is_watching+0x12/0x60
    [  415.141104]  ? __schedule+0xb34/0x1300
    [  415.141106]  ? hrtimer_try_to_cancel+0x1d/0x170
    [  415.141109]  ? do_nanosleep+0x8b/0x160
    [  415.141111]  ? hrtimer_nanosleep+0x89/0x100
    [  415.141114]  ? __pfx_hrtimer_wakeup+0x10/0x10
    [  415.141116]  ? xfd_validate_state+0x26/0x90
    [  415.141118]  ? rcu_is_watching+0x12/0x60
    [  415.141120]  ? do_syscall_64+0x1e0/0x1490
    [  415.141121]  ? do_syscall_64+0x1e0/0x1490
    [  415.141123]  ? rcu_is_watching+0x12/0x60
    [  415.141124]  ? do_syscall_64+0x1e0/0x1490
    [  415.141125]  ? do_syscall_64+0x1e0/0x1490
    [  415.141127]  ? irqentry_exit+0x140/0x7e0
    [  415.141129]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    get_cpu() disables preemption while the spinlock hv_ringbuffer_write is
    using is converted to an rt-mutex under PREEMPT_RT.
    
    Signed-off-by: Jan Kiszka <[email protected]>
    Tested-by: Florian Bezdeka <[email protected]>
    Reviewed-by: Michael Kelley <[email protected]>
    Tested-by: Michael Kelley <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: ufs: core: Fix possible NULL pointer dereference in ufshcd_add_command_trace() [+ + +]

Author: Peter Wang <[email protected]>
Date:   Mon Feb 23 14:56:09 2026 +0800

    scsi: ufs: core: Fix possible NULL pointer dereference in ufshcd_add_command_trace()
    
    [ Upstream commit 30df81f2228d65bddf492db3929d9fcaffd38fc5 ]
    
    The kernel log indicates a crash in ufshcd_add_command_trace, due to a NULL
    pointer dereference when accessing hwq->id.  This can happen if
    ufshcd_mcq_req_to_hwq() returns NULL.
    
    This patch adds a NULL check for hwq before accessing its id field to
    prevent a kernel crash.
    
    Kernel log excerpt:
    [<ffffffd5d192dc4c>] notify_die+0x4c/0x8c
    [<ffffffd5d1814e58>] __die+0x60/0xb0
    [<ffffffd5d1814d64>] die+0x4c/0xe0
    [<ffffffd5d181575c>] die_kernel_fault+0x74/0x88
    [<ffffffd5d1864db4>] __do_kernel_fault+0x314/0x318
    [<ffffffd5d2a3cdf8>] do_page_fault+0xa4/0x5f8
    [<ffffffd5d2a3cd34>] do_translation_fault+0x34/0x54
    [<ffffffd5d1864524>] do_mem_abort+0x50/0xa8
    [<ffffffd5d2a297dc>] el1_abort+0x3c/0x64
    [<ffffffd5d2a29718>] el1h_64_sync_handler+0x44/0xcc
    [<ffffffd5d181133c>] el1h_64_sync+0x80/0x88
    [<ffffffd5d255c1dc>] ufshcd_add_command_trace+0x23c/0x320
    [<ffffffd5d255bad8>] ufshcd_compl_one_cqe+0xa4/0x404
    [<ffffffd5d2572968>] ufshcd_mcq_poll_cqe_lock+0xac/0x104
    [<ffffffd5d11c7460>] ufs_mtk_mcq_intr+0x54/0x74 [ufs_mediatek_mod]
    [<ffffffd5d19ab92c>] __handle_irq_event_percpu+0xc8/0x348
    [<ffffffd5d19abca8>] handle_irq_event+0x3c/0xa8
    [<ffffffd5d19b1f0c>] handle_fasteoi_irq+0xf8/0x294
    [<ffffffd5d19aa778>] generic_handle_domain_irq+0x54/0x80
    [<ffffffd5d18102bc>] gic_handle_irq+0x1d4/0x330
    [<ffffffd5d1838210>] call_on_irq_stack+0x44/0x68
    [<ffffffd5d183af30>] do_interrupt_handler+0x78/0xd8
    [<ffffffd5d2a29c00>] el1_interrupt+0x48/0xa8
    [<ffffffd5d2a29ba8>] el1h_64_irq_handler+0x14/0x24
    [<ffffffd5d18113c4>] el1h_64_irq+0x80/0x88
    [<ffffffd5d2527fb4>] arch_local_irq_enable+0x4/0x1c
    [<ffffffd5d25282e4>] cpuidle_enter+0x34/0x54
    [<ffffffd5d195a678>] do_idle+0x1dc/0x2f8
    [<ffffffd5d195a7c4>] cpu_startup_entry+0x30/0x3c
    [<ffffffd5d18155c4>] secondary_start_kernel+0x134/0x1ac
    [<ffffffd5d18640bc>] __secondary_switched+0xc4/0xcc
    
    Signed-off-by: Peter Wang <[email protected]>
    Reviewed-by: Bart Van Assche <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: ufs: core: Fix SError in ufshcd_rtc_work() during UFS suspend [+ + +]

Author: Wang Shuaiwei <[email protected]>
Date:   Sat Mar 7 11:51:28 2026 +0800

    scsi: ufs: core: Fix SError in ufshcd_rtc_work() during UFS suspend
    
    [ Upstream commit b0bd84c39289ef6a6c3827dd52c875659291970a ]
    
    In __ufshcd_wl_suspend(), cancel_delayed_work_sync() is called to cancel
    the UFS RTC work, but it is placed after ufshcd_vops_suspend(hba, pm_op,
    POST_CHANGE). This creates a race condition where ufshcd_rtc_work() can
    still be running while ufshcd_vops_suspend() is executing. When
    UFSHCD_CAP_CLK_GATING is not supported, the condition
    !hba->clk_gating.active_reqs is always true, causing ufshcd_update_rtc()
    to be executed. Since ufshcd_vops_suspend() typically performs clock
    gating operations, executing ufshcd_update_rtc() at that moment triggers
    an SError. The kernel panic trace is as follows:
    
    Kernel panic - not syncing: Asynchronous SError Interrupt
    Call trace:
     dump_backtrace+0xec/0x128
     show_stack+0x18/0x28
     dump_stack_lvl+0x40/0xa0
     dump_stack+0x18/0x24
     panic+0x148/0x374
     nmi_panic+0x3c/0x8c
     arm64_serror_panic+0x64/0x8c
     do_serror+0xc4/0xc8
     el1h_64_error_handler+0x34/0x4c
     el1h_64_error+0x68/0x6c
     el1_interrupt+0x20/0x58
     el1h_64_irq_handler+0x18/0x24
     el1h_64_irq+0x68/0x6c
     ktime_get+0xc4/0x12c
     ufshcd_mcq_sq_stop+0x4c/0xec
     ufshcd_mcq_sq_cleanup+0x64/0x1dc
     ufshcd_clear_cmd+0x38/0x134
     ufshcd_issue_dev_cmd+0x298/0x4d0
     ufshcd_exec_dev_cmd+0x1a4/0x1c4
     ufshcd_query_attr+0xbc/0x19c
     ufshcd_rtc_work+0x10c/0x1c8
     process_scheduled_works+0x1c4/0x45c
     worker_thread+0x32c/0x3e8
     kthread+0x120/0x1d8
     ret_from_fork+0x10/0x20
    
    Fix this by moving cancel_delayed_work_sync() before the call to
    ufshcd_vops_suspend(hba, pm_op, PRE_CHANGE), ensuring the UFS RTC work is
    fully completed or cancelled at that point.
    
    Cc: Bean Huo <[email protected]>
    Fixes: 6bf999e0eb41 ("scsi: ufs: core: Add UFS RTC support")
    Reviewed-by: Bart Van Assche <[email protected]>
    Signed-off-by: Wang Shuaiwei <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: ufs: core: Fix shift out of bounds when MAXQ=32 [+ + +]

Author: wangshuaiwei <[email protected]>
Date:   Tue Feb 24 14:32:28 2026 +0800

    scsi: ufs: core: Fix shift out of bounds when MAXQ=32
    
    [ Upstream commit 2f38fd99c0004676d835ae96ac4f3b54edc02c82 ]
    
    According to JESD223F, the maximum number of queues (MAXQ) is 32. When MCQ
    is enabled and ESI is disabled, nr_hw_queues=32 causes a shift overflow
    problem.
    
    Fix this by using 64-bit intermediate values to handle the nr_hw_queues=32
    case safely.
    
    Signed-off-by: wangshuaiwei <[email protected]>
    Reviewed-by: Bart Van Assche <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

selftests/hid: fix compilation when bpf_wq and hid_device are not exported [+ + +]

Author: Benjamin Tissoires <[email protected]>
Date:   Fri Mar 13 08:40:24 2026 +0100

    selftests/hid: fix compilation when bpf_wq and hid_device are not exported
    
    commit 5d4c6c132ea9a967d48890dd03e6a786c060e968 upstream.
    
    This can happen in situations when CONFIG_HID_SUPPORT is set to no, or
    some complex situations where struct bpf_wq is not exported.
    
    So do the usual dance of hiding them before including vmlinux.h, and
    then redefining them and make use of CO-RE to have the correct offsets.
    
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Fixes: fe8d561db3e8 ("selftests/hid: add wq test for hid_bpf_input_report()")
    Cc: [email protected]
    Acked-by: Jiri Kosina <[email protected]>
    Reviewed-by: Thomas Weißschuh <[email protected]>
    Signed-off-by: Benjamin Tissoires <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

selftests: mptcp: add a check for 'add_addr_accepted' [+ + +]

Author: Gang Yan <[email protected]>
Date:   Mon Mar 9 12:15:52 2026 -0400

    selftests: mptcp: add a check for 'add_addr_accepted'
    
    [ Upstream commit 0eee0fdf9b7b0baf698f9b426384aa9714d76a51 ]
    
    The previous patch fixed an issue with the 'add_addr_accepted' counter.
    This was not spot by the test suite.
    
    Check this counter and 'add_addr_signal' in MPTCP Join 'delete re-add
    signal' test. This should help spotting similar regressions later on.
    These counters are crucial for ensuring the MPTCP path manager correctly
    handles the subflow creation via 'ADD_ADDR'.
    
    Signed-off-by: Gang Yan <[email protected]>
    Reviewed-by: Geliang Tang <[email protected]>
    Reviewed-by: Matthieu Baerts (NGI0) <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20251118-net-mptcp-misc-fixes-6-18-rc6-v1-11-806d3781c95f@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: 560edd99b5f5 ("selftests: mptcp: join: check RM_ADDR not sent over same subflow")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

selftests: mptcp: join: check RM_ADDR not sent over same subflow [+ + +]

Author: Matthieu Baerts (NGI0) <[email protected]>
Date:   Mon Mar 9 12:15:53 2026 -0400

    selftests: mptcp: join: check RM_ADDR not sent over same subflow
    
    [ Upstream commit 560edd99b5f58b2d4bbe3c8e51e1eed68d887b0e ]
    
    This validates the previous commit: RM_ADDR were sent over the first
    found active subflow which could be the same as the one being removed.
    It is more likely to loose this notification.
    
    For this check, RM_ADDR are explicitly dropped when trying to send them
    over the initial subflow, when removing the endpoint attached to it. If
    it is dropped, the test will complain because some RM_ADDR have not been
    received.
    
    Note that only the RM_ADDR are dropped, to allow the linked subflow to
    be quickly and cleanly closed. To only drop those RM_ADDR, a cBPF byte
    code is used. If the IPTables commands fail, that's OK, the tests will
    continue to pass, but not validate this part. This can be ignored:
    another subtest fully depends on such command, and will be marked as
    skipped.
    
    The 'Fixes' tag here below is the same as the one from the previous
    commit: this patch here is not fixing anything wrong in the selftests,
    but it validates the previous fix for an issue introduced by this commit
    ID.
    
    Fixes: 8dd5efb1f91b ("mptcp: send ack for rm_addr")
    Cc: [email protected]
    Reviewed-by: Mat Martineau <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20260303-net-mptcp-misc-fixes-7-0-rc2-v1-3-4b5462b6f016@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: 8250: Add late synchronize_irq() to shutdown to handle DW UART BUSY [+ + +]

Author: Ilpo Järvinen <[email protected]>
Date:   Tue Feb 3 19:10:48 2026 +0200

    serial: 8250: Add late synchronize_irq() to shutdown to handle DW UART BUSY
    
    commit e0a368ae79531ff92105a2692f10d83052055856 upstream.
    
    When DW UART is !uart_16550_compatible, it can indicate BUSY at any
    point (when under constant Rx pressure) unless a complex sequence of
    steps is performed. Any LCR write can run a foul with the condition
    that prevents writing LCR while the UART is BUSY, which triggers
    BUSY_DETECT interrupt that seems unmaskable using IER bits.
    
    Normal flow is that dw8250_handle_irq() handles BUSY_DETECT condition
    by reading USR register. This BUSY feature, however, breaks the
    assumptions made in serial8250_do_shutdown(), which runs
    synchronize_irq() after clearing IER and assumes no interrupts can
    occur after that point but then proceeds to update LCR, which on DW
    UART can trigger an interrupt.
    
    If serial8250_do_shutdown() releases the interrupt handler before the
    handler has run and processed the BUSY_DETECT condition by read the USR
    register, the IRQ is not deasserted resulting in interrupt storm that
    triggers "irq x: nobody cared" warning leading to disabling the IRQ.
    
    Add late synchronize_irq() into serial8250_do_shutdown() to ensure
    BUSY_DETECT from DW UART is handled before port's interrupt handler is
    released. Alternative would be to add DW UART specific shutdown
    function but it would mostly duplicate the generic code and the extra
    synchronize_irq() seems pretty harmless in serial8250_do_shutdown().
    
    Fixes: 7d4008ebb1c9 ("tty: add a DesignWare 8250 driver")
    Cc: stable <[email protected]>
    Reported-by: Bandal, Shankar <[email protected]>
    Tested-by: Bandal, Shankar <[email protected]>
    Tested-by: Murthy, Shanth <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: 8250: Fix TX deadlock when using DMA [+ + +]

Author: Raul E Rangel <[email protected]>
Date:   Mon Feb 9 13:58:18 2026 -0700

    serial: 8250: Fix TX deadlock when using DMA
    
    commit a424a34b8faddf97b5af41689087e7a230f79ba7 upstream.
    
    `dmaengine_terminate_async` does not guarantee that the
    `__dma_tx_complete` callback will run. The callback is currently the
    only place where `dma->tx_running` gets cleared. If the transaction is
    canceled and the callback never runs, then `dma->tx_running` will never
    get cleared and we will never schedule new TX DMA transactions again.
    
    This change makes it so we clear `dma->tx_running` after we terminate
    the DMA transaction. This is "safe" because `serial8250_tx_dma_flush`
    is holding the UART port lock. The first thing the callback does is also
    grab the UART port lock, so access to `dma->tx_running` is serialized.
    
    Fixes: 9e512eaaf8f4 ("serial: 8250: Fix fifo underflow on flush")
    Cc: stable <[email protected]>
    Signed-off-by: Raul E Rangel <[email protected]>
    Link: https://patch.msgid.link/20260209135815.1.I16366ecb0f62f3c96fe3dd5763fcf6f3c2b4d8cd@changeid
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: 8250_pci: add support for the AX99100 [+ + +]

Author: Martin Roukala (né Peres) <[email protected]>
Date:   Mon Mar 9 15:53:10 2026 +0200

    serial: 8250_pci: add support for the AX99100
    
    commit 9c0072bc33d349c83d223e64be30794e11938a6b upstream.
    
    This is found in popular brands such as StarTech.com or Delock, and has
    been a source of frustration to quite a few people, if I can trust
    Amazon comments complaining about Linux support via the official
    out-of-the-tree driver.
    
    Signed-off-by: Martin Roukala (né Peres) <[email protected]>
    Cc: stable <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: caif: hold tty->link reference in ldisc_open and ser_release [+ + +]

Author: Shuangpeng Bai <[email protected]>
Date:   Thu Mar 5 22:40:06 2026 -0500

    serial: caif: hold tty->link reference in ldisc_open and ser_release
    
    [ Upstream commit 288598d80a068a0e9281de35bcb4ce495f189e2a ]
    
    A reproducer triggers a KASAN slab-use-after-free in pty_write_room()
    when caif_serial's TX path calls tty_write_room(). The faulting access
    is on tty->link->port.
    
    Hold an extra kref on tty->link for the lifetime of the caif_serial line
    discipline: get it in ldisc_open() and drop it in ser_release(), and
    also drop it on the ldisc_open() error path.
    
    With this change applied, the reproducer no longer triggers the UAF in
    my testing.
    
    Link: https://gist.github.com/shuangpengbai/c898debad6bdf170a84be7e6b3d8707f
    Link: https://lore.kernel.org/netdev/[email protected]
    Fixes: e31d5a05948e ("caif: tty's are kref objects so take a reference")
    Signed-off-by: Shuangpeng Bai <[email protected]>
    Reviewed-by: Jiayuan Chen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

serial: uartlite: fix PM runtime usage count underflow on probe [+ + +]

Author: Maciej Andrzejewski ICEYE <[email protected]>
Date:   Thu Mar 5 13:37:51 2026 +0100

    serial: uartlite: fix PM runtime usage count underflow on probe
    
    commit d54801cd509515f674a5aac1d3ea1401d2a05863 upstream.
    
    ulite_probe() calls pm_runtime_put_autosuspend() at the end of probe
    without holding a corresponding PM runtime reference for non-console
    ports.
    
    During ulite_assign(), uart_add_one_port() triggers uart_configure_port()
    which calls ulite_pm() via uart_change_pm(). For non-console ports, the
    UART core performs a balanced get/put cycle:
    
      uart_change_pm(ON)  -> ulite_pm() -> pm_runtime_get_sync()        +1
      uart_change_pm(OFF) -> ulite_pm() -> pm_runtime_put_autosuspend() -1
    
    This leaves no spare reference for the pm_runtime_put_autosuspend() at
    the end of probe. The PM runtime core prevents the count from actually
    going below zero, and instead triggers a
    "Runtime PM usage count underflow!" warning.
    
    For console ports the bug is masked: the UART core skips the
    uart_change_pm(OFF) call, so the UART core's unbalanced get happens to
    pair with probe's trailing put.
    
    Add pm_runtime_get_noresume() before pm_runtime_enable() to take an
    explicit probe-owned reference that the trailing
    pm_runtime_put_autosuspend() can release. This ensures a correct usage
    count regardless of whether the port is a console.
    
    Fixes: 5bbe10a6942d ("tty: serial: uartlite: Add runtime pm support")
    Cc: stable <[email protected]>
    Signed-off-by: Maciej Andrzejewski ICEYE <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

smb/server: Fix another refcount leak in smb2_open() [+ + +]

Author: Guenter Roeck <[email protected]>
Date:   Thu Feb 26 21:54:21 2026 -0800

    smb/server: Fix another refcount leak in smb2_open()
    
    [ Upstream commit c15e7c62feb3751cbdd458555819df1d70374890 ]
    
    If ksmbd_override_fsids() fails, we jump to err_out2. At that point, fp is
    NULL because it hasn't been assigned dh_info.fp yet, so ksmbd_fd_put(work,
    fp) will not be called. However, dh_info.fp was already inserted into the
    session file table by ksmbd_reopen_durable_fd(), so it will leak in the
    session file table until the session is closed.
    
    Move fp = dh_info.fp; ahead of the ksmbd_override_fsids() check to fix the
    problem.
    
    Found by an experimental AI code review agent at Google.
    
    Fixes: c8efcc786146a ("ksmbd: add support for durable handles v1/v2")
    Signed-off-by: Guenter Roeck <[email protected]>
    Reviewed-by: ChenXiaoSong <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

smb: client: Compare MACs in constant time [+ + +]

Author: Eric Biggers <[email protected]>
Date:   Tue Mar 10 12:50:49 2026 -0700

    smb: client: Compare MACs in constant time
    
    commit 26bc83b88bbbf054f0980a4a42047a8d1e210e4c upstream.
    
    To prevent timing attacks, MAC comparisons need to be constant-time.
    Replace the memcmp() with the correct function, crypto_memneq().
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Acked-by: Paulo Alcantara (Red Hat) <[email protected]>
    Signed-off-by: Eric Biggers <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

smb: client: fix atomic open with O_DIRECT & O_SYNC [+ + +]

Author: Paulo Alcantara <[email protected]>
Date:   Sat Mar 7 18:20:16 2026 -0300

    smb: client: fix atomic open with O_DIRECT & O_SYNC
    
    commit 4a7d2729dc99437dbb880a64c47828c0d191b308 upstream.
    
    When user application requests O_DIRECT|O_SYNC along with O_CREAT on
    open(2), CREATE_NO_BUFFER and CREATE_WRITE_THROUGH bits were missed in
    CREATE request when performing an atomic open, thus leading to
    potentially data integrity issues.
    
    Fix this by setting those missing bits in CREATE request when
    O_DIRECT|O_SYNC has been specified in cifs_do_create().
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Signed-off-by: Paulo Alcantara (Red Hat) <[email protected]>
    Reviewed-by: David Howells <[email protected]>
    Acked-by: Henrique Carvalho <[email protected]>
    Cc: Tom Talpey <[email protected]>
    Cc: [email protected]
    Cc: [email protected]
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

smb: client: fix iface port assignment in parse_server_interfaces [+ + +]

Author: Henrique Carvalho <[email protected]>
Date:   Wed Mar 11 20:17:23 2026 -0300

    smb: client: fix iface port assignment in parse_server_interfaces
    
    commit d4c7210d2f3ea481a6481f03040a64d9077a6172 upstream.
    
    parse_server_interfaces() initializes interface socket addresses with
    CIFS_PORT. When the mount uses a non-default port this overwrites the
    configured destination port.
    
    Later, cifs_chan_update_iface() copies this sockaddr into server->dstaddr,
    causing reconnect attempts to use the wrong port after server interface
    updates.
    
    Use the existing port from server->dstaddr instead.
    
    Cc: [email protected]
    Fixes: fe856be475f7 ("CIFS: parse and store info on iface queries")
    Tested-by: Dr. Thomas Orgis <[email protected]>
    Reviewed-by: Enzo Matsumiya <[email protected]>
    Signed-off-by: Henrique Carvalho <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

smb: client: fix in-place encryption corruption in SMB2_write() [+ + +]

Author: Bharath SM <[email protected]>
Date:   Mon Mar 9 16:00:49 2026 +0530

    smb: client: fix in-place encryption corruption in SMB2_write()
    
    commit d78840a6a38d312dc1a51a65317bb67e46f0b929 upstream.
    
    SMB2_write() places write payload in iov[1..n] as part of rq_iov.
    smb3_init_transform_rq() pointer-shares rq_iov, so crypt_message()
    encrypts iov[1] in-place, replacing the original plaintext with
    ciphertext. On a replayable error, the retry sends the same iov[1]
    which now contains ciphertext instead of the original data,
    resulting in corruption.
    
    The corruption is most likely to be observed when connections are
    unstable, as reconnects trigger write retries that re-send the
    already-encrypted data.
    
    This affects SFU mknod, MF symlinks, etc. On kernels before
    6.10 (prior to the netfs conversion), sync writes also used
    this path and were similarly affected. The async write path
    wasn't unaffected as it uses rq_iter which gets deep-copied.
    
    Fix by moving the write payload into rq_iter via iov_iter_kvec(),
    so smb3_init_transform_rq() deep-copies it before encryption.
    
    Cc: [email protected] #6.3+
    Acked-by: Henrique Carvalho <[email protected]>
    Acked-by: Shyam Prasad N <[email protected]>
    Acked-by: Paulo Alcantara (Red Hat) <[email protected]>
    Signed-off-by: Bharath SM <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

smb: client: fix krb5 mount with username option [+ + +]

Author: Paulo Alcantara <[email protected]>
Date:   Fri Mar 13 00:03:38 2026 -0300

    smb: client: fix krb5 mount with username option
    
    commit 12b4c5d98cd7ca46d5035a57bcd995df614c14e1 upstream.
    
    Customer reported that some of their krb5 mounts were failing against
    a single server as the client was trying to mount the shares with
    wrong credentials.  It turned out the client was reusing SMB session
    from first mount to try mounting the other shares, even though a
    different username= option had been specified to the other mounts.
    
    By using username mount option along with sec=krb5 to search for
    principals from keytab is supported by cifs.upcall(8) since
    cifs-utils-4.8.  So fix this by matching username mount option in
    match_session() even with Kerberos.
    
    For example, the second mount below should fail with -ENOKEY as there
    is no 'foobar' principal in keytab (/etc/krb5.keytab).  The client
    ends up reusing SMB session from first mount to perform the second
    one, which is wrong.
    
    ```
    $ ktutil
    ktutil:  add_entry -password -p testuser -k 1 -e aes256-cts
    Password for [email protected]:
    ktutil:  write_kt /etc/krb5.keytab
    ktutil:  quit
    $ klist -ke
    Keytab name: FILE:/etc/krb5.keytab
    KVNO Principal
     ---- ----------------------------------------------------------------
       1 [email protected] (aes256-cts-hmac-sha1-96)
    $ mount.cifs //w22-root2/scratch /mnt/1 -o sec=krb5,username=testuser
    $ mount.cifs //w22-root2/scratch /mnt/2 -o sec=krb5,username=foobar
    $ mount -t cifs | grep -Po 'username=\K\w+'
    testuser
    testuser
    ```
    
    Reported-by: Oscar Santos <[email protected]>
    Signed-off-by: Paulo Alcantara (Red Hat) <[email protected]>
    Cc: David Howells <[email protected]>
    Cc: [email protected]
    Cc: [email protected]
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

smb: server: fix use-after-free in smb2_open() [+ + +]

Author: Marios Makassikis <[email protected]>
Date:   Tue Mar 3 11:14:32 2026 +0100

    smb: server: fix use-after-free in smb2_open()
    
    commit 1e689a56173827669a35da7cb2a3c78ed5c53680 upstream.
    
    The opinfo pointer obtained via rcu_dereference(fp->f_opinfo) is
    dereferenced after rcu_read_unlock(), creating a use-after-free
    window.
    
    Cc: [email protected]
    Signed-off-by: Marios Makassikis <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

soc: fsl: cpm1: qmc: Fix error check for devm_ioremap_resource() in qmc_qe_init_resources() [+ + +]

Author: Chen Ni <[email protected]>
Date:   Mon Feb 9 09:59:04 2026 +0800

    soc: fsl: cpm1: qmc: Fix error check for devm_ioremap_resource() in qmc_qe_init_resources()
    
    [ Upstream commit 3f4e403304186d79fddace860360540fc3af97f9 ]
    
    Fix wrong variable used for error checking after devm_ioremap_resource()
    call. The function checks qmc->scc_pram instead of qmc->dpram, which
    could lead to incorrect error handling.
    
    Fixes: eb680d563089 ("soc: fsl: cpm1: qmc: Add support for QUICC Engine (QE) implementation")
    Signed-off-by: Chen Ni <[email protected]>
    Acked-by: Herve Codina <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Christophe Leroy (CS GROUP) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

soc: fsl: qbman: fix race condition in qman_destroy_fq [+ + +]

Author: Richard Genoud <[email protected]>
Date:   Tue Dec 23 08:25:49 2025 +0100

    soc: fsl: qbman: fix race condition in qman_destroy_fq
    
    [ Upstream commit 014077044e874e270ec480515edbc1cadb976cf2 ]
    
    When QMAN_FQ_FLAG_DYNAMIC_FQID is set, there's a race condition between
    fq_table[fq->idx] state and freeing/allocating from the pool and
    WARN_ON(fq_table[fq->idx]) in qman_create_fq() gets triggered.
    
    Indeed, we can have:
             Thread A                             Thread B
        qman_destroy_fq()                    qman_create_fq()
          qman_release_fqid()
            qman_shutdown_fq()
            gen_pool_free()
               -- At this point, the fqid is available again --
                                               qman_alloc_fqid()
               -- so, we can get the just-freed fqid in thread B --
                                               fq->fqid = fqid;
                                               fq->idx = fqid * 2;
                                               WARN_ON(fq_table[fq->idx]);
                                               fq_table[fq->idx] = fq;
         fq_table[fq->idx] = NULL;
    
    And adding some logs between qman_release_fqid() and
    fq_table[fq->idx] = NULL makes the WARN_ON() trigger a lot more.
    
    To prevent that, ensure that fq_table[fq->idx] is set to NULL before
    gen_pool_free() is called by using smp_wmb().
    
    Fixes: c535e923bb97 ("soc/fsl: Introduce DPAA 1.x QMan device driver")
    Signed-off-by: Richard Genoud <[email protected]>
    Tested-by: CHAMPSEIX Thomas <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Christophe Leroy (CS GROUP) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

soc: microchip: mpfs: Fix memory leak in mpfs_sys_controller_probe() [+ + +]

Author: Zilin Guan <[email protected]>
Date:   Sun Dec 28 12:48:36 2025 +0000

    soc: microchip: mpfs: Fix memory leak in mpfs_sys_controller_probe()
    
    [ Upstream commit 5a741f8cc6fe62542f955cd8d24933a1b6589cbd ]
    
    In mpfs_sys_controller_probe(), if of_get_mtd_device_by_node() fails,
    the function returns immediately without freeing the allocated memory
    for sys_controller, leading to a memory leak.
    
    Fix this by jumping to the out_free label to ensure the memory is
    properly freed.
    
    Also, consolidate the error handling for the mbox_request_channel()
    failure case to use the same label.
    
    Fixes: 742aa6c563d2 ("soc: microchip: mpfs: enable access to the system controller's flash")
    Co-developed-by: Jianhao Xu <[email protected]>
    Signed-off-by: Jianhao Xu <[email protected]>
    Signed-off-by: Zilin Guan <[email protected]>
    Signed-off-by: Conor Dooley <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

soc: rockchip: grf: Add missing of_node_put() when returning [+ + +]

Author: Shawn Lin <[email protected]>
Date:   Wed Feb 11 21:02:37 2026 +0800

    soc: rockchip: grf: Add missing of_node_put() when returning
    
    [ Upstream commit 24ed11ee5bacf9a9aca18fc6b47667c7f38d578b ]
    
    Fix the smatch checking:
    drivers/soc/rockchip/grf.c:249 rockchip_grf_init()
    warn: inconsistent refcounting 'np->kobj.kref.refcount.refs.counter':
    
    Reported-by: Dan Carpenter <[email protected]>
    Fixes: 75fb63ae0312 ("soc: rockchip: grf: Support multiple grf to be handled")
    Closes: https://lore.kernel.org/all/[email protected]/
    Signed-off-by: Shawn Lin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Heiko Stuebner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

spi: cadence-quadspi: Implement refcount to handle unbind during busy [+ + +]

Author: Khairul Anuar Romli <[email protected]>
Date:   Wed Feb 25 16:02:39 2026 +0800

    spi: cadence-quadspi: Implement refcount to handle unbind during busy
    
    [ Upstream commit 7446284023e8ef694fb392348185349c773eefb3 ]
    
    driver support indirect read and indirect write operation with
    assumption no force device removal(unbind) operation. However
    force device removal(removal) is still available to root superuser.
    
    Unbinding driver during operation causes kernel crash. This changes
    ensure driver able to handle such operation for indirect read and
    indirect write by implementing refcount to track attached devices
    to the controller and gracefully wait and until attached devices
    remove operation completed before proceed with removal operation.
    
    Signed-off-by: Khairul Anuar Romli <[email protected]>
    Reviewed-by: Matthew Gerlach <[email protected]>
    Reviewed-by: Niravkumar L Rabara <[email protected]>
    Link: https://patch.msgid.link/8704fd6bd2ff4d37bba4a0eacf5eba3ba001079e.1756168074.git.khairul.anuar.romli@altera.com
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Robert Garcia <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

spi: fix statistics allocation [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Mar 12 16:18:14 2026 +0100

    spi: fix statistics allocation
    
    commit dee0774bbb2abb172e9069ce5ffef579b12b3ae9 upstream.
    
    The controller per-cpu statistics is not allocated until after the
    controller has been registered with driver core, which leaves a window
    where accessing the sysfs attributes can trigger a NULL-pointer
    dereference.
    
    Fix this by moving the statistics allocation to controller allocation
    while tying its lifetime to that of the controller (rather than using
    implicit devres).
    
    Fixes: 6598b91b5ac3 ("spi: spi.c: Convert statistics to per-cpu u64_stats_t")
    Cc: [email protected]      # 6.0
    Cc: David Jander <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

spi: fix use-after-free on controller registration failure [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Mar 12 16:18:13 2026 +0100

    spi: fix use-after-free on controller registration failure
    
    commit 8634e05b08ead636e926022f4a98416e13440df9 upstream.
    
    Make sure to deregister from driver core also in the unlikely event that
    per-cpu statistics allocation fails during controller registration to
    avoid use-after-free (of driver resources) and unclocked register
    accesses.
    
    Fixes: 6598b91b5ac3 ("spi: spi.c: Convert statistics to per-cpu u64_stats_t")
    Cc: [email protected]      # 6.0
    Cc: David Jander <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

staging: rtl8723bs: fix potential out-of-bounds read in rtw_restruct_wmm_ie [+ + +]

Author: Luka Gejak <[email protected]>
Date:   Tue Feb 24 14:26:47 2026 +0100

    staging: rtl8723bs: fix potential out-of-bounds read in rtw_restruct_wmm_ie
    
    commit a75281626fc8fa6dc6c9cc314ee423e8bc45203b upstream.
    
    The current code checks 'i + 5 < in_len' at the end of the if statement.
    However, it accesses 'in_ie[i + 5]' before that check, which can lead
    to an out-of-bounds read. Move the length check to the beginning of the
    conditional to ensure the index is within bounds before accessing the
    array.
    
    Fixes: 554c0a3abf21 ("staging: Add rtl8723bs sdio wifi driver")
    Cc: stable <[email protected]>
    Signed-off-by: Luka Gejak <[email protected]>
    Reviewed-by: Dan Carpenter <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

staging: rtl8723bs: properly validate the data in rtw_get_ie_ex() [+ + +]

Author: Greg Kroah-Hartman <[email protected]>
Date:   Mon Feb 23 14:31:35 2026 +0100

    staging: rtl8723bs: properly validate the data in rtw_get_ie_ex()
    
    commit f0109b9d3e1e455429279d602f6276e34689750a upstream.
    
    Just like in commit 154828bf9559 ("staging: rtl8723bs: fix out-of-bounds
    read in rtw_get_ie() parser"), we don't trust the data in the frame so
    we should check the length better before acting on it
    
    Cc: stable <[email protected]>
    Assisted-by: gkh_clanker_2000
    Tested-by: Navaneeth K <[email protected]>
    Reviewed-by: Navaneeth K <[email protected]>
    Link: https://patch.msgid.link/2026022336-arrange-footwork-6e54@gregkh
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sunrpc: fix cache_request leak in cache_release [+ + +]

Author: Jeff Layton <[email protected]>
Date:   Mon Feb 23 12:09:58 2026 -0500

    sunrpc: fix cache_request leak in cache_release
    
    commit 17ad31b3a43b72aec3a3d83605891e1397d0d065 upstream.
    
    When a reader's file descriptor is closed while in the middle of reading
    a cache_request (rp->offset != 0), cache_release() decrements the
    request's readers count but never checks whether it should free the
    request.
    
    In cache_read(), when readers drops to 0 and CACHE_PENDING is clear, the
    cache_request is removed from the queue and freed along with its buffer
    and cache_head reference. cache_release() lacks this cleanup.
    
    The only other path that frees requests with readers == 0 is
    cache_dequeue(), but it runs only when CACHE_PENDING transitions from
    set to clear. If that transition already happened while readers was
    still non-zero, cache_dequeue() will have skipped the request, and no
    subsequent call will clean it up.
    
    Add the same cleanup logic from cache_read() to cache_release(): after
    decrementing readers, check if it reached 0 with CACHE_PENDING clear,
    and if so, dequeue and free the cache_request.
    
    Reported-by: NeilBrown <[email protected]>
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Signed-off-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

time/jiffies: Mark jiffies_64_to_clock_t() notrace [+ + +]

Author: Steven Rostedt <[email protected]>
Date:   Fri Mar 6 21:24:03 2026 -0500

    time/jiffies: Mark jiffies_64_to_clock_t() notrace
    
    [ Upstream commit 755a648e78f12574482d4698d877375793867fa1 ]
    
    The trace_clock_jiffies() function that handles the "uptime" clock for
    tracing calls jiffies_64_to_clock_t(). This causes the function tracer to
    constantly recurse when the tracing clock is set to "uptime". Mark it
    notrace to prevent unnecessary recursion when using the "uptime" clock.
    
    Fixes: 58d4e21e50ff3 ("tracing: Fix wraparound problems in "uptime" trace clock")
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://patch.msgid.link/20260306212403.72270bb2@robin
    Signed-off-by: Sasha Levin <[email protected]>

tipc: fix divide-by-zero in tipc_sk_filter_connect() [+ + +]

Author: Mehul Rao <[email protected]>
Date:   Tue Mar 10 13:07:30 2026 -0400

    tipc: fix divide-by-zero in tipc_sk_filter_connect()
    
    commit 6c5a9baa15de240e747263aba435a0951da8d8d2 upstream.
    
    A user can set conn_timeout to any value via
    setsockopt(TIPC_CONN_TIMEOUT), including values less than 4.  When a
    SYN is rejected with TIPC_ERR_OVERLOAD and the retry path in
    tipc_sk_filter_connect() executes:
    
        delay %= (tsk->conn_timeout / 4);
    
    If conn_timeout is in the range [0, 3], the integer division yields 0,
    and the modulo operation triggers a divide-by-zero exception, causing a
    kernel oops/panic.
    
    Fix this by clamping conn_timeout to a minimum of 4 at the point of use
    in tipc_sk_filter_connect().
    
    Oops: divide error: 0000 [#1] SMP KASAN NOPTI
    CPU: 0 UID: 0 PID: 119 Comm: poc-F144 Not tainted 7.0.0-rc2+
    RIP: 0010:tipc_sk_filter_rcv (net/tipc/socket.c:2236 net/tipc/socket.c:2362)
    Call Trace:
     tipc_sk_backlog_rcv (include/linux/instrumented.h:82 include/linux/atomic/atomic-instrumented.h:32 include/net/sock.h:2357 net/tipc/socket.c:2406)
     __release_sock (include/net/sock.h:1185 net/core/sock.c:3213)
     release_sock (net/core/sock.c:3797)
     tipc_connect (net/tipc/socket.c:2570)
     __sys_connect (include/linux/file.h:62 include/linux/file.h:83 net/socket.c:2098)
    
    Fixes: 6787927475e5 ("tipc: buffer overflow handling in listener socket")
    Cc: [email protected]
    Signed-off-by: Mehul Rao <[email protected]>
    Reviewed-by: Tung Nguyen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tools/bootconfig: fix fd leak in load_xbc_file() on fstat failure [+ + +]

Author: Josh Law <[email protected]>
Date:   Thu Mar 19 08:43:06 2026 +0900

    tools/bootconfig: fix fd leak in load_xbc_file() on fstat failure
    
    [ Upstream commit 3b2c2ab4ceb82af484310c3087541eab00ea288b ]
    
    If fstat() fails after open() succeeds, the function returns without
    closing the file descriptor. Also preserve errno across close(), since
    close() may overwrite it before the error is returned.
    
    Link: https://lore.kernel.org/all/[email protected]/
    
    Fixes: 950313ebf79c ("tools: bootconfig: Add bootconfig command")
    Signed-off-by: Josh Law <[email protected]>
    Signed-off-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

tracing: Add recursion protection in kernel stack trace recording [+ + +]

Author: Steven Rostedt <[email protected]>
Date:   Sat Feb 28 10:51:24 2026 +0800

    tracing: Add recursion protection in kernel stack trace recording
    
    [ Upstream commit 5f1ef0dfcb5b7f4a91a9b0e0ba533efd9f7e2cdb ]
    
    A bug was reported about an infinite recursion caused by tracing the rcu
    events with the kernel stack trace trigger enabled. The stack trace code
    called back into RCU which then called the stack trace again.
    
    Expand the ftrace recursion protection to add a set of bits to protect
    events from recursion. Each bit represents the context that the event is
    in (normal, softirq, interrupt and NMI).
    
    Have the stack trace code use the interrupt context to protect against
    recursion.
    
    Note, the bug showed an issue in both the RCU code as well as the tracing
    stacktrace code. This only handles the tracing stack trace side of the
    bug. The RCU fix will be handled separately.
    
    Link: https://lore.kernel.org/all/[email protected]/
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Cc: Joel Fernandes <[email protected]>
    Cc: "Paul E. McKenney" <[email protected]>
    Cc: Boqun Feng <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reported-by: Yao Kai <[email protected]>
    Tested-by: Yao Kai <[email protected]>
    Fixes: 5f5fa7ea89dc ("rcu: Don't use negative nesting depth in __rcu_read_unlock()")
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Leon Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Fix enabling multiple events on the kernel command line and bootconfig [+ + +]

Author: Andrei-Alexandru Tachici <[email protected]>
Date:   Mon Mar 2 11:27:34 2026 +0100

    tracing: Fix enabling multiple events on the kernel command line and bootconfig
    
    commit 3b1679e086bb869ca02722f6bd29b3573a6a0e7e upstream.
    
    Multiple events can be enabled on the kernel command line via a comma
    separator. But if the are specified one at a time, then only the last
    event is enabled. This is because the event names are saved in a temporary
    buffer, and each call by the init cmdline code will reset that buffer.
    
    This also affects names in the boot config file, as it may call the
    callback multiple times with an example of:
    
      kernel.trace_event = ":mod:rproc_qcom_common", ":mod:qrtr", ":mod:qcom_aoss"
    
    Change the cmdline callback function to append a comma and the next value
    if the temporary buffer already has content.
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Link: https://patch.msgid.link/20260302-trace-events-allow-multiple-modules-v1-1-ce4436e37fb8@oss.qualcomm.com
    Signed-off-by: Andrei-Alexandru Tachici <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Fix syscall events activation by ensuring refcount hits zero [+ + +]

Author: Huiwen He <[email protected]>
Date:   Tue Feb 24 10:35:44 2026 +0800

    tracing: Fix syscall events activation by ensuring refcount hits zero
    
    commit 0a663b764dbdf135a126284f454c9f01f95a87d4 upstream.
    
    When multiple syscall events are specified in the kernel command line
    (e.g., trace_event=syscalls:sys_enter_openat,syscalls:sys_enter_close),
    they are often not captured after boot, even though they appear enabled
    in the tracing/set_event file.
    
    The issue stems from how syscall events are initialized. Syscall
    tracepoints require the global reference count (sys_tracepoint_refcount)
    to transition from 0 to 1 to trigger the registration of the syscall
    work (TIF_SYSCALL_TRACEPOINT) for tasks, including the init process (pid 1).
    
    The current implementation of early_enable_events() with disable_first=true
    used an interleaved sequence of "Disable A -> Enable A -> Disable B -> Enable B".
    If multiple syscalls are enabled, the refcount never drops to zero,
    preventing the 0->1 transition that triggers actual registration.
    
    Fix this by splitting early_enable_events() into two distinct phases:
    1. Disable all events specified in the buffer.
    2. Enable all events specified in the buffer.
    
    This ensures the refcount hits zero before re-enabling, allowing syscall
    events to be properly activated during early boot.
    
    The code is also refactored to use a helper function to avoid logic
    duplication between the disable and enable phases.
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Fixes: ce1039bd3a89 ("tracing: Fix enabling of syscall events on the command line")
    Signed-off-by: Huiwen He <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Fix trace_buf_size= cmdline parameter with sizes >= 2G [+ + +]

Author: Calvin Owens <[email protected]>
Date:   Fri Mar 6 19:19:25 2026 -0800

    tracing: Fix trace_buf_size= cmdline parameter with sizes >= 2G
    
    commit d008ba8be8984760e36d7dcd4adbd5a41a645708 upstream.
    
    Some of the sizing logic through tracer_alloc_buffers() uses int
    internally, causing unexpected behavior if the user passes a value that
    does not fit in an int (on my x86 machine, the result is uselessly tiny
    buffers).
    
    Fix by plumbing the parameter's real type (unsigned long) through to the
    ring buffer allocation functions, which already use unsigned long.
    
    It has always been possible to create larger ring buffers via the sysfs
    interface: this only affects the cmdline parameter.
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Link: https://patch.msgid.link/bff42a4288aada08bdf74da3f5b67a2c28b761f8.1772852067.git.calvin@wbinvd.org
    Fixes: 73c5162aa362 ("tracing: keep ring buffer to minimum size till used")
    Signed-off-by: Calvin Owens <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

udp_tunnel: fix NULL deref caused by udp_sock_create6 when CONFIG_IPV6=n [+ + +]

Author: Xiang Mei <[email protected]>
Date:   Mon Mar 16 18:02:41 2026 -0700

    udp_tunnel: fix NULL deref caused by udp_sock_create6 when CONFIG_IPV6=n
    
    [ Upstream commit b3a6df291fecf5f8a308953b65ca72b7fc9e015d ]
    
    When CONFIG_IPV6 is disabled, the udp_sock_create6() function returns 0
    (success) without actually creating a socket. Callers such as
    fou_create() then proceed to dereference the uninitialized socket
    pointer, resulting in a NULL pointer dereference.
    
    The captured NULL deref crash:
      BUG: kernel NULL pointer dereference, address: 0000000000000018
      RIP: 0010:fou_nl_add_doit (net/ipv4/fou_core.c:590 net/ipv4/fou_core.c:764)
      [...]
      Call Trace:
        <TASK>
        genl_family_rcv_msg_doit.constprop.0 (net/netlink/genetlink.c:1114)
        genl_rcv_msg (net/netlink/genetlink.c:1194 net/netlink/genetlink.c:1209)
        [...]
        netlink_rcv_skb (net/netlink/af_netlink.c:2550)
        genl_rcv (net/netlink/genetlink.c:1219)
        netlink_unicast (net/netlink/af_netlink.c:1319 net/netlink/af_netlink.c:1344)
        netlink_sendmsg (net/netlink/af_netlink.c:1894)
        __sock_sendmsg (net/socket.c:727 (discriminator 1) net/socket.c:742 (discriminator 1))
        __sys_sendto (./include/linux/file.h:62 (discriminator 1) ./include/linux/file.h:83 (discriminator 1) net/socket.c:2183 (discriminator 1))
        __x64_sys_sendto (net/socket.c:2213 (discriminator 1) net/socket.c:2209 (discriminator 1) net/socket.c:2209 (discriminator 1))
        do_syscall_64 (arch/x86/entry/syscall_64.c:63 (discriminator 1) arch/x86/entry/syscall_64.c:94 (discriminator 1))
        entry_SYSCALL_64_after_hwframe (net/arch/x86/entry/entry_64.S:130)
    
    This patch makes udp_sock_create6 return -EPFNOSUPPORT instead, so
    callers correctly take their error paths. There is only one caller of
    the vulnerable function and only privileged users can trigger it.
    
    Fixes: fd384412e199b ("udp_tunnel: Seperate ipv6 functions into its own file.")
    Reported-by: Weiming Shi <[email protected]>
    Signed-off-by: Xiang Mei <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

unshare: fix unshare_fs() handling [+ + +]

Author: Al Viro <[email protected]>
Date:   Sat Feb 7 08:25:24 2026 +0000

    unshare: fix unshare_fs() handling
    
    [ Upstream commit 6c4b2243cb6c0755159bd567130d5e12e7b10d9f ]
    
    There's an unpleasant corner case in unshare(2), when we have a
    CLONE_NEWNS in flags and current->fs hadn't been shared at all; in that
    case copy_mnt_ns() gets passed current->fs instead of a private copy,
    which causes interesting warts in proof of correctness]
    
    > I guess if private means fs->users == 1, the condition could still be true.
    
    Unfortunately, it's worse than just a convoluted proof of correctness.
    Consider the case when we have CLONE_NEWCGROUP in addition to CLONE_NEWNS
    (and current->fs->users == 1).
    
    We pass current->fs to copy_mnt_ns(), all right.  Suppose it succeeds and
    flips current->fs->{pwd,root} to corresponding locations in the new namespace.
    Now we proceed to copy_cgroup_ns(), which fails (e.g. with -ENOMEM).
    We call put_mnt_ns() on the namespace created by copy_mnt_ns(), it's
    destroyed and its mount tree is dissolved, but...  current->fs->root and
    current->fs->pwd are both left pointing to now detached mounts.
    
    They are pinning those, so it's not a UAF, but it leaves the calling
    process with unshare(2) failing with -ENOMEM _and_ leaving it with
    pwd and root on detached isolated mounts.  The last part is clearly a bug.
    
    There is other fun related to that mess (races with pivot_root(), including
    the one between pivot_root() and fork(), of all things), but this one
    is easy to isolate and fix - treat CLONE_NEWNS as "allocate a new
    fs_struct even if it hadn't been shared in the first place".  Sure, we could
    go for something like "if both CLONE_NEWNS *and* one of the things that might
    end up failing after copy_mnt_ns() call in create_new_namespaces() are set,
    force allocation of new fs_struct", but let's keep it simple - the cost
    of copy_fs_struct() is trivial.
    
    Another benefit is that copy_mnt_ns() with CLONE_NEWNS *always* gets
    a freshly allocated fs_struct, yet to be attached to anything.  That
    seriously simplifies the analysis...
    
    FWIW, that bug had been there since the introduction of unshare(2) ;-/
    
    Signed-off-by: Al Viro <[email protected]>
    Link: https://patch.msgid.link/20260207082524.GE3183987@ZenIV
    Tested-by: Waiman Long <[email protected]>
    Signed-off-by: Christian Brauner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usb/core/quirks: Add Huawei ME906S-device to wakeup quirk [+ + +]

Author: Christoffer Sandberg <[email protected]>
Date:   Fri Mar 6 18:28:14 2026 +0100

    usb/core/quirks: Add Huawei ME906S-device to wakeup quirk
    
    commit 0326ff28d56b4fa202de36ffc8462a354f383a64 upstream.
    
    Similar to other Huawei LTE modules using this quirk, this version with
    another vid/pid suffers from spurious wakeups.
    
    Setting the quirk fixes the issue for this device as well.
    
    Cc: stable <[email protected]>
    Signed-off-by: Christoffer Sandberg <[email protected]>
    Signed-off-by: Werner Sembach <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

USB: add QUIRK_NO_BOS for video capture several devices [+ + +]

Author: A1RM4X <[email protected]>
Date:   Wed Feb 4 14:26:48 2026 -0500

    USB: add QUIRK_NO_BOS for video capture several devices
    
    commit 93cd0d664661f58f7e7bed7373714ab2ace41734 upstream.
    
    Several USB capture devices also need the USB_QUIRK_NO_BOS set for them
    to work properly, odds are they are all the same chip inside, just
    different vendor/product ids.
    
    This fixes up:
      - ASUS TUF 4K PRO
      - Avermedia Live Gamer Ultra 2.1 (GC553G2)
      - UGREEN 35871
    to now run at full speed (10 Gbps/4K 60 fps mode.)
    
    Link: https://lore.kernel.org/r/CACy+XB-f-51xGpNQFCSm5pE_momTQLu=BaZggHYU1DiDmFX=ug@mail.gmail.com
    Cc: stable <[email protected]>
    Signed-off-by: A1RM4X <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: cdc-acm: Restore CAP_BRK functionnality to CH343 [+ + +]

Author: Marc Zyngier <[email protected]>
Date:   Sun Mar 1 12:44:40 2026 +0000

    usb: cdc-acm: Restore CAP_BRK functionnality to CH343
    
    commit 14ae24cba291bddfdc296bbcbfd00cd09d0498ef upstream.
    
    The CH343 USB/serial adapter is as buggy as it is popular (very).
    One of its quirks is that despite being capable of signalling a
    BREAK condition, it doesn't advertise it.
    
    This used to work nonetheless until 66aad7d8d3ec5 ("usb: cdc-acm:
    return correct error code on unsupported break") applied some
    reasonable restrictions, preventing breaks from being emitted on
    devices that do not advertise CAP_BRK.
    
    Add a quirk for this particular device, so that breaks can still
    be produced on some of my machines attached to my console server.
    
    Fixes: 66aad7d8d3ec5 ("usb: cdc-acm: return correct error code on unsupported break")
    Signed-off-by: Marc Zyngier <[email protected]>
    Cc: stable <[email protected]>
    Cc: Oliver Neukum <[email protected]>
    Cc: Greg Kroah-Hartman <[email protected]>
    Acked-by: Oliver Neukum <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: class: cdc-wdm: fix reordering issue in read code path [+ + +]

Author: Oliver Neukum <[email protected]>
Date:   Wed Mar 4 14:01:12 2026 +0100

    usb: class: cdc-wdm: fix reordering issue in read code path
    
    commit 8df672bfe3ec2268c2636584202755898e547173 upstream.
    
    Quoting the bug report:
    
    Due to compiler optimization or CPU out-of-order execution, the
    desc->length update can be reordered before the memmove. If this
    happens, wdm_read() can see the new length and call copy_to_user() on
    uninitialized memory. This also violates LKMM data race rules [1].
    
    Fix it by using WRITE_ONCE and memory barriers.
    
    Fixes: afba937e540c9 ("USB: CDC WDM driver")
    Cc: stable <[email protected]>
    Signed-off-by: Oliver Neukum <[email protected]>
    Closes: https://lore.kernel.org/linux-usb/CALbr=LbrUZn_cfp7CfR-7Z5wDTHF96qeuM=3fO2m-q4cDrnC4A@mail.gmail.com/
    Reported-by: Gui-Dong Han <[email protected]>
    Reviewed-by: Gui-Dong Han <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: core: don't power off roothub PHYs if phy_set_mode() fails [+ + +]

Author: Gabor Juhos <[email protected]>
Date:   Wed Feb 18 21:21:07 2026 +0100

    usb: core: don't power off roothub PHYs if phy_set_mode() fails
    
    commit e293015ba76eb96ce4ebed7e3b2cb1a7d319f3e9 upstream.
    
    Remove the error path from the usb_phy_roothub_set_mode() function.
    The code is clearly wrong, because phy_set_mode() calls can't be
    balanced with phy_power_off() calls.
    
    Additionally, the usb_phy_roothub_set_mode() function is called only
    from usb_add_hcd() before it powers on the PHYs, so powering off those
    makes no sense anyway.
    
    Presumably, the code is copy-pasted from the phy_power_on() function
    without adjusting the error handling.
    
    Cc: [email protected] # v5.1+
    Fixes: b97a31348379 ("usb: core: comply to PHY framework")
    Signed-off-by: Gabor Juhos <[email protected]>
    Reviewed-by: Miquel Raynal <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

USB: core: Limit the length of unkillable synchronous timeouts [+ + +]

Author: Alan Stern <[email protected]>
Date:   Tue Feb 17 22:10:32 2026 -0500

    USB: core: Limit the length of unkillable synchronous timeouts
    
    commit 1015c27a5e1a63efae2b18a9901494474b4d1dc3 upstream.
    
    The usb_control_msg(), usb_bulk_msg(), and usb_interrupt_msg() APIs in
    usbcore allow unlimited timeout durations.  And since they use
    uninterruptible waits, this leaves open the possibility of hanging a
    task for an indefinitely long time, with no way to kill it short of
    unplugging the target device.
    
    To prevent this sort of problem, enforce a maximum limit on the length
    of these unkillable timeouts.  The limit chosen here, somewhat
    arbitrarily, is 60 seconds.  On many systems (although not all) this
    is short enough to avoid triggering the kernel's hung-task detector.
    
    In addition, clear up the ambiguity of negative timeout values by
    treating them the same as 0, i.e., using the maximum allowed timeout.
    
    Signed-off-by: Alan Stern <[email protected]>
    Link: https://lore.kernel.org/linux-usb/[email protected]/
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    CC: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: dwc3: pci: add support for the Intel Nova Lake -H [+ + +]

Author: Heikki Krogerus <[email protected]>
Date:   Mon Mar 9 14:02:04 2026 +0100

    usb: dwc3: pci: add support for the Intel Nova Lake -H
    
    commit 17ab4d4078e22be7fd8fd6fc710c15c085a4cb1b upstream.
    
    This patch adds the necessary PCI ID for Intel Nova Lake -H
    devices.
    
    Signed-off-by: Heikki Krogerus <[email protected]>
    Cc: stable <[email protected]>
    Acked-by: Thinh Nguyen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

USB: ezcap401 needs USB_QUIRK_NO_BOS to function on 10gbs usb speed [+ + +]

Author: Vyacheslav Vahnenko <[email protected]>
Date:   Fri Mar 13 15:36:38 2026 +0300

    USB: ezcap401 needs USB_QUIRK_NO_BOS to function on 10gbs usb speed
    
    commit d0d9b1f4f5391e6a00cee81d73ed2e8f98446d5f upstream.
    
    Add USB_QUIRK_NO_BOS for ezcap401 capture card, without it dmesg will show
    "unable to get BOS descriptor or descriptor too short" and "unable to
    read config index 0 descriptor/start: -71" errors and device will not
    able to work at full speed at 10gbs
    
    Signed-off-by: Vyacheslav Vahnenko <[email protected]>
    Cc: stable <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: gadget: f_mass_storage: Fix potential integer overflow in check_command_size_in_blocks() [+ + +]

Author: Seungjin Bae <[email protected]>
Date:   Sat Feb 28 05:43:25 2026 -0500

    usb: gadget: f_mass_storage: Fix potential integer overflow in check_command_size_in_blocks()
    
    [ Upstream commit 8479891d1f04a8ce55366fe4ca361ccdb96f02e1 ]
    
    The `check_command_size_in_blocks()` function calculates the data size
    in bytes by left shifting `common->data_size_from_cmnd` by the block
    size (`common->curlun->blkbits`). However, it does not validate whether
    this shift operation will cause an integer overflow.
    
    Initially, the block size is set up in `fsg_lun_open()` , and the
    `common->data_size_from_cmnd` is set up in `do_scsi_command()`. During
    initialization, there is no integer overflow check for the interaction
    between two variables.
    
    So if a malicious USB host sends a SCSI READ or WRITE command
    requesting a large amount of data (`common->data_size_from_cmnd`), the
    left shift operation can wrap around. This results in a truncated data
    size, which can bypass boundary checks and potentially lead to memory
    corruption or out-of-bounds accesses.
    
    Fix this by using the check_shl_overflow() macro to safely perform the
    shift and catch any overflows.
    
    Fixes: 144974e7f9e3 ("usb: gadget: mass_storage: support multi-luns with different logic block size")
    Signed-off-by: Seungjin Bae <[email protected]>
    Reviewed-by: Alan Stern <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usb: gadget: f_ncm: Fix net_device lifecycle with device_move [+ + +]

Author: Kuen-Han Tsai <[email protected]>
Date:   Mon Mar 16 19:14:31 2026 -0400

    usb: gadget: f_ncm: Fix net_device lifecycle with device_move
    
    [ Upstream commit ec35c1969650e7cb6c8a91020e568ed46e3551b0 ]
    
    The network device outlived its parent gadget device during
    disconnection, resulting in dangling sysfs links and null pointer
    dereference problems.
    
    A prior attempt to solve this by removing SET_NETDEV_DEV entirely [1]
    was reverted due to power management ordering concerns and a NO-CARRIER
    regression.
    
    A subsequent attempt to defer net_device allocation to bind [2] broke
    1:1 mapping between function instance and network device, making it
    impossible for configfs to report the resolved interface name. This
    results in a regression where the DHCP server fails on pmOS.
    
    Use device_move to reparent the net_device between the gadget device and
    /sys/devices/virtual/ across bind/unbind cycles. This preserves the
    network interface across USB reconnection, allowing the DHCP server to
    retain their binding.
    
    Introduce gether_attach_gadget()/gether_detach_gadget() helpers and use
    __free(detach_gadget) macro to undo attachment on bind failure. The
    bind_count ensures device_move executes only on the first bind.
    
    [1] https://lore.kernel.org/lkml/[email protected]/
    [2] https://lore.kernel.org/linux-usb/[email protected]/
    
    Fixes: 40d133d7f542 ("usb: gadget: f_ncm: convert to new function interface with backward compatibility")
    Cc: stable <[email protected]>
    Signed-off-by: Kuen-Han Tsai <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: gadget: f_tcm: Fix NULL pointer dereferences in nexus handling [+ + +]

Author: Jiasheng Jiang <[email protected]>
Date:   Mon Mar 16 15:36:31 2026 -0400

    usb: gadget: f_tcm: Fix NULL pointer dereferences in nexus handling
    
    [ Upstream commit b9fde507355342a2d64225d582dc8b98ff5ecb19 ]
    
    The `tpg->tpg_nexus` pointer in the USB Target driver is dynamically
    managed and tied to userspace configuration via ConfigFS. It can be
    NULL if the USB host sends requests before the nexus is fully
    established or immediately after it is dropped.
    
    Currently, functions like `bot_submit_command()` and the data
    transfer paths retrieve `tv_nexus = tpg->tpg_nexus` and immediately
    dereference `tv_nexus->tvn_se_sess` without any validation. If a
    malicious or misconfigured USB host sends a BOT (Bulk-Only Transport)
    command during this race window, it triggers a NULL pointer
    dereference, leading to a kernel panic (local DoS).
    
    This exposes an inconsistent API usage within the module, as peer
    functions like `usbg_submit_command()` and `bot_send_bad_response()`
    correctly implement a NULL check for `tv_nexus` before proceeding.
    
    Fix this by bringing consistency to the nexus handling. Add the
    missing `if (!tv_nexus)` checks to the vulnerable BOT command and
    request processing paths, aborting the command gracefully with an
    error instead of crashing the system.
    
    Fixes: c52661d60f63 ("usb-gadget: Initial merge of target module for UASP + BOT")
    Cc: stable <[email protected]>
    Signed-off-by: Jiasheng Jiang <[email protected]>
    Reviewed-by: Thinh Nguyen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: image: mdc800: kill download URB on timeout [+ + +]

Author: Ziyi Guo <[email protected]>
Date:   Mon Feb 9 15:19:37 2026 +0000

    usb: image: mdc800: kill download URB on timeout
    
    commit 1be3b77de4eb89af8ae2fd6610546be778e25589 upstream.
    
    mdc800_device_read() submits download_urb and waits for completion.
    If the timeout fires and the device has not responded, the function
    returns without killing the URB, leaving it active.
    
    A subsequent read() resubmits the same URB while it is still
    in-flight, triggering the WARN in usb_submit_urb():
    
      "URB submitted while active"
    
    Check the return value of wait_event_timeout() and kill the URB if
    it indicates timeout, ensuring the URB is complete before its status
    is inspected or the URB is resubmitted.
    
    Similar to
    - commit 372c93131998 ("USB: yurex: fix control-URB timeout handling")
    - commit b98d5000c505 ("media: rc: iguanair: handle timeouts")
    
    Signed-off-by: Ziyi Guo <[email protected]>
    Cc: stable <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: mdc800: handle signal and read racing [+ + +]

Author: Oliver Neukum <[email protected]>
Date:   Mon Feb 9 15:20:48 2026 +0100

    usb: mdc800: handle signal and read racing
    
    commit 2d6d260e9a3576256fe9ef6d1f7930c9ec348723 upstream.
    
    If a signal arrives after a read has partially completed,
    we need to return the number of bytes read. -EINTR is correct
    only if that number is zero.
    
    Signed-off-by: Oliver Neukum <[email protected]>
    Cc: stable <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: misc: uss720: properly clean up reference in uss720_probe() [+ + +]

Author: Greg Kroah-Hartman <[email protected]>
Date:   Mon Feb 23 13:19:43 2026 +0100

    usb: misc: uss720: properly clean up reference in uss720_probe()
    
    commit 45dba8011efac11a2f360383221b541f5ea53ce5 upstream.
    
    If get_1284_register() fails, the usb device reference count is
    incorrect and needs to be properly dropped before returning.  That will
    happen when the kref is dropped in the call to destroy_priv(), so jump
    to that error path instead of returning directly.
    
    Cc: stable <[email protected]>
    Assisted-by: gkh_clanker_2000
    Link: https://patch.msgid.link/2026022342-smokiness-stove-d792@gregkh
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: renesas_usbhs: fix use-after-free in ISR during device removal [+ + +]

Author: Fan Wu <[email protected]>
Date:   Tue Mar 3 07:33:44 2026 +0000

    usb: renesas_usbhs: fix use-after-free in ISR during device removal
    
    commit 3cbc242b88c607f55da3d0d0d336b49bf1e20412 upstream.
    
    In usbhs_remove(), the driver frees resources (including the pipe array)
    while the interrupt handler (usbhs_interrupt) is still registered. If an
    interrupt fires after usbhs_pipe_remove() but before the driver is fully
    unbound, the ISR may access freed memory, causing a use-after-free.
    
    Fix this by calling devm_free_irq() before freeing resources. This ensures
    the interrupt handler is both disabled and synchronized (waits for any
    running ISR to complete) before usbhs_pipe_remove() is called.
    
    Fixes: f1407d5c6624 ("usb: renesas_usbhs: Add Renesas USBHS common code")
    Cc: stable <[email protected]>
    Suggested-by: Alan Stern <[email protected]>
    Signed-off-by: Fan Wu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: roles: get usb role switch from parent only for usb-b-connector [+ + +]

Author: Xu Yang <[email protected]>
Date:   Mon Mar 9 15:43:13 2026 +0800

    usb: roles: get usb role switch from parent only for usb-b-connector
    
    commit 8345b1539faa49fcf9c9439c3cbd97dac6eca171 upstream.
    
    usb_role_switch_is_parent() was walking up to the parent node and checking
    for the "usb-role-switch" property regardless of the type of the passed
    fwnode. This could cause unrelated device nodes to be probed as potential
    role switch parent, leading to spurious matches and "-EPROBE_DEFER" being
    returned infinitely.
    
    Till now only Type-B connector node will have a parent node which may
    present "usb-role-switch" property and register the role switch device.
    For Type-C connector node, its parent node will always be a Type-C chip
    device which will never register the role switch device. However, it may
    still present a non-boolean "usb-role-switch = <&usb_controller>" property
    for historical compatibility.
    
    So restrict the helper to only operate on Type-B connector when attempting
    to get the role switch from parent node.
    
    Fixes: 6fadd72943b8 ("usb: roles: get usb-role-switch from parent")
    Cc: stable <[email protected]>
    Signed-off-by: Xu Yang <[email protected]>
    Tested-by: Arnaud Ferraris <[email protected]>
    Reviewed-by: Heikki Krogerus <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

USB: serial: f81232: fix incomplete serial port generation [+ + +]

Author: Ji-Ze Hong (Peter Hong) <[email protected]>
Date:   Fri Dec 12 15:08:31 2025 +0800

    USB: serial: f81232: fix incomplete serial port generation
    
    commit cd644b805da8a253198718741bf363c4c58862ff upstream.
    
    The Fintek F81532A/534A/535/536 family relies on the
    F81534A_CTRL_CMD_ENABLE_PORT (116h) register during initialization to
    both determine serial port status and control port creation. If the
    driver experiences fast load/unload cycles, the device state may becomes
    unstable, resulting in the incomplete generation of serial ports.
    
    Performing a dummy read operation on the register prior to the initial
    write command resolves the issue. This clears the device's stale internal
    state. Subsequent write operations will correctly generate all serial
    ports.
    
    This patch also removes the retry loop in f81534a_ctrl_set_register()
    because the stale state has been fixed.
    
    Tested on: HygonDM1SLT(Hygon C86 3250 8-core Processor)
    
    Signed-off-by: Ji-Ze Hong (Peter Hong) <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: typec: altmode/displayport: set displayport signaling rate in configure message [+ + +]

Author: RD Babiera <[email protected]>
Date:   Tue Mar 10 20:41:05 2026 +0000

    usb: typec: altmode/displayport: set displayport signaling rate in configure message
    
    commit e8557acfa079a54b59a21f447c82a31aec7717df upstream.
    
    dp_altmode_configure sets the signaling rate to the current
    configuration's rate and then shifts the value to the Select
    Configuration bitfield. On the initial configuration, dp->data.conf
    is 0 to begin with, so the signaling rate field is never set, which
    leads to some DisplayPort Alt Mode partners sending NAK to the
    Configure message.
    
    Set the signaling rate to the capabilities supported by both the
    port and the port partner. If the cable supports DisplayPort Alt Mode,
    then include its capabilities as well.
    
    Fixes: a17fae8fc38e ("usb: typec: Add Displayport Alternate Mode 2.1 Support")
    Cc: stable <[email protected]>
    Signed-off-by: RD Babiera <[email protected]>
    Acked-by: Heikki Krogerus <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

USB: usbcore: Introduce usb_bulk_msg_killable() [+ + +]

Author: Alan Stern <[email protected]>
Date:   Tue Feb 17 22:07:47 2026 -0500

    USB: usbcore: Introduce usb_bulk_msg_killable()
    
    commit 416909962e7cdf29fd01ac523c953f37708df93d upstream.
    
    The synchronous message API in usbcore (usb_control_msg(),
    usb_bulk_msg(), and so on) uses uninterruptible waits.  However,
    drivers may call these routines in the context of a user thread, which
    means it ought to be possible to at least kill them.
    
    For this reason, introduce a new usb_bulk_msg_killable() function
    which behaves the same as usb_bulk_msg() except for using
    wait_for_completion_killable_timeout() instead of
    wait_for_completion_timeout().  The same can be done later for
    usb_control_msg() later on, if it turns out to be needed.
    
    Signed-off-by: Alan Stern <[email protected]>
    Suggested-by: Oliver Neukum <[email protected]>
    Link: https://lore.kernel.org/linux-usb/[email protected]/
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    CC: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

USB: usbtmc: Use usb_bulk_msg_killable() with user-specified timeouts [+ + +]

Author: Alan Stern <[email protected]>
Date:   Tue Feb 17 22:09:22 2026 -0500

    USB: usbtmc: Use usb_bulk_msg_killable() with user-specified timeouts
    
    commit 7784caa413a89487dd14dd5c41db8753483b2acb upstream.
    
    The usbtmc driver accepts timeout values specified by the user in an
    ioctl command, and uses these timeouts for some usb_bulk_msg() calls.
    Since the user can specify arbitrarily long timeouts and
    usb_bulk_msg() uses unkillable waits, call usb_bulk_msg_killable()
    instead to avoid the possibility of the user hanging a kernel thread
    indefinitely.
    
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/linux-usb/[email protected]/T/#t
    Tested-by: [email protected]
    Signed-off-by: Alan Stern <[email protected]>
    Fixes: 048c6d88a021 ("usb: usbtmc: Add ioctls to set/get usb timeout")
    CC: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: xhci: Fix memory leak in xhci_disable_slot() [+ + +]

Author: Zilin Guan <[email protected]>
Date:   Thu Mar 5 00:36:37 2026 +0200

    usb: xhci: Fix memory leak in xhci_disable_slot()
    
    commit c1c8550e70401159184130a1afc6261db01fc0ce upstream.
    
    xhci_alloc_command() allocates a command structure and, when the
    second argument is true, also allocates a completion structure.
    Currently, the error handling path in xhci_disable_slot() only frees
    the command structure using kfree(), causing the completion structure
    to leak.
    
    Use xhci_free_command() instead of kfree(). xhci_free_command() correctly
    frees both the command structure and the associated completion structure.
    Since the command structure is allocated with zero-initialization,
    command->in_ctx is NULL and will not be erroneously freed by
    xhci_free_command().
    
    This bug was found using an experimental static analysis tool we are
    developing. The tool is based on the LLVM framework and is specifically
    designed to detect memory management issues. It is currently under
    active development and not yet publicly available, but we plan to
    open-source it after our research is published.
    
    The bug was originally detected on v6.13-rc1 using our static analysis
    tool, and we have verified that the issue persists in the latest mainline
    kernel.
    
    We performed build testing on x86_64 with allyesconfig using GCC=11.4.0.
    Since triggering these error paths in xhci_disable_slot() requires specific
    hardware conditions or abnormal state, we were unable to construct a test
    case to reliably trigger these specific error paths at runtime.
    
    Fixes: 7faac1953ed1 ("xhci: avoid race between disable slot command and host runtime suspend")
    CC: [email protected]
    Signed-off-by: Zilin Guan <[email protected]>
    Signed-off-by: Mathias Nyman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: xhci: Prevent interrupt storm on host controller error (HCE) [+ + +]

Author: Dayu Jiang <[email protected]>
Date:   Thu Mar 5 00:36:38 2026 +0200

    usb: xhci: Prevent interrupt storm on host controller error (HCE)
    
    commit d6d5febd12452b7fd951fdd15c3ec262f01901a4 upstream.
    
    The xHCI controller reports a Host Controller Error (HCE) in UAS Storage
    Device plug/unplug scenarios on Android devices. HCE is checked in
    xhci_irq() function and causes an interrupt storm (since the interrupt
    isn’t cleared), leading to severe system-level faults.
    
    When the xHC controller reports HCE in the interrupt handler, the driver
    only logs a warning and assumes xHC activity will stop as stated in xHCI
    specification. An interrupt storm does however continue on some hosts
    even after HCE, and only ceases after manually disabling xHC interrupt
    and stopping the controller by calling xhci_halt().
    
    Add xhci_halt() to xhci_irq() function where STS_HCE status is checked,
    mirroring the existing error handling pattern used for STS_FATAL errors.
    
    This only fixes the interrupt storm. Proper HCE recovery requires resetting
    and re-initializing the xHC.
    
    CC: [email protected]
    Signed-off-by: Dayu Jiang <[email protected]>
    Signed-off-by: Mathias Nyman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: yurex: fix race in probe [+ + +]

Author: Oliver Neukum <[email protected]>
Date:   Mon Feb 9 15:37:20 2026 +0100

    usb: yurex: fix race in probe
    
    commit 7a875c09899ba0404844abfd8f0d54cdc481c151 upstream.
    
    The bbu member of the descriptor must be set to the value
    standing for uninitialized values before the URB whose
    completion handler sets bbu is submitted. Otherwise there is
    a window during which probing can overwrite already retrieved
    data.
    
    Cc: stable <[email protected]>
    Signed-off-by: Oliver Neukum <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: cfg80211: cancel pmsr_free_wk in cfg80211_pmsr_wdev_down [+ + +]

Author: Peddolla Harshavardhan Reddy <[email protected]>
Date:   Thu Mar 5 21:36:59 2026 +0530

    wifi: cfg80211: cancel pmsr_free_wk in cfg80211_pmsr_wdev_down
    
    [ Upstream commit 6dccbc9f3e1d38565dff7730d2b7d1e8b16c9b09 ]
    
    When the nl80211 socket that originated a PMSR request is
    closed, cfg80211_release_pmsr() sets the request's nl_portid
    to zero and schedules pmsr_free_wk to process the abort
    asynchronously. If the interface is concurrently torn down
    before that work runs, cfg80211_pmsr_wdev_down() calls
    cfg80211_pmsr_process_abort() directly. However, the already-
    scheduled pmsr_free_wk work item remains pending and may run
    after the interface has been removed from the driver. This
    could cause the driver's abort_pmsr callback to operate on a
    torn-down interface, leading to undefined behavior and
    potential crashes.
    
    Cancel pmsr_free_wk synchronously in cfg80211_pmsr_wdev_down()
    before calling cfg80211_pmsr_process_abort(). This ensures any
    pending or in-progress work is drained before interface teardown
    proceeds, preventing the work from invoking the driver abort
    callback after the interface is gone.
    
    Fixes: 9bb7e0f24e7e ("cfg80211: add peer measurement with FTM initiator API")
    Signed-off-by: Peddolla Harshavardhan Reddy <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: libertas: fix use-after-free in lbs_free_adapter() [+ + +]

Author: Daniel Hodges <[email protected]>
Date:   Mon Mar 9 07:02:21 2026 -0400

    wifi: libertas: fix use-after-free in lbs_free_adapter()
    
    [ Upstream commit 03cc8f90d0537fcd4985c3319b4fafbf2e3fb1f0 ]
    
    The lbs_free_adapter() function uses timer_delete() (non-synchronous)
    for both command_timer and tx_lockup_timer before the structure is
    freed. This is incorrect because timer_delete() does not wait for
    any running timer callback to complete.
    
    If a timer callback is executing when lbs_free_adapter() is called,
    the callback will access freed memory since lbs_cfg_free() frees the
    containing structure immediately after lbs_free_adapter() returns.
    
    Both timer callbacks (lbs_cmd_timeout_handler and lbs_tx_lockup_handler)
    access priv->driver_lock, priv->cur_cmd, priv->dev, and other fields,
    which would all be use-after-free violations.
    
    Use timer_delete_sync() instead to ensure any running timer callback
    has completed before returning.
    
    This bug was introduced in commit 8f641d93c38a ("libertas: detect TX
    lockups and reset hardware") where del_timer() was used instead of
    del_timer_sync() in the cleanup path. The command_timer has had the
    same issue since the driver was first written.
    
    Fixes: 8f641d93c38a ("libertas: detect TX lockups and reset hardware")
    Fixes: 954ee164f4f4 ("[PATCH] libertas: reorganize and simplify init sequence")
    Cc: [email protected]
    Signed-off-by: Daniel Hodges <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    [ del_timer() => timer_delete_sync() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: mac80211: fix NULL deref in mesh_matches_local() [+ + +]

Author: Xiang Mei <[email protected]>
Date:   Tue Mar 17 20:42:44 2026 -0700

    wifi: mac80211: fix NULL deref in mesh_matches_local()
    
    [ Upstream commit c73bb9a2d33bf81f6eecaa0f474b6c6dbe9855bd ]
    
    mesh_matches_local() unconditionally dereferences ie->mesh_config to
    compare mesh configuration parameters. When called from
    mesh_rx_csa_frame(), the parsed action-frame elements may not contain a
    Mesh Configuration IE, leaving ie->mesh_config NULL and triggering a
    kernel NULL pointer dereference.
    
    The other two callers are already safe:
      - ieee80211_mesh_rx_bcn_presp() checks !elems->mesh_config before
        calling mesh_matches_local()
      - mesh_plink_get_event() is only reached through
        mesh_process_plink_frame(), which checks !elems->mesh_config, too
    
    mesh_rx_csa_frame() is the only caller that passes raw parsed elements
    to mesh_matches_local() without guarding mesh_config. An adjacent
    attacker can exploit this by sending a crafted CSA action frame that
    includes a valid Mesh ID IE but omits the Mesh Configuration IE,
    crashing the kernel.
    
    The captured crash log:
    
    Oops: general protection fault, probably for non-canonical address ...
    KASAN: null-ptr-deref in range [0x0000000000000000-0x0000000000000007]
    Workqueue: events_unbound cfg80211_wiphy_work
    [...]
    Call Trace:
     <TASK>
     ? __pfx_mesh_matches_local (net/mac80211/mesh.c:65)
     ieee80211_mesh_rx_queued_mgmt (net/mac80211/mesh.c:1686)
     [...]
     ieee80211_iface_work (net/mac80211/iface.c:1754 net/mac80211/iface.c:1802)
     [...]
     cfg80211_wiphy_work (net/wireless/core.c:426)
     process_one_work (net/kernel/workqueue.c:3280)
     ? assign_work (net/kernel/workqueue.c:1219)
     worker_thread (net/kernel/workqueue.c:3352)
     ? __pfx_worker_thread (net/kernel/workqueue.c:3385)
     kthread (net/kernel/kthread.c:436)
     [...]
     ret_from_fork_asm (net/arch/x86/entry/entry_64.S:255)
     </TASK>
    
    This patch adds a NULL check for ie->mesh_config at the top of
    mesh_matches_local() to return false early when the Mesh Configuration
    IE is absent.
    
    Fixes: 2e3c8736820b ("mac80211: support functions for mesh")
    Reported-by: Weiming Shi <[email protected]>
    Signed-off-by: Xiang Mei <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: mac80211: Fix static_branch_dec() underflow for aql_disable. [+ + +]

Author: Kuniyuki Iwashima <[email protected]>
Date:   Fri Mar 6 07:24:02 2026 +0000

    wifi: mac80211: Fix static_branch_dec() underflow for aql_disable.
    
    [ Upstream commit b94ae8e0d5fe1bdbbfdc3854ff6ce98f6876a828 ]
    
    syzbot reported static_branch_dec() underflow in aql_enable_write(). [0]
    
    The problem is that aql_enable_write() does not serialise concurrent
    write()s to the debugfs.
    
    aql_enable_write() checks static_key_false(&aql_disable.key) and
    later calls static_branch_inc() or static_branch_dec(), but the
    state may change between the two calls.
    
    aql_disable does not need to track inc/dec.
    
    Let's use static_branch_enable() and static_branch_disable().
    
    [0]:
    val == 0
    WARNING: kernel/jump_label.c:311 at __static_key_slow_dec_cpuslocked.part.0+0x107/0x120 kernel/jump_label.c:311, CPU#0: syz.1.3155/20288
    Modules linked in:
    CPU: 0 UID: 0 PID: 20288 Comm: syz.1.3155 Tainted: G     U       L      syzkaller #0 PREEMPT(full)
    Tainted: [U]=USER, [L]=SOFTLOCKUP
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/24/2026
    RIP: 0010:__static_key_slow_dec_cpuslocked.part.0+0x107/0x120 kernel/jump_label.c:311
    Code: f2 c9 ff 5b 5d c3 cc cc cc cc e8 54 f2 c9 ff 48 89 df e8 ac f9 ff ff eb ad e8 45 f2 c9 ff 90 0f 0b 90 eb a2 e8 3a f2 c9 ff 90 <0f> 0b 90 eb 97 48 89 df e8 5c 4b 33 00 e9 36 ff ff ff 0f 1f 80 00
    RSP: 0018:ffffc9000b9f7c10 EFLAGS: 00010293
    RAX: 0000000000000000 RBX: ffffffff9b3e5d40 RCX: ffffffff823c57b4
    RDX: ffff8880285a0000 RSI: ffffffff823c5846 RDI: ffff8880285a0000
    RBP: 0000000000000000 R08: 0000000000000005 R09: 0000000000000000
    R10: 0000000000000000 R11: 0000000000000000 R12: 000000000000000a
    R13: 1ffff9200173ef88 R14: 0000000000000001 R15: ffffc9000b9f7e98
    FS:  00007f530dd726c0(0000) GS:ffff8881245e3000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 0000200000001140 CR3: 000000007cc4a000 CR4: 00000000003526f0
    Call Trace:
     <TASK>
     __static_key_slow_dec_cpuslocked kernel/jump_label.c:297 [inline]
     __static_key_slow_dec kernel/jump_label.c:321 [inline]
     static_key_slow_dec+0x7c/0xc0 kernel/jump_label.c:336
     aql_enable_write+0x2b2/0x310 net/mac80211/debugfs.c:343
     short_proxy_write+0x133/0x1a0 fs/debugfs/file.c:383
     vfs_write+0x2aa/0x1070 fs/read_write.c:684
     ksys_pwrite64 fs/read_write.c:793 [inline]
     __do_sys_pwrite64 fs/read_write.c:801 [inline]
     __se_sys_pwrite64 fs/read_write.c:798 [inline]
     __x64_sys_pwrite64+0x1eb/0x250 fs/read_write.c:798
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0xc9/0xf80 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    RIP: 0033:0x7f530cf9aeb9
    Code: ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 e8 ff ff ff f7 d8 64 89 01 48
    RSP: 002b:00007f530dd72028 EFLAGS: 00000246 ORIG_RAX: 0000000000000012
    RAX: ffffffffffffffda RBX: 00007f530d215fa0 RCX: 00007f530cf9aeb9
    RDX: 0000000000000003 RSI: 0000000000000000 RDI: 0000000000000010
    RBP: 00007f530d008c1f R08: 0000000000000000 R09: 0000000000000000
    R10: 4200000000000005 R11: 0000000000000246 R12: 0000000000000000
    R13: 00007f530d216038 R14: 00007f530d215fa0 R15: 00007ffde89fb978
     </TASK>
    
    Fixes: e908435e402a ("mac80211: introduce aql_enable node in debugfs")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/
    Signed-off-by: Kuniyuki Iwashima <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: mac80211: set default WMM parameters on all links [+ + +]

Author: Ramanathan Choodamani <[email protected]>
Date:   Thu Feb 5 15:12:16 2026 +0530

    wifi: mac80211: set default WMM parameters on all links
    
    [ Upstream commit 2259d14499d16b115ef8d5d2ddc867e2be7cb5b5 ]
    
    Currently, mac80211 only initializes default WMM parameters
    on the deflink during do_open(). For MLO cases, this
    leaves the additional links without proper WMM defaults
    if hostapd does not supply per-link WMM parameters, leading
    to inconsistent QoS behavior across links.
    
    Set default WMM parameters for each link during
    ieee80211_vif_update_links(), because this ensures all
    individual links in an MLD have valid WMM settings during
    bring-up and behave consistently across different BSS.
    
    Signed-off-by: Ramanathan Choodamani <[email protected]>
    Signed-off-by: Aishwarya R <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: wlcore: Return -ENOMEM instead of -EAGAIN if there is not enough headroom [+ + +]

Author: Guenter Roeck <[email protected]>
Date:   Tue Mar 17 23:46:36 2026 -0700

    wifi: wlcore: Return -ENOMEM instead of -EAGAIN if there is not enough headroom
    
    [ Upstream commit deb353d9bb009638b7762cae2d0b6e8fdbb41a69 ]
    
    Since upstream commit e75665dd0968 ("wifi: wlcore: ensure skb headroom
    before skb_push"), wl1271_tx_allocate() and with it
    wl1271_prepare_tx_frame() returns -EAGAIN if pskb_expand_head() fails.
    However, in wlcore_tx_work_locked(), a return value of -EAGAIN from
    wl1271_prepare_tx_frame() is interpreted as the aggregation buffer being
    full. This causes the code to flush the buffer, put the skb back at the
    head of the queue, and immediately retry the same skb in a tight while
    loop.
    
    Because wlcore_tx_work_locked() holds wl->mutex, and the retry happens
    immediately with GFP_ATOMIC, this will result in an infinite loop and a
    CPU soft lockup. Return -ENOMEM instead so the packet is dropped and
    the loop terminates.
    
    The problem was found by an experimental code review agent based on
    gemini-3.1-pro while reviewing backports into v6.18.y.
    
    Assisted-by: Gemini:gemini-3.1-pro
    Fixes: e75665dd0968 ("wifi: wlcore: ensure skb headroom before skb_push")
    Cc: Peter Astrand <[email protected]>
    Signed-off-by: Guenter Roeck <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

workqueue: Use POOL_BH instead of WQ_BH when checking pool flags [+ + +]

Author: Breno Leitao <[email protected]>
Date:   Thu Mar 5 08:15:37 2026 -0800

    workqueue: Use POOL_BH instead of WQ_BH when checking pool flags
    
    [ Upstream commit f42f9091be9e5ff57567a3945cfcdd498f475348 ]
    
    pr_cont_worker_id() checks pool->flags against WQ_BH, which is a
    workqueue-level flag (defined in workqueue.h). Pool flags use a
    separate namespace with POOL_* constants (defined in workqueue.c).
    The correct constant is POOL_BH. Both WQ_BH and POOL_BH are defined
    as (1 << 0) so this has no behavioral impact, but it is semantically
    wrong and inconsistent with every other pool-level BH check in the
    file.
    
    Fixes: 4cb1ef64609f ("workqueue: Implement BH workqueues to eventually replace tasklets")
    Signed-off-by: Breno Leitao <[email protected]>
    Acked-by: Song Liu <[email protected]>
    Signed-off-by: Tejun Heo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

x86/apic: Disable x2apic on resume if the kernel expects so [+ + +]

Author: Shashank Balaji <[email protected]>
Date:   Fri Mar 6 14:46:28 2026 +0900

    x86/apic: Disable x2apic on resume if the kernel expects so
    
    commit 8cc7dd77a1466f0ec58c03478b2e735a5b289b96 upstream.
    
    When resuming from s2ram, firmware may re-enable x2apic mode, which may have
    been disabled by the kernel during boot either because it doesn't support IRQ
    remapping or for other reasons. This causes the kernel to continue using the
    xapic interface, while the hardware is in x2apic mode, which causes hangs.
    This happens on defconfig + bare metal + s2ram.
    
    Fix this in lapic_resume() by disabling x2apic if the kernel expects it to be
    disabled, i.e. when x2apic_mode = 0.
    
    The ACPI v6.6 spec, Section 16.3 [1] says firmware restores either the
    pre-sleep configuration or initial boot configuration for each CPU, including
    MSR state:
    
      When executing from the power-on reset vector as a result of waking from an
      S2 or S3 sleep state, the platform firmware performs only the hardware
      initialization required to restore the system to either the state the
      platform was in prior to the initial operating system boot, or to the
      pre-sleep configuration state. In multiprocessor systems, non-boot
      processors should be placed in the same state as prior to the initial
      operating system boot.
    
      (further ahead)
    
      If this is an S2 or S3 wake, then the platform runtime firmware restores
      minimum context of the system before jumping to the waking vector. This
      includes:
    
            CPU configuration. Platform runtime firmware restores the pre-sleep
            configuration or initial boot configuration of each CPU (MSR, MTRR,
            firmware update, SMBase, and so on). Interrupts must be disabled (for
            IA-32 processors, disabled by CLI instruction).
    
            (and other things)
    
    So at least as per the spec, re-enablement of x2apic by the firmware is
    allowed if "x2apic on" is a part of the initial boot configuration.
    
      [1] https://uefi.org/specs/ACPI/6.6/16_Waking_and_Sleeping.html#initialization
    
      [ bp: Massage. ]
    
    Fixes: 6e1cb38a2aef ("x64, x2apic/intr-remap: add x2apic support, including enabling interrupt-remapping")
    Co-developed-by: Rahul Bukte <[email protected]>
    Signed-off-by: Rahul Bukte <[email protected]>
    Signed-off-by: Shashank Balaji <[email protected]>
    Signed-off-by: Borislav Petkov (AMD) <[email protected]>
    Reviewed-by: Thomas Gleixner <[email protected]>
    Reviewed-by: Sohil Mehta <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/platform/uv: Handle deconfigured sockets [+ + +]

Author: Kyle Meyer <[email protected]>
Date:   Fri Mar 20 12:19:20 2026 -0500

    x86/platform/uv: Handle deconfigured sockets
    
    commit 1f6aa5bbf1d0f81a8a2aafc16136e7dd9a609ff3 upstream.
    
    When a socket is deconfigured, it's mapped to SOCK_EMPTY (0xffff). This causes
    a panic while allocating UV hub info structures.
    
    Fix this by using NUMA_NO_NODE, allowing UV hub info structures to be
    allocated on valid nodes.
    
    Fixes: 8a50c5851927 ("x86/platform/uv: UV support for sub-NUMA clustering")
    Signed-off-by: Kyle Meyer <[email protected]>
    Signed-off-by: Borislav Petkov (AMD) <[email protected]>
    Reviewed-by: Steve Wahl <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/sev: Allow IBPB-on-Entry feature for SNP guests [+ + +]

Author: Kim Phillips <[email protected]>
Date:   Mon Mar 9 07:38:52 2026 -0400

    x86/sev: Allow IBPB-on-Entry feature for SNP guests
    
    [ Upstream commit 9073428bb204d921ae15326bb7d4558d9d269aab ]
    
    The SEV-SNP IBPB-on-Entry feature does not require a guest-side
    implementation. It was added in Zen5 h/w, after the first SNP Zen
    implementation, and thus was not accounted for when the initial set of SNP
    features were added to the kernel.
    
    In its abundant precaution, commit
    
      8c29f0165405 ("x86/sev: Add SEV-SNP guest feature negotiation support")
    
    included SEV_STATUS' IBPB-on-Entry bit as a reserved bit, thereby masking
    guests from using the feature.
    
    Allow guests to make use of IBPB-on-Entry when supported by the hypervisor, as
    the bit is now architecturally defined and safe to expose.
    
    Fixes: 8c29f0165405 ("x86/sev: Add SEV-SNP guest feature negotiation support")
    Signed-off-by: Kim Phillips <[email protected]>
    Signed-off-by: Borislav Petkov (AMD) <[email protected]>
    Reviewed-by: Nikunj A Dadhania <[email protected]>
    Reviewed-by: Tom Lendacky <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    [ merged missing SECURE_AVIC into RESERVED_BITS18_22 ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/uprobes: Fix XOL allocation failure for 32-bit tasks [+ + +]

Author: Oleg Nesterov <[email protected]>
Date:   Mon Mar 2 16:14:27 2026 +0100

    x86/uprobes: Fix XOL allocation failure for 32-bit tasks
    
    [ Upstream commit d55c571e4333fac71826e8db3b9753fadfbead6a ]
    
    This script
    
            #!/usr/bin/bash
    
            echo 0 > /proc/sys/kernel/randomize_va_space
    
            echo 'void main(void) {}' > TEST.c
    
            # -fcf-protection to ensure that the 1st endbr32 insn can't be emulated
            gcc -m32 -fcf-protection=branch TEST.c -o test
    
            bpftrace -e 'uprobe:./test:main {}' -c ./test
    
    "hangs", the probed ./test task enters an endless loop.
    
    The problem is that with randomize_va_space == 0
    get_unmapped_area(TASK_SIZE - PAGE_SIZE) called by xol_add_vma() can not
    just return the "addr == TASK_SIZE - PAGE_SIZE" hint, this addr is used
    by the stack vma.
    
    arch_get_unmapped_area_topdown() doesn't take TIF_ADDR32 into account and
    in_32bit_syscall() is false, this leads to info.high_limit > TASK_SIZE.
    vm_unmapped_area() happily returns the high address > TASK_SIZE and then
    get_unmapped_area() returns -ENOMEM after the "if (addr > TASK_SIZE - len)"
    check.
    
    handle_swbp() doesn't report this failure (probably it should) and silently
    restarts the probed insn. Endless loop.
    
    I think that the right fix should change the x86 get_unmapped_area() paths
    to rely on TIF_ADDR32 rather than in_32bit_syscall(). Note also that if
    CONFIG_X86_X32_ABI=y, in_x32_syscall() falsely returns true in this case
    because ->orig_ax = -1.
    
    But we need a simple fix for -stable, so this patch just sets TS_COMPAT if
    the probed task is 32-bit to make in_ia32_syscall() true.
    
    Fixes: 1b028f784e8c ("x86/mm: Introduce mmap_compat_base() for 32-bit mmap()")
    Reported-by: Paulo Andrade <[email protected]>
    Signed-off-by: Oleg Nesterov <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Link: https://lore.kernel.org/all/[email protected]/
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xdp: allow attaching already registered memory model to xdp_rxq_info [+ + +]

Author: Alexander Lobakin <[email protected]>
Date:   Tue Dec 3 18:37:27 2024 +0100

    xdp: allow attaching already registered memory model to xdp_rxq_info
    
    [ Upstream commit f65966fe0178c06065d354c22fb456fc4370b527 ]
    
    One may need to register memory model separately from xdp_rxq_info. One
    simple example may be XDP test run code, but in general, it might be
    useful when memory model registering is managed by one layer and then
    XDP RxQ info by a different one.
    Allow such scenarios by adding a simple helper which "attaches"
    already registered memory model to the desired xdp_rxq_info. As this
    is mostly needed for Page Pool, add a special function to do that for
    a &page_pool pointer.
    
    Reviewed-by: Toke Høiland-Jørgensen <[email protected]>
    Signed-off-by: Alexander Lobakin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: 6f1a9140ecda ("net: add xmit recursion limit to tunnel xmit functions")
    Signed-off-by: Sasha Levin <[email protected]>

xdp: register system page pool as an XDP memory model [+ + +]

Author: Toke Høiland-Jørgensen <[email protected]>
Date:   Tue Dec 3 18:37:29 2024 +0100

    xdp: register system page pool as an XDP memory model
    
    [ Upstream commit e77d9aee951341119be16a991fcfc76d1154d22a ]
    
    To make the system page pool usable as a source for allocating XDP
    frames, we need to register it with xdp_reg_mem_model(), so that page
    return works correctly. This is done in preparation for using the system
    page_pool to convert XDP_PASS XSk frames to skbs; for the same reason,
    make the per-cpu variable non-static so we can access it from other
    source files as well (but w/o exporting).
    
    Signed-off-by: Toke Høiland-Jørgensen <[email protected]>
    Signed-off-by: Alexander Lobakin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Stable-dep-of: 6f1a9140ecda ("net: add xmit recursion limit to tunnel xmit functions")
    Signed-off-by: Sasha Levin <[email protected]>

xen/privcmd: add boot control for restricted usage in domU [+ + +]

Author: Juergen Gross <[email protected]>
Date:   Tue Oct 14 13:28:15 2025 +0200

    xen/privcmd: add boot control for restricted usage in domU
    
    commit 1613462be621ad5103ec338a7b0ca0746ec4e5f1 upstream.
    
    When running in an unprivileged domU under Xen, the privcmd driver
    is restricted to allow only hypercalls against a target domain, for
    which the current domU is acting as a device model.
    
    Add a boot parameter "unrestricted" to allow all hypercalls (the
    hypervisor will still refuse destructive hypercalls affecting other
    guests).
    
    Make this new parameter effective only in case the domU wasn't started
    using secure boot, as otherwise hypercalls targeting the domU itself
    might result in violating the secure boot functionality.
    
    This is achieved by adding another lockdown reason, which can be
    tested to not being set when applying the "unrestricted" option.
    
    This is part of XSA-482
    
    Signed-off-by: Juergen Gross <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xen/privcmd: restrict usage in unprivileged domU [+ + +]

Author: Juergen Gross <[email protected]>
Date:   Thu Oct 9 16:54:58 2025 +0200

    xen/privcmd: restrict usage in unprivileged domU
    
    commit 453b8fb68f3641fea970db88b7d9a153ed2a37e8 upstream.
    
    The Xen privcmd driver allows to issue arbitrary hypercalls from
    user space processes. This is normally no problem, as access is
    usually limited to root and the hypervisor will deny any hypercalls
    affecting other domains.
    
    In case the guest is booted using secure boot, however, the privcmd
    driver would be enabling a root user process to modify e.g. kernel
    memory contents, thus breaking the secure boot feature.
    
    The only known case where an unprivileged domU is really needing to
    use the privcmd driver is the case when it is acting as the device
    model for another guest. In this case all hypercalls issued via the
    privcmd driver will target that other guest.
    
    Fortunately the privcmd driver can already be locked down to allow
    only hypercalls targeting a specific domain, but this mode can be
    activated from user land only today.
    
    The target domain can be obtained from Xenstore, so when not running
    in dom0 restrict the privcmd driver to that target domain from the
    beginning, resolving the potential problem of breaking secure boot.
    
    This is XSA-482
    
    Reported-by: Teddy Astie <[email protected]>
    Fixes: 1c5de1939c20 ("xen: add privcmd driver")
    Signed-off-by: Juergen Gross <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: ensure dquot item is deleted from AIL only after log shutdown [+ + +]

Author: Long Li <[email protected]>
Date:   Thu Mar 5 16:49:22 2026 +0800

    xfs: ensure dquot item is deleted from AIL only after log shutdown
    
    commit 186ac39b8a7d3ec7ce9c5dd45e5c2730177f375c upstream.
    
    In xfs_qm_dqflush(), when a dquot flush fails due to corruption
    (the out_abort error path), the original code removed the dquot log
    item from the AIL before calling xfs_force_shutdown(). This ordering
    introduces a subtle race condition that can lead to data loss after
    a crash.
    
    The AIL tracks the oldest dirty metadata in the journal. The position
    of the tail item in the AIL determines the log tail LSN, which is the
    oldest LSN that must be preserved for crash recovery. When an item is
    removed from the AIL, the log tail can advance past the LSN of that item.
    
    The race window is as follows: if the dquot item happens to be at
    the tail of the log, removing it from the AIL allows the log tail
    to advance. If a concurrent log write is sampling the tail LSN at
    the same time and subsequently writes a complete checkpoint (i.e.,
    one containing a commit record) to disk before the shutdown takes
    effect, the journal will no longer protect the dquot's last
    modification. On the next mount, log recovery will not replay the
    dquot changes, even though they were never written back to disk,
    resulting in silent data loss.
    
    Fix this by calling xfs_force_shutdown() before xfs_trans_ail_delete()
    in the out_abort path. Once the log is shut down, no new log writes
    can complete with an updated tail LSN, making it safe to remove the
    dquot item from the AIL.
    
    Cc: [email protected]
    Fixes: b707fffda6a3 ("xfs: abort consistently on dquot flush failure")
    Signed-off-by: Long Li <[email protected]>
    Reviewed-by: Carlos Maiolino <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: Fix error pointer dereference [+ + +]

Author: Ethan Tidmore <[email protected]>
Date:   Mon Mar 9 09:49:55 2026 -0400

    xfs: Fix error pointer dereference
    
    [ Upstream commit cddfa648f1ab99e30e91455be19cd5ade26338c2 ]
    
    The function try_lookup_noperm() can return an error pointer and is not
    checked for one.
    
    Add checks for error pointer in xrep_adoption_check_dcache() and
    xrep_adoption_zap_dcache().
    
    Detected by Smatch:
    fs/xfs/scrub/orphanage.c:449 xrep_adoption_check_dcache() error:
    'd_child' dereferencing possible ERR_PTR()
    
    fs/xfs/scrub/orphanage.c:485 xrep_adoption_zap_dcache() error:
    'd_child' dereferencing possible ERR_PTR()
    
    Fixes: 73597e3e42b4 ("xfs: ensure dentry consistency when the orphanage adopts a file")
    Cc: [email protected] # v6.16
    Signed-off-by: Ethan Tidmore <[email protected]>
    Reviewed-by: Darrick J. Wong <[email protected]>
    Reviewed-by: Nirjhar Roy (IBM) <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    [ adapted try_lookup_noperm() calls to d_hash_and_lookup() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: fix integer overflow in bmap intent sort comparator [+ + +]

Author: Long Li <[email protected]>
Date:   Wed Mar 18 21:59:56 2026 -0400

    xfs: fix integer overflow in bmap intent sort comparator
    
    [ Upstream commit 362c490980867930a098b99f421268fbd7ca05fd ]
    
    xfs_bmap_update_diff_items() sorts bmap intents by inode number using
    a subtraction of two xfs_ino_t (uint64_t) values, with the result
    truncated to int. This is incorrect when two inode numbers differ by
    more than INT_MAX (2^31 - 1), which is entirely possible on large XFS
    filesystems.
    
    Fix this by replacing the subtraction with cmp_int().
    
    Cc: <[email protected]> # v4.9
    Fixes: 9f3afb57d5f1 ("xfs: implement deferred bmbt map/unmap operations")
    Signed-off-by: Long Li <[email protected]>
    Reviewed-by: Darrick J. Wong <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    [ No cmp_int() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: fix returned valued from xfs_defer_can_append [+ + +]

Author: Carlos Maiolino <[email protected]>
Date:   Wed Mar 4 19:54:27 2026 +0100

    xfs: fix returned valued from xfs_defer_can_append
    
    commit 54fcd2f95f8d216183965a370ec69e1aab14f5da upstream.
    
    xfs_defer_can_append returns a bool, it shouldn't be returning
    a NULL.
    
    Found by code inspection.
    
    Fixes: 4dffb2cbb483 ("xfs: allow pausing of pending deferred work items")
    Cc: <[email protected]> # v6.8
    Signed-off-by: Carlos Maiolino <[email protected]>
    Reviewed-by: Darrick J. Wong <[email protected]>
    Acked-by: Souptick Joarder <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: fix undersized l_iclog_roundoff values [+ + +]

Author: Darrick J. Wong <[email protected]>
Date:   Wed Mar 4 20:26:20 2026 -0800

    xfs: fix undersized l_iclog_roundoff values
    
    commit 52a8a1ba883defbfe3200baa22cf4cd21985d51a upstream.
    
    If the superblock doesn't list a log stripe unit, we set the incore log
    roundoff value to 512.  This leads to corrupt logs and unmountable
    filesystems in generic/617 on a disk with 4k physical sectors...
    
    XFS (sda1): Mounting V5 Filesystem ff3121ca-26e6-4b77-b742-aaff9a449e1c
    XFS (sda1): Torn write (CRC failure) detected at log block 0x318e. Truncating head block from 0x3197.
    XFS (sda1): failed to locate log tail
    XFS (sda1): log mount/recovery failed: error -74
    XFS (sda1): log mount failed
    XFS (sda1): Mounting V5 Filesystem ff3121ca-26e6-4b77-b742-aaff9a449e1c
    XFS (sda1): Ending clean mount
    
    ...on the current xfsprogs for-next which has a broken mkfs.  xfs_info
    shows this...
    
    meta-data=/dev/sda1              isize=512    agcount=4, agsize=644992 blks
             =                       sectsz=4096  attr=2, projid32bit=1
             =                       crc=1        finobt=1, sparse=1, rmapbt=1
             =                       reflink=1    bigtime=1 inobtcount=1 nrext64=1
             =                       exchange=1   metadir=1
    data     =                       bsize=4096   blocks=2579968, imaxpct=25
             =                       sunit=0      swidth=0 blks
    naming   =version 2              bsize=4096   ascii-ci=0, ftype=1, parent=1
    log      =internal log           bsize=4096   blocks=16384, version=2
             =                       sectsz=4096  sunit=0 blks, lazy-count=1
    realtime =none                   extsz=4096   blocks=0, rtextents=0
             =                       rgcount=0    rgsize=268435456 extents
             =                       zoned=0      start=0 reserved=0
    
    ...observe that the log section has sectsz=4096 sunit=0, which means
    that the roundoff factor is 512, not 4096 as you'd expect.  We should
    fix mkfs not to generate broken filesystems, but anyone can fuzz the
    ondisk superblock so we should be more cautious.  I think the inadequate
    logic predates commit a6a65fef5ef8d0, but that's clearly going to
    require a different backport.
    
    Cc: [email protected] # v5.14
    Fixes: a6a65fef5ef8d0 ("xfs: log stripe roundoff is a property of the log")
    Signed-off-by: Darrick J. Wong <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: get rid of the xchk_xfile_*_descr calls [+ + +]

Author: Darrick J. Wong <[email protected]>
Date:   Tue Mar 10 14:47:50 2026 -0700

    xfs: get rid of the xchk_xfile_*_descr calls
    
    [ Upstream commit 60382993a2e18041f88c7969f567f168cd3b4de3 ]
    
    The xchk_xfile_*_descr macros call kasprintf, which can fail to allocate
    memory if the formatted string is larger than 16 bytes (or whatever the
    nofail guarantees are nowadays).  Some of them could easily exceed that,
    and Jiaming Zhang found a few places where that can happen with syzbot.
    
    The descriptions are debugging aids and aren't required to be unique, so
    let's just pass in static strings and eliminate this path to failure.
    Note this patch touches a number of commits, most of which were merged
    between 6.6 and 6.14.
    
    Cc: [email protected]
    Cc: <[email protected]> # v6.12
    Fixes: ab97f4b1c03075 ("xfs: repair AGI unlinked inode bucket lists")
    Signed-off-by: "Darrick J. Wong" <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Tested-by: Jiaming Zhang <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xprtrdma: Decrement re_receiving on the early exit paths [+ + +]

Author: Eric Badger <[email protected]>
Date:   Mon Feb 23 10:28:55 2026 -0800

    xprtrdma: Decrement re_receiving on the early exit paths
    
    [ Upstream commit 7b6275c80a0c81c5f8943272292dfe67730ce849 ]
    
    In the event that rpcrdma_post_recvs() fails to create a work request
    (due to memory allocation failure, say) or otherwise exits early, we
    should decrement ep->re_receiving before returning. Otherwise we will
    hang in rpcrdma_xprt_drain() as re_receiving will never reach zero and
    the completion will never be triggered.
    
    On a system with high memory pressure, this can appear as the following
    hung task:
    
        INFO: task kworker/u385:17:8393 blocked for more than 122 seconds.
              Tainted: G S          E       6.19.0 #3
        "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
        task:kworker/u385:17 state:D stack:0     pid:8393  tgid:8393  ppid:2      task_flags:0x4248060 flags:0x00080000
        Workqueue: xprtiod xprt_autoclose [sunrpc]
        Call Trace:
         <TASK>
         __schedule+0x48b/0x18b0
         ? ib_post_send_mad+0x247/0xae0 [ib_core]
         schedule+0x27/0xf0
         schedule_timeout+0x104/0x110
         __wait_for_common+0x98/0x180
         ? __pfx_schedule_timeout+0x10/0x10
         wait_for_completion+0x24/0x40
         rpcrdma_xprt_disconnect+0x444/0x460 [rpcrdma]
         xprt_rdma_close+0x12/0x40 [rpcrdma]
         xprt_autoclose+0x5f/0x120 [sunrpc]
         process_one_work+0x191/0x3e0
         worker_thread+0x2e3/0x420
         ? __pfx_worker_thread+0x10/0x10
         kthread+0x10d/0x230
         ? __pfx_kthread+0x10/0x10
         ret_from_fork+0x273/0x2b0
         ? __pfx_kthread+0x10/0x10
         ret_from_fork_asm+0x1a/0x30
    
    Fixes: 15788d1d1077 ("xprtrdma: Do not refresh Receive Queue while it is draining")
    Signed-off-by: Eric Badger <[email protected]>
    Reviewed-by: Chuck Lever <[email protected]>
    Signed-off-by: Anna Schumaker <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>