Today we found the following message in dmesg. It suggests that the error occurs during the validation of block map pages in the VDO driver. This is a production server: should we be worried about our data? Everything seems fine for now, but we don't know whether a reboot or remount could cause data loss.
[Mon May 12 14:35:04 2025] ------------[ cut here ]------------
[Mon May 12 14:35:04 2025] UBSAN: invalid-load in /home/kernel/COD/linux/drivers/md/dm-vdo/encodings.c:357:19
[Mon May 12 14:35:04 2025] load of value 4 is not a valid value for type '_Bool'
[Mon May 12 14:35:04 2025] CPU: 70 UID: 0 PID: 87816 Comm: dm_vdo0:logQ1 Not tainted 6.12.25-061225-generic #202504251035
[Mon May 12 14:35:04 2025] Hardware name: Oracle Corporation ORACLE SERVER X9-2c/TLA,MB TRAY,X9-2c, BIOS 66110100 07/17/2024
[Mon May 12 14:35:04 2025] Call Trace:
[Mon May 12 14:35:04 2025] <TASK>
[Mon May 12 14:35:04 2025] show_stack+0x49/0x60
[Mon May 12 14:35:04 2025] dump_stack_lvl+0x5f/0x90
[Mon May 12 14:35:04 2025] dump_stack+0x10/0x18
[Mon May 12 14:35:04 2025] ubsan_epilogue+0x9/0x40
[Mon May 12 14:35:04 2025] __ubsan_handle_load_invalid_value.cold+0x44/0x49
[Mon May 12 14:35:04 2025] vdo_validate_block_map_page.cold+0x14/0x1d [dm_vdo]
[Mon May 12 14:35:04 2025] page_is_loaded+0x7a/0x1d0 [dm_vdo]
[Mon May 12 14:35:04 2025] service_work_queue+0xbf/0x2e0 [dm_vdo]
[Mon May 12 14:35:04 2025] ? __pfx_autoremove_wake_function+0x10/0x10
[Mon May 12 14:35:04 2025] ? __pfx_work_queue_runner+0x10/0x10 [dm_vdo]
[Mon May 12 14:35:04 2025] work_queue_runner+0x1e/0x30 [dm_vdo]
[Mon May 12 14:35:04 2025] kthread+0xe1/0x110
[Mon May 12 14:35:04 2025] ? __pfx_kthread+0x10/0x10
[Mon May 12 14:35:04 2025] ret_from_fork+0x44/0x70
[Mon May 12 14:35:04 2025] ? __pfx_kthread+0x10/0x10
[Mon May 12 14:35:04 2025] ret_from_fork_asm+0x1a/0x30
[Mon May 12 14:35:04 2025] </TASK>
[Mon May 12 14:35:04 2025] ---[ end trace ]---
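The report above says that a byte read as a `_Bool` held the value 4, where only 0 and 1 are valid. To watch for recurrences, the two UBSAN lines could be pulled out of dmesg with something like the following minimal sketch. The function name `parse_ubsan_report` and the regular expressions are hypothetical, written only against the exact wording of the trace shown here; other UBSAN report kinds use different wording and would not match.

```python
import re

# Patterns written against the two UBSAN lines exactly as they appear in the
# trace above (assumption: other kernels or report kinds may word this differently).
LOCATION_RE = re.compile(
    r"UBSAN: (?P<kind>[\w-]+) in (?P<file>\S+):(?P<line>\d+):(?P<col>\d+)"
)
VALUE_RE = re.compile(
    r"load of value (?P<value>\d+) is not a valid value for type '(?P<type>[^']+)'"
)


def parse_ubsan_report(lines):
    """Return a dict describing the first UBSAN report in `lines`, or None."""
    report = None
    for text in lines:
        loc = LOCATION_RE.search(text)
        if loc:
            # Start of a report: remember where the invalid load happened.
            report = {
                "kind": loc["kind"],
                "file": loc["file"],
                "line": int(loc["line"]),
                "column": int(loc["col"]),
            }
            continue
        val = VALUE_RE.search(text)
        if val and report is not None:
            # Detail line: the offending value and the type it was loaded as.
            report["value"] = int(val["value"])
            report["type"] = val["type"]
            return report
    return report
```

Fed the lines of the trace above, this would report `invalid-load` at `encodings.c`, line 357, column 19, with value 4 for type `_Bool`.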
# uname -a
Linux nfs 6.12.25-061225-generic #202504251035 SMP PREEMPT_DYNAMIC Fri Apr 25 14:17:28 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
# cat /etc/os-release
PRETTY_NAME="Ubuntu 24.04.2 LTS"
NAME="Ubuntu"
VERSION_ID="24.04"
VERSION="24.04.2 LTS (Noble Numbat)"
VERSION_CODENAME=noble
ID=ubuntu
ID_LIKE=debian
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
UBUNTU_CODENAME=noble
LOGO=ubuntu-logo
# lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 46 bits physical, 57 bits virtual
Byte Order: Little Endian
CPU(s): 128
On-line CPU(s) list: 0-127
Vendor ID: GenuineIntel
BIOS Vendor ID: Intel(R) Corporation
Model name: Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz
BIOS Model name: Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz CPU @ 2.6GHz
BIOS CPU family: 179
CPU family: 6
Model: 106
Thread(s) per core: 2
Core(s) per socket: 32
Socket(s): 2
Stepping: 6
CPU(s) scaling MHz: 35%
CPU max MHz: 3400.0000
CPU min MHz: 800.0000
BogoMIPS: 5200.00
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 intel_ppin ssbd mba ibrs ibpb stibp ibrs_enhanced tpr_shadow flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx smap avx512ifma clflushopt clwb intel_pt avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local split_lock_detect wbnoinvd dtherm ida arat pln pts hwp hwp_act_window hwp_epp hwp_pkg_req vnmi avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg avx512_vpopcntdq la57 rdpid fsrm md_clear pconfig flush_l1d arch_capabilities
Virtualization features:
Virtualization: VT-x
Caches (sum of all):
L1d: 3 MiB (64 instances)
L1i: 2 MiB (64 instances)
L2: 80 MiB (64 instances)
L3: 96 MiB (2 instances)
NUMA:
NUMA node(s): 2
NUMA node0 CPU(s): 0-31,64-95
NUMA node1 CPU(s): 32-63,96-127
Vulnerabilities:
Gather data sampling: Vulnerable
Itlb multihit: Not affected
L1tf: Not affected
Mds: Not affected
Meltdown: Not affected
Mmio stale data: Mitigation; Clear CPU buffers; SMT vulnerable
Reg file data sampling: Not affected
Retbleed: Not affected
Spec rstack overflow: Not affected
Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Spectre v2: Mitigation; Enhanced / Automatic IBRS; IBPB conditional; RSB filling; PBRSB-eIBRS SW sequence; BHI SW loop, KVM SW loop
Srbds: Not affected
Tsx async abort: Not affected
# lsmod | grep dm_vdo
dm_vdo 610304 1
dm_bufio 57344 1 dm_vdo
lz4_compress 24576 1 dm_vdo
# modinfo dm_vdo
filename: /lib/modules/6.12.25-061225-generic/kernel/drivers/md/dm-vdo/dm-vdo.ko.zst
license: GPL
author: Red Hat, Inc.
description: device-mapper target for transparent deduplication
srcversion: AA4049C4D3DDD682B8D1854
depends: dm-bufio,lz4_compress
intree: Y
name: dm_vdo
retpoline: Y
vermagic: 6.12.25-061225-generic SMP preempt mod_unload modversions
sig_id: PKCS#7
signer: Build time autogenerated kernel key
sig_key: 4A:C5:40:D8:1E:57:AD:70:4C:31:A6:E7:3F:4B:9B:78:8D:8A:CC:3E
sig_hashalgo: sha512
# vdostats -v
vg_vdo_12d-lv_pool_vdo_12d-vpool :
version : 36
data blocks used : 9857637495
overhead blocks used : 145655537
logical blocks used : 70073585907
physical blocks : 11745043456
logical blocks : 134217728256
1K-blocks : 46980173824
1K-blocks used : 40013172128
1K-blocks available : 6967001696
used percent : 85
saving percent : 85
block map cache size : 8589934592
block size : 4096
completed recovery count : 8
read-only recovery count : 0
operating mode : normal
recovery progress (%) : N/A
compressed fragments written : 1005840146
compressed blocks written : 344358994
compressed fragments in packer : 75
slab count : 1394
slabs opened : 1378
slabs reopened : 1064
journal disk full count : 0
journal commits requested count : 579096662
journal entries batching : 482
journal entries started : 8445027442
journal entries writing : 55
journal entries written : 8445026960
journal entries committed : 8445026905
journal blocks batching : 4
journal blocks started : 47896981
journal blocks writing : 1
journal blocks written : 47896977
journal blocks committed : 47896976
slab journal disk full count : 0
slab journal flush count : 1847328
slab journal blocked count : 0
slab journal blocks written : 8534922
slab journal tail busy count : 0
slab summary blocks written : 8501962
reference blocks written : 80366186
block map dirty pages : 4090
block map clean pages : 2093062
block map free pages : 0
block map failed pages : 0
block map incoming pages : 0
block map outgoing pages : 0
block map cache pressure : 0
block map read count : 10747587929
block map write count : 8443770099
block map failed reads : 0
block map failed writes : 0
block map reclaimed : 5720
block map read outgoing : 6279
block map found in cache : 17656713258
block map discard required : 10071081
block map wait for page : 1522476537
block map fetch required : 2097152
block map pages loaded : 12168233
block map pages saved : 14806114
block map flush count : 12702154
dedupe advice valid : 6087262806
dedupe advice stale : 17152689
concurrent data matches : 236340194
concurrent hash collisions : 0
current dedupe queries : 598
invalid advice PBN count : 0
no space error count : 0
read only error count : 0
instance : 0
512 byte emulation : off
current VDO IO requests in progress : 1629
maximum VDO IO requests in progress : 2048
dedupe advice timeouts : 0
flush out : 13144
write amplification ratio : 0.00
bios in read : 2303826589
bios in write : 8443762347
bios in empty flush : 13143
bios in discard : 0
bios in flush : 13143
bios in fua : 26082
bios in partial read : 0
bios in partial write : 0
bios in partial empty flush : 0
bios in partial discard : 0
bios in partial flush : 0
bios in partial fua : 0
bios out read : 8322974837
bios out write : 1119322682
bios out empty flush : 0
bios out discard : 0
bios out flush : 0
bios out fua : 0
bios meta read : 14636132
bios meta write : 161429846
bios meta empty flush : 27759568
bios meta discard : 0
bios meta flush : 165846168
bios meta fua : 47896981
bios journal read : 128
bios journal write : 47896977
bios journal empty flush : 10921932
bios journal discard : 0
bios journal flush : 58818909
bios journal fua : 47896977
bios page cache read : 12168233
bios page cache write : 16070588
bios page cache empty flush : 12702154
bios page cache discard : 0
bios page cache flush : 13966628
bios page cache fua : 0
bios out completed read : 8322974837
bios out completed write : 1119322670
bios out completed empty flush : 0
bios out completed discard : 0
bios out completed flush : 0
bios out completed fua : 0
bios meta completed read : 14636132
bios meta completed write : 161429846
bios meta completed empty flush : 27759568
bios meta completed discard : 0
bios meta completed flush : 27759568
bios meta completed fua : 47896981
bios journal completed read : 128
bios journal completed write : 47896977
bios journal completed empty flush : 10921932
bios journal completed discard : 0
bios journal completed flush : 10921932
bios journal completed fua : 47896977
bios page cache completed read : 12168233
bios page cache completed write : 16070588
bios page cache completed empty flush : 12702154
bios page cache completed discard : 0
bios page cache completed flush : 12702154
bios page cache completed fua : 0
bios acknowledged read : 2303826589
bios acknowledged write : 8443762340
bios acknowledged empty flush : 13143
bios acknowledged discard : 0
bios acknowledged flush : 13143
bios acknowledged fua : 26082
bios acknowledged partial read : 0
bios acknowledged partial write : 0
bios acknowledged partial empty flush : 0
bios acknowledged partial discard : 0
bios acknowledged partial flush : 0
bios acknowledged partial fua : 0
bios in progress read : 0
bios in progress write : 7
bios in progress empty flush : 0
bios in progress discard : 0
bios in progress flush : 0
bios in progress fua : 0
KVDO module bytes used : 24191002536
KVDO module peak bytes used : 24192038456
entries indexed : 147316793
posts found : 6104395304
posts not found : 1762327257
queries found : 0
queries not found : 0
updates found : 1015805518
updates not found : 0
entries discarded : 2590574420
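Since the question is whether the data is at risk, the error-related counters in the `vdostats -v` output above are the ones worth tracking over time. The following is a minimal sketch, assuming the `name : value` layout shown above. The names `parse_vdostats`, `health_summary` and `HEALTH_FIELDS` are hypothetical, and the choice of fields is an assumption made for illustration, not an official list. Clean counters here would not by themselves rule out a problem with the page that UBSAN flagged.

```python
# Counters from the output above that relate to errors or read-only mode.
# This selection is an assumption for illustration, not an official list.
HEALTH_FIELDS = (
    "operating mode",
    "read-only recovery count",
    "read only error count",
    "block map failed pages",
    "block map failed reads",
    "block map failed writes",
    "invalid advice PBN count",
)


def parse_vdostats(text):
    """Parse `name : value` lines from `vdostats -v` output into a dict of strings."""
    stats = {}
    for line in text.splitlines():
        # Split on the last " : " so names containing punctuation stay intact.
        name, sep, value = line.rpartition(" : ")
        if sep and value.strip():
            stats[name.strip()] = value.strip()
    return stats


def health_summary(stats):
    """Return the health-related fields whose value is not the quiet one."""
    problems = {}
    for field in HEALTH_FIELDS:
        value = stats.get(field)
        if value is None:
            continue  # field not present in this output
        quiet = "normal" if field == "operating mode" else "0"
        if value != quiet:
            problems[field] = value
    return problems
```

Applied to the output above, `health_summary` would return an empty dict, because the operating mode is `normal` and every listed counter is 0.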