Solution

Aerospike_Knowledge · March 7, 2020, 1:05am

The Aerospike Knowledge Base has moved to https://support.aerospike.com. Content on https://discuss.aerospike.com is being migrated to either https://support.aerospike.com or https://docs.aerospike.com. Maintenance on articles stored in this repository ceased on December 31st 2022 and this article may be stale. If you have any questions, please do not hesitate to raise a case via https://support.aerospike.com.

Solution: Migration stalls with `record too small`

Problem Description

When a cluster is migrating the migration does not complete and nodes with incoming migrations report the following error in the logs.

Mar 05 2020 01:41:07 GMT: WARNING (flat): (flat.c:135) record too small 0
Mar 05 2020 01:41:07 GMT: WARNING (migrate): (migrate.c:1398) handle insert: got bad record

Explanation

This error will occur when there is a node in the cluster with a bad disk. The node is aware that it needs to send out a record but due to the disk error the record is of 0 size. The error will occur on the node where the migration is inbound as it cannot write an inbound record of 0 size. On checking the status of migrations it is likely that a single node will be the source of the problematic partitions.

As the issue is due to a problem with node hardware the quickest solution to allow migrations to complete would be to shutdown the problem source node. The problematic node will almost certainly be showing disk errors in dmesg which can be run manually or as part of the asadm -e collectinfo command. The dmesg output would look similar to the output below:

[11055874.801271] Buffer I/O error on dev nvme0n2, logical block 38509628, async page read
[11055888.682385] print_req_error: critical medium error, dev nvme0n2, sector 308077024

Shutting down the problem node would cause extra migration but should not have any other negative effect.

Keywords

MIGRATION STALLED DISK ERROR NODE

Timestamp

March 2020

Topic		Replies	Views
Strange behavior during migration Migration	6	3333	June 18, 2015
Losing records after node fails Configuration	3	1439	May 24, 2015
Restore cluster trouble	2	1294	May 27, 2016
Unused data on the disk - Error Code 8: Server memory error Configuration	11	7855	June 29, 2017
Unexpected partition migration state at source	1	1358	June 28, 2016

Solution - Migration stalls with `record too small`

Solution: Migration stalls with record too small