Ceph cluster OSD storage disk: Input/output error

robin posted on 2019-2-2 09:51:32

　　Description:

　　A write error caused by a failed disk backing an OSD in a Ceph cluster.
　　Log output:
　　2017-12-13 03:40:38.596764 7f5e32df2700 -1 filestore(/var/lib/ceph/osd/ceph-44) FileStore::_do_copy_range: write error at 1118208~-5, (5) Input/output error
　　os/filestore/FileStore.cc: In function 'int FileStore::_do_copy_range(int, int, uint64_t, uint64_t, uint64_t, bool)' thread 7f5e32df2700 time 2017-12-13 03:40:38.596798
　　os/filestore/FileStore.cc: 3628: FAILED assert(pos == end)
　　ceph version 10.2.7 (50e863e0f4bc8f4b9e31156de690d765af245185)
　　1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b)
　　2: (FileStore::_do_copy_range(int, int, unsigned long, unsigned long, unsigned long, bool)+0x18ec)
　　3: (GenericFileStoreBackend::clone_range(int, int, unsigned long, unsigned long, unsigned long)+0x7b)
　　4: (FileStore::_do_clone_range(int, int, unsigned long, unsigned long, unsigned long)+0x80)
　　5: (FileStore::_clone_range(coll_t const&, ghobject_t const&, ghobject_t const&, unsigned long, unsigned long, unsigned long, SequencerPosition const&)+0x1a1)
　　6: (FileStore::_do_transaction(ObjectStore::Transaction&, unsigned long, int, ThreadPool::TPHandle*)+0x42ca)
　　7: (FileStore::_do_transactions(std::vector&, unsigned long, ThreadPool::TPHandle*)+0x3b)
　　8: (FileStore::_do_op(FileStore::OpSequencer*, ThreadPool::TPHandle&)+0x2b5)
　　9: (ThreadPool::worker(ThreadPool::WorkThread*)+0xa6e)
　　10: (ThreadPool::WorkThread::entry()+0x10)
　　11: (()+0x8184)
　　12: (clone()+0x6d)
　　NOTE: a copy of the executable, or `objdump -rdS ` is needed to interpret this.
　　dmesg output:
　　[ 8674.029792] sd 0:0:2:0: tag#21 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
　　[ 8674.029800] sd 0:0:2:0: tag#21 Sense Key : Medium Error
　　[ 8674.029802] sd 0:0:2:0: tag#21 Add. Sense: Unrecovered read error
　　[ 8674.029804] sd 0:0:2:0: tag#21 CDB: Read(16) 88 00 00 00 00 00 02 8c 49 a8 00 00 01 00 00 00
　　[ 8674.029806] blk_update_request: critical medium error, dev sdc, sector 42748543
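　　When the kernel log is long, lines like the ones above can be picked out automatically. The following is a minimal sketch, assuming the kernel prints medium errors in exactly the `blk_update_request: critical medium error, dev ..., sector ...` form shown here (the wording differs between kernel versions). The function names `find_medium_errors` and `errors_per_device` are made up for illustration.

```python
import re
from collections import Counter

# Matches kernel lines of the form seen in this post, e.g.
# "blk_update_request: critical medium error, dev sdc, sector 42748543".
# Assumption: other kernel versions may word this message differently.
MEDIUM_ERROR = re.compile(
    r"blk_update_request: critical medium error, "
    r"dev (?P<dev>\w+), sector (?P<sector>\d+)"
)


def find_medium_errors(lines):
    """Return a list of (device, sector) tuples, one per matching line."""
    hits = []
    for line in lines:
        match = MEDIUM_ERROR.search(line)
        if match:
            hits.append((match.group("dev"), int(match.group("sector"))))
    return hits


def errors_per_device(lines):
    """Count medium errors per block device name."""
    return Counter(dev for dev, _sector in find_medium_errors(lines))


# Two of the dmesg lines quoted in this post, used as sample input.
sample = [
    "[ 8674.029800] sd 0:0:2:0: tag#21 Sense Key : Medium Error",
    "[ 8674.029806] blk_update_request: critical medium error, dev sdc, sector 42748543",
]

for dev, count in errors_per_device(sample).items():
    print(f"/dev/{dev}: {count} medium error(s)")
```

　　To use it on a live host, feed it the lines of saved `dmesg` output instead of `sample`. A device that keeps accumulating medium errors is a strong candidate for replacement.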
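　　Solution:
　　The dmesg output shows an unrecovered read error (medium error) on sdc, so the cause is judged to be a damaged disk behind this OSD (osd.44, per the filestore path in the log). Replacing that OSD's storage disk is recommended.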
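　　Before the disk is pulled, the OSD normally has to be taken out of the cluster. The sketch below is not from the original post: it extracts the OSD id from the `filestore(/var/lib/ceph/osd/ceph-44)` part of the log line and prints the manual removal commands from the Ceph documentation for releases of this era. It only builds strings and runs nothing. The helper names `osd_id_from_log` and `removal_commands` are hypothetical, and stopping the daemon is left as a comment because that step depends on the init system.

```python
import re

# Assumption: the OSD data directory follows the default
# /var/lib/ceph/osd/ceph-<id> layout seen in the log above.
OSD_PATH = re.compile(r"filestore\(/var/lib/ceph/osd/ceph-(\d+)\)")


def osd_id_from_log(line):
    """Return the OSD id found in a FileStore log line, or None."""
    match = OSD_PATH.search(line)
    return int(match.group(1)) if match else None


def removal_commands(osd_id):
    """Build the manual OSD removal commands as strings (nothing is executed)."""
    return [
        f"ceph osd out {osd_id}",
        # Wait for rebalancing to finish, then stop the ceph-osd daemon
        # for this id on its host (init-system specific) before continuing.
        f"ceph osd crush remove osd.{osd_id}",
        f"ceph auth del osd.{osd_id}",
        f"ceph osd rm {osd_id}",
    ]


log_line = (
    "2017-12-13 03:40:38.596764 7f5e32df2700 -1 "
    "filestore(/var/lib/ceph/osd/ceph-44) FileStore::_do_copy_range: "
    "write error at 1118208~-5, (5) Input/output error"
)

osd_id = osd_id_from_log(log_line)
if osd_id is not None:
    for command in removal_commands(osd_id):
        print(command)
```

　　Printing the commands rather than running them keeps the operator in control: each step should be checked against `ceph -s` before moving on, and the procedure should be confirmed against the documentation for the Ceph version actually deployed.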
