Before you begin troubleshooting OSDs, it's a good idea to validate your cluster and public network between the Ceph nodes, as OSD down issues can usually be caused by communication issues between peer OSDs and MONs:
- How to handle a full OSD flag on your Ceph cluster:
- Running a ceph health detail will provide you with the OSD ID that is currently flagged as full by the cluster:
# ceph health detail
-
- A full flag is indicated by the Ceph config option mon_osd_full_ratio; by default, this is 95%. Note that this config setting applies only to a situation when the flag gets set on the cluster and does not apply to the actual PGs.
- Ceph will prevent client I/O from writing to a PG that resides on an OSD that has the full ...