Management Education Courses
HOME | NEWS | CONTACT US | WEB SEARCH | SITE MAP

 

   Noyes & Associates, Ltd.Nationwide Health Care Management Education Programs

MANAGEMENT EDUCATION CONSULTING SERVICES SEARCH SERVICES CANDIDATE RECRUITMENT ABOUT US  
Noyes and Associates, Ltd. for National Health Care Management Education Programs Health Care Management Education

Mentoring Program

Coaching Program

Articles of Interest

News & Ideas

Helpful Links

  

 

  
Other Articles of Interest:

JONA - Journal of Nursing Administration

Inter-Rater Reliability:
Regaining Credibility with your Staff and Financial Officer While Meeting JCAHO Standards

Betty J. Noyes, RN, MA

Journal of Nursing Administration, Vol. 24, No. 9 (September), 1994, pp. 7-8

---- Line ----

The Joint Commission on the Accreditation of Healthcare Organizations (JCAHO), in Standard NC 3.4.2., currently requires that our method for determining staffing has both face validity and inter-rater reliability. Furthermore, in LD.1.3, they require a plan to provide services in response to identified patient needs. In LD.2.1.4, the standards require that the leaders recommend a sufficient number of qualified persons consistent with the assessed needs of the population served and planned care/services. Every area must have process to determine the level of care required by patients being served in that specific area. The burden of proof that the required number and mix of staff members are provided will fall to patient care leadership.

In the past, creating a reasonable scientific basis for staffing methods brought "efficiency experts," later called "management engineers," to help the nursing department develop work load measurement instruments. These instruments helped nursing administrators enhance and preserve the credibility of their management decisions. This objective instrument theoretically would eliminate any biases in the staffing cycle.

Many came to believe that the indicator-based patient classification systems (PCS) was the best because it had the most discriminating design. It identified tasks that were labor sensitive and differentiated the nursing work load activity from the acuity of the patient illness. The PCS was believed to be reliable, objective, easy to understand, and controllable. Proponents also believed that PCS could be used to "flex" staff members accurately on a shift-by-shift basis.

Many chief nurse executives (CNEs) quickly learned that PCS was not a panacea for accurate staffing. The credibility of PCS slipped because we (CNEs) and the nursing staff learned that point values could be "tweaked" to reduce or expand staffing levels requirements. This was true for the projected numbers of nurses required to meet the demand levels and in skill mix. Skill mix was decided on arbitrarily. Patients were getting "more acute" because of shortening length of stay, and work loads were changing. Financial constraints were an ever-increasing reality.

The audiences of the PCS data - chief executives, chief financial officers, human resource directors, and nursing management and staff - were disappointed. The audiences lost interest in hearing us defend budgets that were based on an unreliable instrument. Validity was destroyed every time a new program or "restructuring" occurred. Many organizations never made the effort to re-evaluate their PCS for new programs. Frequently, there was no money to rehire management engineers to collect new time/work load data.

Many CNEs questioned the effort spent reconstructing such a fallible "nursing" system. That has additional validity today because many organizations which have decentralized ancillary support services and a multiskilled work force do not have a PCS that measures the new task inventories. These redesigned care delivery systems require new methods to determine adequately the required staffing patterns. Decisions need to be made in instrument design and measurement methodology. This is a complex task because ancillary support services, such as physical therapy, respiratory therapy, phlebotomy, and EKG, have not had time/work load analysis done before, either in isolation or as part of the work load of a new, multiskilled, cross-trained worker.

Inter-rater reliability and face validity must be documented for which ever system is used to determine staffing requirements for the model of care in use. Validity ensures that the instrument measures the concept and the properties it is intended to measure. Face validity requires that the system appears to represent its intended purpose. Specifically, does it accurately reflect care requirements of patients in accordance with the practice patterns?

Reliability is concerned with consistency and repeatability. It questions whether the system produces consistent results when the measure is repeated on the same patient by different rates. Inter-rater reliability (IRR) is the measurement of two or more raters that classify the same patient. Inter-rater reliability does not measure the validity of a system. A system can be invalid and still have high IRR scores. These two diagnostic tools tell you only whether your system is healthy or moribund. Acceptable IRR must be achieved for each patient care division, unit, and type, and each frequently used indicator/criteria or prototype description. Inter-rater reliability goals are statistically satisfactory when IRR for individual geographic defined units or aggregated patient groups is in the 90 to 95 percentile range. Each indicator should have at least 80% agreement.

Sample size is another imperative. Each nurse involved in measuring IRR should obtain a 90% IRR score on a minimum of eight patients per quarter, representing a diversity of patient types. Each unit should measure a minimum of 15 to 20% of its unit census, never less than five patients per quarter. The final composite report needs to be for the entire patient care department.

When evaluating your PCS, it is helpful to measure IRR and face validity on each indicator or prototype description and for each participating staff nurse. This frequently will illuminate the source of any discrepancy and reason for low total IRR scores.

Approaches to measure IRR, although seemingly simplistic, are very labor intensive for a busy patient administration/staffing office. The importance of both measures is well worth the effort.

The approach to IRR is to designate monitors. These monitors must be selected carefully for their objectivity and clinical knowledge, then educated about the PCS. Monitoring staff members should be trained so that they can achieve 95% IRR scores among all patient types/classes before the IRR study begins.

The scheduling of the IRR study also requires orchestration. The monitor should classify the patient within 2 hours of the staff/unit level personnel to minimize the possibility of drastic shifts in the patient condition. Should a drastic shift occur within 2 hours, that patient's classification should be discarded. You are trying to measure IRR, not the differences caused by patient changes in condition.

The amount of time required for tabulation of these results is also significant. Many data points and correlations are required.

Inter-rater reliability results are important when the face validity discussions begin. Your experts in face validity are your staff, middle manager, and statistical data derived from other sources, including your IRR study. Face validity questions whether your instrument appears reasonable, complete, and representative of the nursing care standards and practices of your institution. Face validity can be achieved if clinicians and management periodically review all the components of the staffing system. They need to disassemble the parts and evaluate and discuss the definitions that are critical to any PCS validity, the methodology, and the IRR scores for each patient unit designation and each patient type. They must review that the used PCS instrument is reflecting current safe practice of patient care, that the current system responds to changes in the patient condition/work load, and that the resulting staffing recommendations appear reasonable and match patient care requirements and budgetary reality.

Results of these discussions frequently result in instrument and method redesign. Recommendations then are made by this face validity task force to patient care/hospital management for executive decision and ultimately, are included in the budgetary planning and scheduling/staffing guidelines.

The JCAHO requirement has given management the opportunity to revisit the validity of our PCS as a system that drives the fundamental decisions about the use of employees. It is our opportunity to reassure our current staff that the administration is accurately defining their present work load at the bedside. Our PCS must include all patient care providers, including a cross-trained, multiskilled worker as part of the patient care delivery model. No PCS should take precedence over on-site crisis management decisions regarding the need for staffing on a given unit at a given time. A PCS with good face validity and IRR is the first step in developing harmony and balance of human resource supply with the patient care work load demands of the bedside provider staff.


Reference

  1. Joint Commission on the Accreditation of Healthcare Organizations. 1994 Accreditation Manual for Hospitals. Volume I Standards. Oakbrook Terrace, IL: Joint Commission on the Accreditation of Healthcare Organizations; 1993.