Filtering is one of the best-known low-level image processing operations. Most filtering procedures do not fully use the capacity of the processor's arithmetic logic unit (ALU), because each pixel occupies only a fraction of the ALU's word width. The authors propose a packed mean filtering scheme that packs several pixels into a single machine word and processes them simultaneously. Experiments were run on three different machines to evaluate the performance of the scheme. The results show that the scheme increases processing speed on all three machines.
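The packing idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the word width, lane layout, or filter window, so everything below is an assumption chosen for illustration. The sketch assumes 8-bit pixels, a 64-bit word split into four 16-bit lanes, and a mean over four rows (a 4x1 vertical window), so that a lane sum of at most 4 * 255 = 1020 never carries into the neighbouring lane. The names `pack`, `unpack`, `packed_mean4`, and `mean_of_four_rows` are hypothetical.

```python
# Illustrative sketch only: lane layout and window size are assumptions,
# not details taken from the paper.
LANES = 4                        # pixels per packed word (assumed)
LANE_BITS = 16                   # bits reserved per pixel (8 data + 8 headroom)
LANE_MASK = 0x00FF00FF00FF00FF   # keeps the low 8 bits of every 16-bit lane


def pack(pixels):
    """Pack up to LANES 8-bit pixels into one integer, one pixel per lane."""
    word = 0
    for i, p in enumerate(pixels):
        word |= (p & 0xFF) << (i * LANE_BITS)
    return word


def unpack(word):
    """Extract the LANES 8-bit pixels stored in a packed word."""
    return [(word >> (i * LANE_BITS)) & 0xFF for i in range(LANES)]


def packed_mean4(a, b, c, d):
    """Per-lane floor((a + b + c + d) / 4) on four packed words.

    Each lane sum is at most 1020, which fits in 16 bits, so a single
    integer addition sums all four lanes without cross-lane carries.
    The shift divides every lane by 4; the mask discards the two bits
    that spill in from the lane above.
    """
    total = a + b + c + d
    return (total >> 2) & LANE_MASK


def mean_of_four_rows(rows):
    """Column-wise mean of four equal-length rows, LANES columns at a time."""
    r0, r1, r2, r3 = rows
    out = []
    for i in range(0, len(r0), LANES):
        words = [pack(r[i:i + LANES]) for r in (r0, r1, r2, r3)]
        out.extend(unpack(packed_mean4(*words)))
    return out[:len(r0)]  # drop padding lanes from a short final chunk


rows = [
    [10, 20, 30, 255],
    [10, 21, 30, 255],
    [10, 22, 31, 255],
    [10, 23, 33, 255],
]
print(mean_of_four_rows(rows))
```

In this sketch one addition and one shift replace four of each. Python integers are used only to keep the example short and checkable; the speedups reported in the abstract would depend on native fixed-width arithmetic, where the same masks and shifts apply to a 64-bit register.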