#1905 BLOC Trial-3: Field Return: flash corruption - OpenFirmware complaining of 'unknown node type 2006'.
Zarro Boogs per Child
bugtracker at laptop.org
Fri Aug 3 04:19:33 EDT 2007
#1905: Field Return: flash corruption - OpenFirmware complaining of 'unknown node
type 2006'.
----------------------+-----------------------------------------------------
Reporter: dwmw2 | Owner: dwmw2
Type: defect | Status: new
Priority: blocker | Milestone: Trial-3
Component: kernel | Version:
Resolution: | Keywords:
Verified: 0 |
----------------------+-----------------------------------------------------
Comment (by dwmw2):
Replying to [comment:33 Luna]:
> Problem 1: ( Tested on the same CL1 )
>
> If I execute command " nandwrite -p /dev/mtd" in OS, the CL1 will hang
up in the process of nand write.
When we say 'hang up', we normally mean that the computer stops
responding. Your problem1-1.JPG doesn't show that -- it just shows that
the nandwrite program encountered an error and stopped. The kernel
returned an error when asked to write a block of data to the flash. If you
type 'dmesg' at this point, you might see warning messages from the
kernel, describing the failure. Do you have the
CONFIG_MTD_NAND_VERIFY_WRITE configuration option enabled?
If this persists, we can make the kernel a little more verbose about such
errors. Perhaps the nandwrite program ought to be marking the offending
block as 'bad' when the write fails.
> Moreover, if I execute "copy-nand" under OFW, the nandwrite action
keeps going even there is an error message (Refer to the
attached:problem1-1, problem1-2.) and the machine can go into sugar
successfully.
> But it will show a message: "Unsupported node type 2006" after reboot
several times.
That is a misleading error message. It _probably_ means you have similar
corruption problems, but it's not clear. Can you verify whether the ECC
data are correct for the affected blocks?
> I think the bad blocks are not marked well so it gets chance to write
into the blocks which
> have problem
Maybe. But if the ECC data on the flash match the data, that means that
the corruption did not happen on the flash -- it happened in or before the
CAFÉ chip. And if not, then you're describing a different problem which
should be in a different trac report.
Please could you check the status of the ECC on the affected blocks, and
the nature of the corruption (you could just let me have a copy of the
image and the kernel's output when mounting it and I'll do the latter).
--
Ticket URL: <https://dev.laptop.org/ticket/1905#comment:35>
One Laptop Per Child <http://laptop.org/>
More information about the Bugs
mailing list