To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????h 00111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f68
SJIS-WIN 蒻の??彦?8誼?h 111001001110100010000010110011000011111100111111100101010100011000111111100000100101011110001011011000100011111101101000 e4e882cc3f3f95463f82578b623f68
EUC-JP 蒻の??彦?8誼?h 111010001110101010100100110011100011111100111111110010011010011100111111101000111011100010110101110000110011111101101000 e8eaa4ce3f3fc9a73fa3b8b5c33f68
UTF-8 蒻の살졎彦쀫8誼빤h 11101000100100101011101111100011100000011010111011101100100000101011010011101100101000011000111011100101101111011010011011101100100000001010101111101111101111001001100011101000101010101011110011101011101110011010010001101000 e892bbe381aeec82b4eca18ee5bda6ec80abefbc98e8aabcebb9a468
UHC 蒻の살졎彦쀫8誼빤h 11100101101101101010101011001110101110111110110010100000101110111110010111101001100101111110101110100011101110001110101111111110101110101111111001101000 e5b6aacebbeca0bbe5e997eba3b8ebfebafe68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)