What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e |
| SJIS-WIN | 襭苦─襭苦─^ | 11100110010000101000101111101010100001001001111111100110010000101000101111101010100001001001111101011110 | e6428bea849fe6428bea849f5e |
| EUC-JP | 襭苦─襭苦─^ | 11101011101000111011011011101100101010001010000111101011101000111011011011101100101010001010000101011110 | eba3b6eca8a1eba3b6eca8a15e |
| UTF-8 | 襭苦─襭苦─^ | 11101000101001011010110111101000100010111010011011100010100101001000000011101000101001011010110111101000100010111010011011100010100101001000000001011110 | e8a5ade88ba6e29480e8a5ade88ba6e294805e |
| UHC | ?苦─?苦─^ | 0011111111001101110010001010011010100001001111111100110111001000101001101010000101011110 | 3fcdc8a6a13fcdc8a6a15e |

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
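
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("苦^").items():
        print(label, bits, hex_string)
```

Because each codec library ships its own mapping tables, results for rarely used characters may differ slightly from what this page reports.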