To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???^WD???^WD^ 00111111001111110011111101011110010101110100010000111111001111110011111101011110010101110100010001011110 3f3f3f5e57443f3f3f5e57445e
SJIS-WIN 梧??^WD梧??^WD^ 100011001110011000111111001111110101111001010111010001001000110011100110001111110011111101011110010101110100010001011110 8ce63f3f5e57448ce63f3f5e57445e
EUC-JP 梧??^WD梧??^WD^ 101110001110100000111111001111110101111001010111010001001011100011101000001111110011111101011110010101110100010001011110 b8e83f3f5e5744b8e83f3f5e57445e
UTF-8 梧귨쉠^WD梧귨쉠^WD^ 11100110101000101010011111101010101101111010100011101100100010011010000001011110010101110100010011100110101000101010011111101010101101111010100011101100100010011010000001011110010101110100010001011110 e6a2a7eab7a8ec89a05e5744e6a2a7eab7a8ec89a05e57445e
UHC 梧귨쉠^WD梧귨쉠^WD^ 11100111111111001000001011101111101111011010101001011110010101110100010011100111111111001000001011101111101111011010101001011110010101110100010001011110 e7fc82efbdaa5e5744e7fc82efbdaa5e57445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)