To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 腫?億?樹爭?以 10001110111011100011111110001001101011010011111110001110111101111110000010100101001111111000100011001000 8eee3f89ad3f8ef7e0a53f88c8
EUC-JP 腫?億?樹爭?以 10111100111100000011111110110010101011110011111110111100111110011110000010100111001111111011000011001010 bcf03fb2af3fbcf9e0a73fb0ca
UTF-8 腫렓億겻樹爭렗以 111010001000010110101011111010111010000010010011111001011000010010000100111010101011001010111011111001101010100010111001111001111000100010101101111010111010000010010111111001001011101110100101 e885abeba093e58484eab2bbe6a8b9e788adeba097e4bba5
UHC 腫렓億겻樹爭렗以 11110000111111101000111010101000111001011110001010110000111001001110001010100111111011101011001110001110101011001110110010100100 f0fe8ea8e5e2b0e4e2a7eeb38eaceca4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)