To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 如??弛??遺? | 1001010001000000001111110011111110010010011011110011111100111111100010001110001000111111 | 94403f3f926f3f3f88e23f |
| EUC-JP | 如??弛??遺? | 1100011110100001001111110011111111000011110100000011111100111111101100001110010000111111 | c7a13f3fc3d03f3fb0e43f |
| UTF-8 | 如싲맮弛됬땟遺압 | 111001011010011010000010111011001000101110110010111010111010011110101110111001011011110010011011111010111001000010101100111010111001010110011111111010011000000110111010111011001001010110010101 | e5a682ec8bb2eba7aee5bc9beb90aceb959fe981baec9595 |
| UHC | 如싲맮弛됬땟遺압 | 11100101111111011001101011101011100100001011010111101100101011001000100111100111101101101010110111101011101101101011111011010000 | e5fd9aeb90b5ecac89e7b6adebb6bed0 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
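
The conversion shown in the results above can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the results.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("如").items():
        print(label, bits, hex_string)
```

For the single character 如, the UTF-8 row comes out as `e5a682`, which matches the first three bytes of the UTF-8 result above. Different converters may disagree on edge cases such as vendor extensions, so the output for other inputs should be checked rather than assumed.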