To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚴??意??源?6勇?????恂??伍 1001101010001110001111110011111110001000110100110011111100111111100011001011100100111111100000100101010110010111010001010011111100111111001111110011111100111111100111001001011000111111001111111000110011011110 9a8e3f3f88d33f3f8cb93f825597453f3f3f3f3f9c963f3f8cde
EUC-JP 嚴??意??源?6勇??佾??恂??伍 11010011111011100011111100111111101100001101010100111111001111111011100010111011001111111010001110110110110011011010011000111111001111111000111110110000111110110011111100111111110101111111011000111111001111111011100011100000 d3ee3f3fb0d53f3fb8bb3fa3b6cda63f3f8fb0fb3f3fd7f63f3fb8e0
UTF-8 嚴얠슦意뺞깱源놁6勇싳뮄佾볢굢恂ⓦ걶伍 111001011001101010110100111011001001011010100000111011001000101010100110111001101000010010001111111010111011101010011110111010101011100110110001111001101011101010010000111010111000011010000001111011111011110010010110111001011000101110000111111011001000101110110011111010111010111010000100111001001011110110111110111010111011001110100010111010101011010110100010111001101000000110000010111000101001001110100110111010101011000110110110111001001011110010001101 e59ab4ec96a0ec8aa6e6848febba9eeab9b1e6ba90eb8681efbc96e58b87ec8bb3ebae84e4bdbeebb3a2eab5a2e68182e293a6eab1b6e4bc8d
UHC 嚴얠슦意뺞깱源놁6勇싳뮄佾볢굢恂ⓦ걶伍 1110010111110001101111101110110010011010101100001110101111110010100101011110011010000011100111111110101010111001100001101110110010100011101101101110100110111000100110101110110010010010100100111110110011101011100100111110100010000010100010011110001011100001101010001110001110000001100111001110011111101010 e5f1beec9ab0ebf295e6839feab986eca3b6e9b89aec9293eceb93e88289e2e1a8e3819ce7ea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)