To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN ??楔堯}??楔堯{^ 001111110011111110011110101101101110101010011111011111010011111100111111100111101011011011101010100111110111101101011110 3f3f9eb6ea9f7d3f3f9eb6ea9f7b5e
EUC-JP ??楔堯}??楔堯{^ 001111110011111111011100101110001111010010100001011111010011111100111111110111001011100011110100101000010111101101011110 3f3fdcb8f4a17d3f3fdcb8f4a17b5e
UTF-8 숥숸楔堯}숥숸楔堯{^ 111011001000100010100101111011001000100010111000111001101010010110010100111001011010000010101111011111011110110010001000101001011110110010001000101110001110011010100101100101001110010110100000101011110111101101011110 ec88a5ec88b8e6a594e5a0af7dec88a5ec88b8e6a594e5a0af7b5e
UHC 숥숸楔堯}숥숸楔堯{^ 10011010010000101001101001001101111000001101101111101000111010110111110110011010010000101001101001001101111000001101101111101000111010110111101101011110 9a429a4de0dbe8eb7d9a429a4de0dbe8eb7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)