What bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???姨??循??鴉??援??油??艾?? 001111110011111100111111100110110100100000111111001111111000111101111010001111110011111111101001111010110011111100111111100010011000011100111111001111111001011011111011001111110011111111100100100010000011111100111111 3f3f3f9b483f3f8f7a3f3fe9eb3f3f89873f3f96fb3f3fe4883f3f
EUC-JP ???姨??循??鴉??援??油??艾?? 001111110011111100111111110101011010100100111111001111111011110111011011001111110011111111110010111011010011111100111111101100011110011100111111001111111100110011111101001111110011111111100111111010000011111100111111 3f3f3fd5a93f3fbddb3f3ff2ed3f3fb1e73f3fccfd3f3fe7e83f3f
UTF-8 列룸씛姨껃쑵循뗰폋鴉딆떏援쒐뙴油밸윹艾쎈돭 111011111010011010011100111010111010001110111000111011001001010010011011111001011010011110101000111010101011101110000011111011001001000110110101111001011011111010101010111010111001011110110000111011011000111110001011111010011011010010001001111010111001010010000110111010111001011010001111111001101000111110110100111011001001001010010000111010111001100110110100111001101011001010111001111010111011000010111000111011001001110010111001111010001000100110111110111011001000111010001000111010111000111110101101 efa69ceba3b8ec949be5a7a8eabb83ec91b5e5beaaeb97b0ed8f8be9b489eb9486eb968fe68fb4ec9290eb99b4e6b2b9ebb0b8ec9cb9e889beec8e88eb8fad
UHC 列룸씛姨껃쑵循뗰폋鴉딆떏援쒐뙴油밸윹艾쎈돭 111001101110101010110111111010111001110110110000111011001010100110000011111001011011111010101010111000101110000010001011111011111011110010010110111001001011110010001010111011001000101110100101111010101011010110011100111001111000110010110111111010101111101010111001111010111001111110110011111001001111010110111101111010111000100110110000 e6eab7eb9db0eca983e5beaae2e08befbc96e4bc8aec8ba5eab59ce78cb7eafab9eb9fb3e4f5bdeb89b0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hex 3f) in the table.
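
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch: the page's actual implementation is not shown, the mapping from the page's charset labels to Python codec names (`CODECS`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Unencodable characters are replaced with "?" to mirror the 3f bytes seen above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return raw.decode(codec), bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in encode_table("姨").items():
        print(label, shown, bits, hexstr)
```

For the single character 姨, this yields `e5a7a8` for UTF-8, `9b48` for SJIS-WIN, and `d5a9` for EUC-JP, which matches the corresponding byte sequences in the table.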