To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???姨??幽??鶯щ???????艾??竊 00111111001111110011111110011011010010000011111100111111100101110100100000111111001111111110100111110010100001001000101100111111001111110011111100111111001111110011111100111111111001001000100000111111001111111110001010000110 3f3f3f9b483f3f97483f3fe9f2848b3f3f3f3f3f3f3fe4883f3fe286
EUC-JP ???姨??幽??鶯щ?沅??洧??艾??竊 0011111100111111001111111101010110101001001111110011111111001101101010010011111100111111111100101111010010100111111010110011111110001111110001101110100100111111001111111000111111000111101101000011111100111111111001111110100000111111001111111110001111100110 3f3f3fd5a93f3fcda93f3ff2f4a7eb3f8fc6e93f3f8fc7b43f3fe7e83f3fe3e6
UTF-8 列룸씛姨껃뿽幽덌폊鶯щ벥沅붺뙴洧곗쯿艾쎈뜄竊 1110111110100110100111001110101110100011101110001110110010010100100110111110010110100111101010001110101010111011100000111110101110111111101111011110010110111001101111011110101110001101100011001110110110001111100010101110100110110110101011111101000110001001111010111011001010100101111001101011001010000101111010111011011010111010111010111001100110110100111001101011010010100111111010101011001110010111111011001010111110111111111010001000100110111110111011001000111010001000111010111001110010000100111001111010101110001010 efa69ceba3b8ec949be5a7a8eabb83ebbfbde5b9bdeb8d8ced8f8ae9b6afd189ebb2a5e6b285ebb6baeb99b4e6b4a7eab397ecafbfe889beec8e88eb9c84e7ab8a
UHC 列룸씛姨껃뿽幽덌폊鶯щ벥沅붺뙴洧곗쯿艾쎈뜄竊 1110011011101010101101111110101110011101101100001110110010101001100000111110010110010111101111011110101011101011100010001110111110111100100101011110010110100011101011001110101110010011101111011110101010110110100101001110011110001100101101111110101011111011101100001110110010101001100000111110010011110101101111011110101110001101100010001110111110111100 e6eab7eb9db0eca983e597bdeaeb88efbc95e5a3aceb93bdeab694e78cb7eafbb0eca983e4f5bdeb8d88efbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)