What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 曜??娃??汚??梧??節わ?娃??汚??梧 10010111011010100011111100111111100010001010000100111111001111111000100110011000001111110011111110001100111001100011111100111111100100001101111110000010111011010011111110001000101000010011111100111111100010011001100000111111001111111000110011100110 976a3f3f88a13f3f89983f3f8ce63f3f90df82ed3f88a13f3f89983f3f8ce6
EUC-JP 曜??娃??汚??梧??節わ?娃??汚??梧 11001101110010110011111100111111101100001010001100111111001111111011000111111000001111110011111110111000111010000011111100111111110000001110000110100100111011110011111110110000101000110011111100111111101100011111100000111111001111111011100011101000 cdcb3f3fb0a33f3fb1f83f3fb8e83f3fc0e1a4ef3fb0a33f3fb1f83f3fb8e8
UTF-8 曜깍쉼娃띰쉴汚꾬쉥梧잌쨰節わ쉼娃띰쉴汚꾬쉥梧 111001101001101110011100111010101011100110001101111011001000100110111100111001011010100010000011111010111001110110110000111011001000100110110100111001101011000110011010111010101011111010101100111011001000100110100101111001101010001010100111111011001001111010001100111011001010100010110000111001111010111110000000111000111000001010001111111011001000100110111100111001011010100010000011111010111001110110110000111011001000100110110100111001101011000110011010111010101011111010101100111011001000100110100101111001101010001010100111 e69b9ceab98dec89bce5a883eb9db0ec89b4e6b19aeabeacec89a5e6a2a7ec9e8ceca8b0e7af80e3828fec89bce5a883eb9db0ec89b4e6b19aeabeacec89a5e6a2a7
UHC 曜깍쉼娃띰쉴汚꾬쉥梧잌쨰節わ쉼娃띰쉴汚꾬쉥梧 1110100011111000101100011110111110111101101100001110100011011111101101101110111110111101101011111110011111111101100001001110111110111101101010111110011111111100100111111110010110100100100010101110111110111101101010101110111110111101101100001110100011011111101101101110111110111101101011111110011111111101100001001110111110111101101010111110011111111100 e8f8b1efbdb0e8dfb6efbdafe7fd84efbdabe7fc9fe5a48aefbdaaefbdb0e8dfb6efbdafe7fd84efbdabe7fc

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
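
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `iso-8859-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` codecs correspond to the charsets listed above. It also assumes that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `to_bit_strings` and `convert` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win is treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as CP949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("曜").items():
        print(label, binary, hexadecimal)
```

For the single character 曜, this sketch gives `e69b9c` in UTF-8, `976a` in CP932 and `cdcb` in EUC-JP, which agree with the leading bytes of the corresponding rows above. ISO-8859-1 cannot represent the character, so it falls back to `3f`.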