To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???g?純??齬??溢??恂k?堊??? 111000011001111100111111001111110011111110000010100001110011111110001111100000110011111100111111111010101001011100111111001111111000100011101100001111110011111110011100100101101000001010001011001111111001101010111111001111110011111100111111 e19f3f3f3f82873f8f833f3fea973f3f88ec3f3f9c96828b3f9abf3f3f3f
EUC-JP 癲???g?純??齬??溢??恂k?堊??? 111000101010000100111111001111110011111110100011111001110011111110111101111000110011111100111111111100111111011100111111001111111011000011101110001111110011111111010111111101101010001111101011001111111101010011000001001111110011111100111111 e2a13f3f3fa3e73fbde33f3ff3f73f3fb0ee3f3fd7f6a3eb3fd4c13f3f3f
UTF-8 癲얘퀗璘g몭純놁쵅齬잕퉫溢ㅿ쫳恂k걠堊묎만流 111001111001100110110010111011001001011010011000111011011000000010010111111011111010011110101111111011111011110110000111111010111010101010101101111001111011010010010100111010111000011010000001111011001011010110000101111010011011110110101100111011001001111010010101111011011000100110101011111001101011101010100010111000111000010110111111111011001010101110110011111001101000000110000010111011111011110110001011111010101011000110100000111001011010000010001010111010111010110010001110111010111010011110001100111011111010011110001010 e799b2ec9698ed8097efa7afefbd87ebaaade7b494eb8681ecb585e9bdacec9e95ed89abe6baa2e385bfecabb3e68182efbd8beab1a0e5a08aebac8eeba78cefa78a
UHC 癲얘퀗璘g몭純놁쵅齬잕퉫溢ㅿ쫳恂k걠堊묎만流 1110111110100110101111101110101010110011100011001110110011011110101000111110011110010001100101111110001011101101100001101110110010101100100001111110010111100001100111111110101010111001100000111110110011101110101001001110111110100110100010111110001011100001101000111110101110000001100010011110010010111110100100011110101010111000101110001110101011111100 efa6beeab38cecdea3e79197e2ed86ecac87e5e19feab983eceea4efa68be2e1a3eb8189e4be91eab8b8eafc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)