To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 掩??揖??醫??掩??揖??醫??畏??泣 100010011000011000111111001111111001011101001011001111110011111111100111110011100011111100111111100010011000011000111111001111111001011101001011001111110011111111100111110011100011111100111111100010001101100000111111001111111000101110000011 89863f3f974b3f3fe7ce3f3f89863f3f974b3f3fe7ce3f3f88d83f3f8b83
EUC-JP 掩??揖??醫??掩??揖??醫??畏??泣 101100011110011000111111001111111100110110101100001111110011111111101110110100000011111100111111101100011110011000111111001111111100110110101100001111110011111111101110110100000011111100111111101100001101101000111111001111111011010111100011 b1e63f3fcdac3f3feed03f3fb1e63f3fcdac3f3feed03f3fb0da3f3fb5e3
UTF-8 掩뽰룊揖욘릸醫묒뒃掩뽰룊揖욘릸醫묒뒃畏븐옚泣 111001101000111010101001111010111011110110110000111010111010001110001010111001101000111110010110111011001001101010011000111010111010011010111000111010011000011010101011111010111010110010010010111010111001001010000011111001101000111010101001111010111011110110110000111010111010001110001010111001101000111110010110111011001001101010011000111010111010011010111000111010011000011010101011111010111010110010010010111010111001001010000011111001111001010110001111111010111011100010010000111011001001100010011010111001101011001110100011 e68ea9ebbdb0eba38ae68f96ec9a98eba6b8e986abebac92eb9283e68ea9ebbdb0eba38ae68f96ec9a98eba6b8e986abebac92eb9283e7958febb890ec989ae6b3a3
UHC 掩뽰룊揖욘릸醫묒뒃掩뽰룊揖욘릸醫묒뒃畏븐옚泣 1110010111110011100101101110110010001111100010011110101111100111101111111110011010010000100101101110110010100010100100011110110010001010100000011110010111110011100101101110110010001111100010011110101111100111101111111110011010010000100101101110110010100010100100011110110010001010100000011110100011100110101110101110110010011110100111101110101111101000 e5f396ec8f89ebe7bfe69096eca291ec8a81e5f396ec8f89ebe7bfe69096eca291ec8a81e8e6baec9e9eebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)