To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????P????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110101000000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f503f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 掩??揖??癒??P掩??揖??癒??掩??揖 10001001100001100011111100111111100101110100101100111111001111111001011011111100001111110011111101010000100010011000011000111111001111111001011101001011001111110011111110010110111111000011111100111111100010011000011000111111001111111001011101001011 89863f3f974b3f3f96fc3f3f5089863f3f974b3f3f96fc3f3f89863f3f974b
EUC-JP 掩??揖?ł癒??P掩??揖?ł癒??掩??揖 1011000111100110001111110011111111001101101011000011111110001111101010011100100011001100111111100011111100111111010100001011000111100110001111110011111111001101101011000011111110001111101010011100100011001100111111100011111100111111101100011110011000111111001111111100110110101100 b1e63f3fcdac3f8fa9c8ccfe3f3f50b1e63f3fcdac3f8fa9c8ccfe3f3fb1e63f3fcdac
UTF-8 掩뽰룊揖욜ł癒뀁뒫P掩뽰룊揖욜ł癒뀁뒫掩뽰룊揖 1110011010001110101010011110101110111101101100001110101110100011100010101110011010001111100101101110110010011010100111001100010110000010111001111001100110010010111010111000000010000001111010111001001010101011010100001110011010001110101010011110101110111101101100001110101110100011100010101110011010001111100101101110110010011010100111001100010110000010111001111001100110010010111010111000000010000001111010111001001010101011111001101000111010101001111010111011110110110000111010111010001110001010111001101000111110010110 e68ea9ebbdb0eba38ae68f96ec9a9cc582e79992eb8081eb92ab50e68ea9ebbdb0eba38ae68f96ec9a9cc582e79992eb8081eb92abe68ea9ebbdb0eba38ae68f96
UHC 掩뽰룊揖욜ł癒뀁뒫P掩뽰룊揖욜ł癒뀁뒫掩뽰룊揖 111001011111001110010110111011001000111110001001111010111110011110111111111001111010100110101001111010111010100010110010111011001000101010100101010100001110010111110011100101101110110010001111100010011110101111100111101111111110011110101001101010011110101110101000101100101110110010001010101001011110010111110011100101101110110010001111100010011110101111100111 e5f396ec8f89ebe7bfe7a9a9eba8b2ec8aa550e5f396ec8f89ebe7bfe7a9a9eba8b2ec8aa5e5f396ec8f89ebe7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)