Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 蜈?????汚ц?n}蜈?????汚ц?n{^ 1110010110000101001111110011111100111111001111110011111110001001100110001000010010001000001111110110111001111101111001011000010100111111001111110011111100111111001111111000100110011000100001001000100000111111011011100111101101011110 e5853f3f3f3f3f899884883f6e7de5853f3f3f3f3f899884883f6e7b5e
EUC-JP 蜈?????汚ц?n}蜈?????汚ц?n{^ 1110100111100101001111110011111100111111001111110011111110110001111110001010011111101000001111110110111001111101111010011110010100111111001111110011111100111111001111111011000111111000101001111110100000111111011011100111101101011110 e9e53f3f3f3f3fb1f8a7e83f6e7de9e53f3f3f3f3fb1f8a7e83f6e7b5e
UTF-8 蜈욤닋隸뚧옄汚ц닋n}蜈욤닋隸뚧옄汚ц닋n{^ 111010001001110010001000111011001001101010100100111010111000101110001011111011111010011010111000111010111001101010100111111011001001100010000100111001101011000110011010110100011000011011101011100010111000101101101110011111011110100010011100100010001110110010011010101001001110101110001011100010111110111110100110101110001110101110011010101001111110110010011000100001001110011010110001100110101101000110000110111010111000101110001011011011100111101101011110 e89c88ec9aa4eb8b8befa6b8eb9aa7ec9884e6b19ad186eb8b8b6e7de89c88ec9aa4eb8b8befa6b8eb9aa7ec9884e6b19ad186eb8b8b6e7b5e
UHC 蜈욤닋隸뚧옄汚ц닋n}蜈욤닋隸뚧옄汚ц닋n{^ 1110100010100101101111111110100010001000100100101110011111100110100011001110011010011110100100001110011111111101101011001110100010001000100100100110111001111101111010001010010110111111111010001000100010010010111001111110011010001100111001101001111010010000111001111111110110101100111010001000100010010010011011100111101101011110 e8a5bfe88892e7e68ce69e90e7fdace888926e7de8a5bfe88892e7e68ce69e90e7fdace888926e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
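
The conversion shown in the table can be approximated with a minimal Python sketch. It is not this page's actual implementation, which is not shown here. The mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper name `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the 3f bytes in the table, although the page's exact substitution behavior may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) rows for the given text."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("n{^"):
        print(label, bits, hex_string)
```

For the ASCII input "n{^", every charset listed yields the same bytes, 6e7b5e, which matches the last three bytes of each row in the table.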