To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 午??援ο?矣??揶??午??援ο?矣??揶??^ 1000110011011111001111110011111110001001100001111000001111001101001111111110000111100001001111110011111110011101100010000011111100111111100011001101111100111111001111111000100110000111100000111100110100111111111000011110000100111111001111111001110110001000001111110011111101011110 8cdf3f3f898783cd3fe1e13f3f9d883f3f8cdf3f3f898783cd3fe1e13f3f9d883f3f5e
EUC-JP 午??援ο?矣??揶??午??援ο?矣??揶??^ 1011100011100001001111110011111110110001111001111010011011001111001111111110001011100011001111110011111111011001111010000011111100111111101110001110000100111111001111111011000111100111101001101100111100111111111000101110001100111111001111111101100111101000001111110011111101011110 b8e13f3fb1e7a6cf3fe2e33f3fd9e83f3fb8e13f3fb1e7a6cf3fe2e33f3fd9e83f3f5e
UTF-8 午닿퓭援ο㎖矣섎겱揶쏆쉰午닿퓭援ο㎖矣섎겱揶쏆쉰^ 1110010110001101100010001110101110001011101111111110110110010011101011011110011010001111101101001100111010111111111000111000111010010110111001111001111110100011111011001000010010001110111010101011001010110001111001101000111110110110111011001000111110000110111011001000100110110000111001011000110110001000111010111000101110111111111011011001001110101101111001101000111110110100110011101011111111100011100011101001011011100111100111111010001111101100100001001000111011101010101100101011000111100110100011111011011011101100100011111000011011101100100010011011000001011110 e58d88eb8bbfed93ade68fb4cebfe38e96e79fa3ec848eeab2b1e68fb6ec8f86ec89b0e58d88eb8bbfed93ade68fb4cebfe38e96e79fa3ec848eeab2b1e68fb6ec8f86ec89b05e
UHC 午닿퓭援ο㎖矣섎겱揶쏆쉰午닿퓭援ο㎖矣섎겱揶쏆쉰^ 11100111111011011011010011101010101111111001010011101010101101011010010111101111101001111010001011101011111110001001100011101011100000011011110111100101101010101001101111101100101111011010111011100111111011011011010011101010101111111001010011101010101101011010010111101111101001111010001011101011111110001001100011101011100000011011110111100101101010101001101111101100101111011010111001011110 e7edb4eabf94eab5a5efa7a2ebf898eb81bde5aa9becbdaee7edb4eabf94eab5a5efa7a2ebf898eb81bde5aa9becbdae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)