What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??怨?????靭??純??域??循?? 111000011001111100111111001111111000100110000101001111110011111100111111001111110011111110010000011110000011111100111111100011111000001100111111001111111000100011100110001111110011111110001111011110100011111100111111 e19f3f3f89853f3f3f3f3f90783f3f8f833f3f88e63f3f8f7a3f3f
EUC-JP 癲??怨?????靭??純??域??循?? 111000101010000100111111001111111011000111100101001111110011111100111111001111110011111110111111110110010011111100111111101111011110001100111111001111111011000011101000001111110011111110111101110110110011111100111111 e2a13f3fb1e53f3f3f3f3fbfd93f3fbde33f3fb0e83f3fbddb3f3f
UTF-8 癲ㅺ슝怨대춸烈ㅻ뀪靭뚨럤純놁춶域밟뫁循띌뒽 111001111001100110110010111000111000010110111010111011001000101010011101111001101000000010101000111010111000110010000000111011001011011010111000111011111010011010011111111000111000010110111011111010111000000010101010111010011001110110101101111010111001101010101000111010111001111110100100111001111011010010010100111010111000011010000001111011001011011010110110111001011001111110011111111010111011000010011111111010111010101110000001111001011011111010101010111010111001110110001100111010111001001010111101 e799b2e385baec8a9de680a8eb8c80ecb6b8efa69fe385bbeb80aae99dadeb9aa8eb9fa4e7b494eb8681ecb6b6e59f9febb09febab81e5beaaeb9d8ceb92bd
UHC 癲ㅺ슝怨대춸烈ㅻ뀪靭뚨럤純놁춶域밟뫁循띌뒽 111011111010011010100100111010101011110110111001111010101011001110110100111010111010110110010100111001101110111110100100111010111000010110100000111011001110010110001100111001111000111010000111111000101110110110000110111011001010110110010010111001101011010010111001111000101001000110100101111000101110000010110110111010011000101010110011 efa6a4eabdb9eab3b4ebad94e6efa4eb85a0ece58ce78e87e2ed86ecad92e6b4b9e291a5e2e0b6e98ab3

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
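
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the page's own implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions chosen for illustration, and Python's mapping tables may differ from the page's for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These names are assumptions; the page may use different mapping tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Calling `convert` on a short string returns one (binary, hexadecimal) pair per charset, in the same row order as the table.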