What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????淫???????淫?B 00111111001111110011111100111111001111110011111110001000111110100011111100111111001111110011111100111111001111110011111110001000111110100011111101000010 3f3f3f3f3f3f88fa3f3f3f3f3f3f3f88fa3f42
EUC-JP ??????淫???????淫?B 00111111001111110011111100111111001111110011111110110000111111000011111100111111001111110011111100111111001111110011111110110000111111000011111101000010 3f3f3f3f3f3fb0fc3f3f3f3f3f3f3fb0fc3f42
UTF-8 琉득텋理띷돽淫얝琉득텋理띷돽淫얝B 11101111101001111000110011101011100100111001110111101101100001011000101111101111101001111010010011101011100111011011011111101011100011111011110111100110101101111010101111101100100101101001110111101111101001111000110011101011100100111001110111101101100001011000101111101111101001111010010011101011100111011011011111101011100011111011110111100110101101111010101111101100100101101001110101000010 efa78ceb939ded858befa7a4eb9db7eb8fbde6b7abec969defa78ceb939ded858befa7a4eb9db7eb8fbde6b7abec969d42
UHC 琉득텋理띷돽淫얝琉득텋理띷돽淫얝B 111010111010010010110101111001101011011010001000111011001011010110001101111001101000100110111111111010111110001010011110010001011110101110100100101101011110011010110110100010001110110010110101100011011110011010001001101111111110101111100010100111100100010101000010 eba4b5e6b688ecb58de689bfebe29e45eba4b5e6b688ecb58de689bfebe29e4542

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
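
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the function name encode_rows is hypothetical. Characters a charset cannot represent are replaced with "?" (hex 3f), which matches the 3f bytes visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to Python's cp932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, with no separators.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hex_string in encode_rows("淫B"):
        print(label, binary, hex_string)
```

For the input "淫B", the UTF-8 row comes out as e6b7ab42, the SJIS-WIN row as 88fa42, and the EUC-JP row as b0fc42, which agrees with the byte sequences for 淫 and B in the table above. ISO-8859-1 cannot represent 淫, so that row is 3f42.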