To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????←?碎??壓??松??????ч? 0011111100111111001111110011111110000001101010010011111111100001111010100011111100111111100110101101100000111111001111111000111110111100001111110011111100111111001111110011111100111111100001001000100100111111 3f3f3f3f81a93fe1ea3f3f9ad83f3f8fbc3f3f3f3f3f3f84893f
EUC-JP ????←?碎??壓??松??????ч? 0011111100111111001111110011111110100010101010110011111111100010111011000011111100111111110101001101101000111111001111111011111010111110001111110011111100111111001111110011111100111111101001111110100100111111 3f3f3f3fa2ab3fe2ec3f3fd4da3f3fbebe3f3f3f3f3f3fa7e93f
UTF-8 僚녹뼐璘←댚碎좊븶壓믪솃松쎌춻列룸챷璘ч뒽 1110111110100110101110111110101110000101101110011110101110111100100100001110111110100111101011111110001010000110100100001110101110001100100110101110011110100010100011101110110010100010100010101110101110111000101101101110010110100011100100111110101110101111101010101110110010000110100000111110011010011101101111101110110010001110100011001110110010110110101110111110111110100110100111001110101110100011101110001110110010110001101101111110111110100111101011111101000110000111111010111001001010111101 efa6bbeb85b9ebbc90efa7afe28690eb8c9ae7a28eeca28aebb8b6e5a393ebafaaec8683e69dbeec8e8cecb6bbefa69ceba3b8ecb1b7efa7afd187eb92bd
UHC 僚녹뼐璘←댚碎좊븶壓믪솃松쎌춻列룸챷璘ч뒽 111010001110100010110011111011001001011010011000111011001101111010100001111001111000100010111110111000011110111110100000111010111001010110011111111001001110001010010010111011001001100110001000111000011110011010111101111011001010110110010111111001101110101010110111111010111010101010000100111011001101111010101100111010011000101010110011 e8e8b3ec9698ecdea1e788bee1efa0eb959fe4e292ec9988e1e6bdecad97e6eab7ebaa84ecdeace98ab3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)