What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻??毅??惟щ/鶯??爾??猷?????鍮 11100100111010000011111100111111100010110100001000111111001111111000100011010010100001001000101110000001010111101110100111110010001111110011111110001110101000100011111100111111100101110101000100111111001111110011111100111111001111111110100001001010 e4e83f3f8b423f3f88d2848b815ee9f23f3f8ea23f3f97513f3f3f3f3fe84a
EUC-JP 蒻??毅??惟щ/鶯??爾??猷?????鍮 11101000111010100011111100111111101101011010001100111111001111111011000011010100101001111110101110100001101111111111001011110100001111110011111110111100101001000011111100111111110011011011001000111111001111110011111100111111001111111110111110101011 e8ea3f3fb5a33f3fb0d4a7eba1bff2f43f3fbca43f3fcdb23f3f3f3f3fefab
UTF-8 蒻몃쪇毅뷴맅惟щ/鶯밤깾爾븅툣猷몃뼣若쒓막鍮 1110100010010010101110111110101110101010100000111110110010101010100001111110011010101111100001011110101110110111101101001110101110100111100001011110011010000011100111111101000110001001111011111011110010001111111010011011011010101111111010111011000010100100111010101011100110111110111001111000100010111110111010111011100010000101111011011000100010100011111001111000110010110111111010111010101010000011111010111011110010100011111011111010010110110100111011001001001010010011111010111010011110001001111010011000110110101110 e892bbebaa83ecaa87e6af85ebb7b4eba785e6839fd189efbc8fe9b6afebb0a4eab9bee788beebb885ed88a3e78cb7ebaa83ebbca3efa5b4ec9293eba789e98dae
UHC 蒻몃쪇毅뷴맅惟щ/鶯밤깾爾븅툣猷몃뼣若쒓막鍮 1110010110110110101110001110101110100101100000011110101111110110101110101110010110010000100111111110101011101110101011001110101110100011101011111110010110100011101110011110001110000011101001111110110010110011101110101110100110111000100110101110101110100011101110001110101110010110101001101110010110101110100111001110101010111000101101111110101110111001 e5b6b8eba581ebf6bae5909feaeeaceba3afe5a3b9e383a7ecb3bae9b89aeba3b8eb96a6e5ae9ceab8b7ebb9

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
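
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_bits` and `charset_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which mirrors the runs of `3f` in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def charset_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in charset_table("あ").items():
        print(label, binary, hexa)
```

For example, `encode_bits("A", "utf-8")` returns `("01000001", "41")`, since "A" is the single byte `0x41` in every charset listed here.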