To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻??毅??惟щ/鶯??爾??猷??? 111001001110100000111111001111111000101101000010001111110011111110001000110100101000010010001011100000010101111011101001111100100011111100111111100011101010001000111111001111111001011101010001001111110011111100111111 e4e83f3f8b423f3f88d2848b815ee9f23f3f8ea23f3f97513f3f3f
EUC-JP 蒻??毅??惟щ/鶯??爾??猷??? 111010001110101000111111001111111011010110100011001111110011111110110000110101001010011111101011101000011011111111110010111101000011111100111111101111001010010000111111001111111100110110110010001111110011111100111111 e8ea3f3fb5a33f3fb0d4a7eba1bff2f43f3fbca43f3fcdb23f3f3f
UTF-8 蒻몃쪇毅뷴맅惟щ/鶯밤깾爾븅툣猷몃뼣若 1110100010010010101110111110101110101010100000111110110010101010100001111110011010101111100001011110101110110111101101001110101110100111100001011110011010000011100111111101000110001001111011111011110010001111111010011011011010101111111010111011000010100100111010101011100110111110111001111000100010111110111010111011100010000101111011011000100010100011111001111000110010110111111010111010101010000011111010111011110010100011111011111010010110110100 e892bbebaa83ecaa87e6af85ebb7b4eba785e6839fd189efbc8fe9b6afebb0a4eab9bee788beebb885ed88a3e78cb7ebaa83ebbca3efa5b4
UHC 蒻몃쪇毅뷴맅惟щ/鶯밤깾爾븅툣猷몃뼣若 1110010110110110101110001110101110100101100000011110101111110110101110101110010110010000100111111110101011101110101011001110101110100011101011111110010110100011101110011110001110000011101001111110110010110011101110101110100110111000100110101110101110100011101110001110101110010110101001101110010110101110 e5b6b8eba581ebf6bae5909feaeeaceba3afe5a3b9e383a7ecb3bae9b89aeba3b8eb96a6e5ae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)