What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN ??????乙ъ┓i??????乙ъ┓iB 001111110011111100111111001111110011111100111111100010011011001110000100100011001000010010101101011010010011111100111111001111110011111100111111001111111000100110110011100001001000110010000100101011010110100101000010 3f3f3f3f3f3f89b3848c84ad693f3f3f3f3f3f89b3848c84ad6942
EUC-JP ???堉??乙ъ┓i???堉??乙ъ┓iB 00111111001111110011111110001111101101111111110100111111001111111011001010110101101001111110110010101000101011110110100100111111001111110011111110001111101101111111110100111111001111111011001010110101101001111110110010101000101011110110100101000010 3f3f3f8fb7fd3f3fb2b5a7eca8af693f3f3f8fb7fd3f3fb2b5a7eca8af6942
UTF-8 緣낆늹堉삥퍗乙ъ┓i緣낆늹堉삥퍗乙ъ┓iB 11100111101101111010001111101011100000101000011011101011100010101011100111100101101000001000100111101100100000101010010111101101100011011001011111100100101110011001100111010001100010101110001010010100100100110110100111100111101101111010001111101011100000101000011011101011100010101011100111100101101000001000100111101100100000101010010111101101100011011001011111100100101110011001100111010001100010101110001010010100100100110110100101000010 e7b7a3eb8286eb8ab9e5a089ec82a5ed8d97e4b999d18ae2949369e7b7a3eb8286eb8ab9e5a089ec82a5ed8d97e4b999d18ae294936942
UHC 緣낆늹堉삥퍗乙ъ┓i緣낆늹堉삥퍗乙ъ┓iB 111001101101111010000101111011001000100010000010111010111011110010111011111001101011101110001110111010111110000010101100111011001010011010101111011010011110011011011110100001011110110010001000100000101110101110111100101110111110011010111011100011101110101111100000101011001110110010100110101011110110100101000010 e6de85ec8882ebbcbbe6bb8eebe0aceca6af69e6de85ec8882ebbcbbe6bb8eebe0aceca6af6942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
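
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page. The codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumed stand-ins and may map some characters differently from the converter used here. Replacing unencodable characters with "?" (0x3f) is likewise an assumption, inferred from the runs of 3f in the table. The helper names encode_row and encode_table are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest standard Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's codec for Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("iB").items():
        print(label, bits, hexstr)
```

For the ASCII input "iB", every charset above yields the same two bytes, 69 42, which matches the last two bytes of each row in the table.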