To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN □????オ?訥ぢn}□????オ?訥ぢn{^ 10000001101000000011111100111111001111110011111110000011010010010011111111100110011000111000001011000000011011100111110110000001101000000011111100111111001111110011111110000011010010010011111111100110011000111000001011000000011011100111101101011110 81a03f3f3f3f83493fe66382c06e7d81a03f3f3f3f83493fe66382c06e7b5e
EUC-JP □????オ?訥ぢn}□????オ?訥ぢn{^ 10100010101000100011111100111111001111110011111110100101101010100011111111101011110001001010010011000010011011100111110110100010101000100011111100111111001111110011111110100101101010100011111111101011110001001010010011000010011011100111101101011110 a2a23f3f3f3fa5aa3febc4a4c26e7da2a23f3f3f3fa5aa3febc4a4c26e7b5e
UTF-8 □룫吏룶欄オ룶訥ぢn}□룫吏룶欄オ룶訥ぢn{^ 1110001010010110101000011110101110100011101010111110111110100111100111101110101110100011101101101110111110100100100111011110001110000010101010101110101110100011101101101110100010101000101001011110001110000001101000100110111001111101111000101001011010100001111010111010001110101011111011111010011110011110111010111010001110110110111011111010010010011101111000111000001010101010111010111010001110110110111010001010100010100101111000111000000110100010011011100111101101011110 e296a1eba3abefa79eeba3b6efa49de382aaeba3b6e8a8a5e381a26e7de296a1eba3abefa79eeba3b6efa49de382aaeba3b6e8a8a5e381a26e7b5e
UHC □룫吏룶欄オ룶訥ぢn}□룫吏룶欄オ룶訥ぢn{^ 1010000111100000100011111010001011101100101001111000111110101011110100011110110110101011101010101000111110101011110100101110110110101010110000100110111001111101101000011110000010001111101000101110110010100111100011111010101111010001111011011010101110101010100011111010101111010010111011011010101011000010011011100111101101011110 a1e08fa2eca78fabd1edabaa8fabd2edaac26e7da1e08fa2eca78fabd1edabaa8fabd2edaac26e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)