What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN □?荷??オ?訥ぢ□?荷??オ?訥ぢ^ 1000000110100000001111111000100111010111001111110011111110000011010010010011111111100110011000111000001011000000100000011010000000111111100010011101011100111111001111111000001101001001001111111110011001100011100000101100000001011110 81a03f89d73f3f83493fe66382c081a03f89d73f3f83493fe66382c05e
EUC-JP □?荷??オ?訥ぢ□?荷??オ?訥ぢ^ 1010001010100010001111111011001011011001001111110011111110100101101010100011111111101011110001001010010011000010101000101010001000111111101100101101100100111111001111111010010110101010001111111110101111000100101001001100001001011110 a2a23fb2d93f3fa5aa3febc4a4c2a2a23fb2d93f3fa5aa3febc4a4c25e
UTF-8 □룫荷룶欄オ룶訥ぢ□룫荷룶欄オ룶訥ぢ^ 11100010100101101010000111101011101000111010101111101000100011011011011111101011101000111011011011101111101001001001110111100011100000101010101011101011101000111011011011101000101010001010010111100011100000011010001011100010100101101010000111101011101000111010101111101000100011011011011111101011101000111011011011101111101001001001110111100011100000101010101011101011101000111011011011101000101010001010010111100011100000011010001001011110 e296a1eba3abe88db7eba3b6efa49de382aaeba3b6e8a8a5e381a2e296a1eba3abe88db7eba3b6efa49de382aaeba3b6e8a8a5e381a25e
UHC □룫荷룶欄オ룶訥ぢ□룫荷룶欄オ룶訥ぢ^ 10100001111000001000111110100010111110011100001110001111101010111101000111101101101010111010101010001111101010111101001011101101101010101100001010100001111000001000111110100010111110011100001110001111101010111101000111101101101010111010101010001111101010111101001011101101101010101100001001011110 a1e08fa2f9c38fabd1edabaa8fabd2edaac2a1e08fa2f9c38fabd1edabaa8fabd2edaac25e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
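
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the tool's actual implementation: the codec names on the right-hand side of `CHARSETS` are assumptions about which Python codecs correspond to the tool's charset labels (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the table does as well.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("^").items():
        print(label, bits, hexstr)
```

Since `^` is plain ASCII, every charset in the list encodes it as the single byte `5e`, which is why each row of the table ends in `01011110` / `5e`.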