What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????i???????????iB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f3f6942
SJIS-WIN ?????普?????i?????普?????iB 001111110011111100111111001111110011111110010101100000010011111100111111001111110011111100111111011010010011111100111111001111110011111100111111100101011000000100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f95813f3f3f3f3f693f3f3f3f3f95813f3f3f3f3f6942
EUC-JP ?????普?????i?????普?????iB 001111110011111100111111001111110011111111001001111000010011111100111111001111110011111100111111011010010011111100111111001111110011111100111111110010011110000100111111001111110011111100111111001111110110100101000010 3f3f3f3f3fc9e13f3f3f3f3f693f3f3f3f3fc9e13f3f3f3f3f6942
UTF-8 렻렖렻렪렻普렖렺렮렻렧i렻렖렻렪렻普렖렺렮렻렧iB 111010111010000010111011111010111010000010010110111010111010000010111011111010111010000010101010111010111010000010111011111001101001100110101110111010111010000010010110111010111010000010111010111010111010000010101110111010111010000010111011111010111010000010100111011010011110101110100000101110111110101110100000100101101110101110100000101110111110101110100000101010101110101110100000101110111110011010011001101011101110101110100000100101101110101110100000101110101110101110100000101011101110101110100000101110111110101110100000101001110110100101000010 eba0bbeba096eba0bbeba0aaeba0bbe699aeeba096eba0baeba0aeeba0bbeba0a769eba0bbeba096eba0bbeba0aaeba0bbe699aeeba096eba0baeba0aeeba0bbeba0a76942
UHC 렻렖렻렪렻普렖렺렮렻렧i렻렖렻렪렻普렖렺렮렻렧iB 1000111011000011100011101010101110001110110000111000111010111000100011101100001111011100110001011000111010101011100011101100001010001110101110111000111011000011100011101011011001101001100011101100001110001110101010111000111011000011100011101011100010001110110000111101110011000101100011101010101110001110110000101000111010111011100011101100001110001110101101100110100101000010 8ec38eab8ec38eb88ec3dcc58eab8ec28ebb8ec38eb6698ec38eab8ec38eb88ec3dcc58eab8ec28ebb8ec38eb66942

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
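
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `bit_strings` is hypothetical. The codec names are assumptions about which Python codecs correspond to the charsets listed above: `cp932` for SJIS-Win and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is consistent with the `3f` runs in the table.

```python
# Sketch: encode a string in several character sets and show the resulting
# bytes as a binary bit string and as hexadecimal.
# Codec names are Python's; the mapping to the page's charset labels
# (SJIS-Win -> cp932, UHC -> cp949) is an assumption.

CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def bit_strings(text, charsets=CHARSETS):
    """Return a list of (charset, binary string, hex string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for name, binary, hexadecimal in bit_strings("普iB"):
        print(f"{name:12} {binary} {hexadecimal}")
```

Each byte is rendered as exactly eight bits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.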