To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ??????喩??n}??????喩??n{^ 00111111001111110011111100111111001111110011111110011010011001110011111100111111011011100111110100111111001111110011111100111111001111110011111110011010011001110011111100111111011011100111101101011110 3f3f3f3f3f3f9a673f3f6e7d3f3f3f3f3f3f9a673f3f6e7b5e
EUC-JP ???絪??喩??n}???絪??喩??n{^ 0011111100111111001111111000111111010011111011000011111100111111110100111100100000111111001111110110111001111101001111110011111100111111100011111101001111101100001111110011111111010011110010000011111100111111011011100111101101011110 3f3f3f8fd3ec3f3fd3c83f3f6e7d3f3f3f8fd3ec3f3fd3c83f3f6e7b5e
UTF-8 列룸쓷絪울쫨喩띾똽n}列룸쓷絪울쫨喩띾똽n{^ 1110111110100110100111001110101110100011101110001110110010010011101101111110011110110101101010101110110010011010101110001110110010101011101010001110010110010110101010011110101110011101101111101110101110011000101111010110111001111101111011111010011010011100111010111010001110111000111011001001001110110111111001111011010110101010111011001001101010111000111011001010101110101000111001011001011010101001111010111001110110111110111010111001100010111101011011100111101101011110 efa69ceba3b8ec93b7e7b5aaec9ab8ecaba8e596a9eb9dbeeb98bd6e7defa69ceba3b8ec93b7e7b5aaec9ab8ecaba8e596a9eb9dbeeb98bd6e7b5e
UHC 列룸쓷絪울쫨喩띾똽n}列룸쓷絪울쫨喩띾똽n{^ 1110011011101010101101111110101110011101100101001110110011011111101111111110111110100110100000011110101011100111100011011110101110001100100000110110111001111101111001101110101010110111111010111001110110010100111011001101111110111111111011111010011010000001111010101110011110001101111010111000110010000011011011100111101101011110 e6eab7eb9d94ecdfbfefa681eae78deb8c836e7de6eab7eb9d94ecdfbfefa681eae78deb8c836e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)