To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????d????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110110010000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f643f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????d????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110110010000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f643f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ?????????d????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110110010000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f643f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 淋믫슕泥좎쮫泥좏슀d淋믫슕泥좎쮫泥섏쮬淋믫슕李 11101111101001111011010111101011101011111010101111101100100010101001010111101111101001111010001111101100101000101000111011101100101011101010101111101111101001111010001111101100101000101000111111101100100010101000000001100100111011111010011110110101111010111010111110101011111011001000101010010101111011111010011110100011111011001010001010001110111011001010111010101011111011111010011110100011111011001000010010001111111011001010111010101100111011111010011110110101111010111010111110101011111011001000101010010101111011111010011110100001 efa7b5ebafabec8a95efa7a3eca28eecaeabefa7a3eca28fec8a8064efa7b5ebafabec8a95efa7a3eca28eecaeabefa7a3ec848fecaeacefa7b5ebafabec8a95efa7a1
UHC 淋믫슕泥좎쮫泥좏슀d淋믫슕泥좎쮫泥섏쮬淋믫슕李 111011001111100010010010111011011001101010100100111011001011001010100000111011001010100010001000111011001011001010100000111011011001101010010011011001001110110011111000100100101110110110011010101001001110110010110010101000001110110010101000100010001110110010110010100110001110110010101000100010011110110011111000100100101110110110011010101001001110110010110000 ecf892ed9aa4ecb2a0eca888ecb2a0ed9a9364ecf892ed9aa4ecb2a0eca888ecb298eca889ecf892ed9aa4ecb0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)