To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???釗??鼇??竊??受?????而?? 0011111100111111001111111111101110111011001111110011111111101010100001110011111100111111111000101000011000111111001111111000111011110011001111110011111100111111001111110011111110001110101001110011111100111111 3f3f3ffbbb3f3fea873f3fe2863f3f8ef33f3f3f3f3f8ea73f3f
EUC-JP ???釗??鼇??竊??受?????而?? 001111110011111100111111100011111110001110100110001111110011111111110011111001110011111100111111111000111110011000111111001111111011110011110101001111110011111100111111001111110011111110111100101010010011111100111111 3f3f3f8fe3a63f3ff3e73f3fe3e63f3fbcf53f3f3f3f3fbca93f3f
UTF-8 列룸씈釗고씘鼇잕랜竊뽫쑴受노뭵列룸씈而졾쑵 111011111010011010011100111010111010001110111000111011001001010010001000111010011000011110010111111010101011001110100000111011001001010010011000111010011011110010000111111011001001111010010101111010111001111010011100111001111010101110001010111010111011110110101011111011001001000110110100111001011000111110010111111010111000010110111000111010111010110110110101111011111010011010011100111010111010001110111000111011001001010010001000111010001000000010001100111011001010000110111110111011001001000110110101 efa69ceba3b8ec9488e98797eab3a0ec9498e9bc87ec9e95eb9e9ce7ab8aebbdabec91b4e58f97eb85b8ebadb5efa69ceba3b8ec9488e8808ceca1beec91b5
UHC 列룸씈釗고씘鼇잕랜竊뽫쑴受노뭵列룸씈而졾쑵 111001101110101010110111111010111001110110100000111000011111001010110000111011011001110110101101111010001010100010011111111010101011011110100011111011111011110010010110111001111011111010101001111000011111010010110011111010111001001010000100111001101110101010110111111010111001110110100000111011001011101110100000111001011011111010101010 e6eab7eb9da0e1f2b0ed9dade8a89feab7a3efbc96e7bea9e1f4b3eb9284e6eab7eb9da0ecbba0e5beaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)