To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN ?レ?違ζ?膺??z?レ?違ζ?膺??zB 0011111110000011100011000011111110001000111000011000001111000100001111111110010001011110001111110011111101111010001111111000001110001100001111111000100011100001100000111100010000111111111001000101111000111111001111110111101001000010 3f838c3f88e183c43fe45e3f3f7a3f838c3f88e183c43fe45e3f3f7a42
EUC-JP ?レ?違ζ?膺??z?レ?違ζ?膺??zB 0011111110100101111011000011111110110000111000111010011011000110001111111110011110111111001111110011111101111010001111111010010111101100001111111011000011100011101001101100011000111111111001111011111100111111001111110111101001000010 3fa5ec3fb0e3a6c63fe7bf3f3f7a3fa5ec3fb0e3a6c63fe7bf3f3f7a42
UTF-8 曆レ눘違ζ눨膺덉젳z曆レ눘違ζ눨膺덉젳zB 11101111101001101000101111100011100000111010110011101011100010001001100011101001100000011001010111001110101101101110101110001000101010001110100010000110101110101110101110001101100010011110110010100000101100110111101011101111101001101000101111100011100000111010110011101011100010001001100011101001100000011001010111001110101101101110101110001000101010001110100010000110101110101110101110001101100010011110110010100000101100110111101001000010 efa68be383aceb8898e98195ceb6eb88a8e886baeb8d89eca0b37aefa68be383aceb8898e98195ceb6eb88a8e886baeb8d89eca0b37a42
UHC 曆レ눘違ζ눨膺덉젳z曆レ눘違ζ눨膺덉젳zB 111001101011011110101011111011001000011110110001111010101101111010100101111001101000011110111111111010111110110010001000111011001010000010100111011110101110011010110111101010111110110010000111101100011110101011011110101001011110011010000111101111111110101111101100100010001110110010100000101001110111101001000010 e6b7abec87b1eadea5e687bfebec88eca0a77ae6b7abec87b1eadea5e687bfebec88eca0a77a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)