To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????uy?????????uyB 0011111100111111001111110011111100111111001111110011111100111111001111110111010101111001001111110011111100111111001111110011111100111111001111110011111100111111011101010111100101000010 3f3f3f3f3f3f3f3f3f75793f3f3f3f3f3f3f3f3f757942
SJIS-WIN 爭?祭槃?際?肢?uy爭?祭槃?際?肢?uyB 111000001010010100111111100011011101010110011110110011110011111110001101110110110011111110001110100010000011111101110101011110011110000010100101001111111000110111010101100111101100111100111111100011011101101100111111100011101000100000111111011101010111100101000010 e0a53f8dd59ecf3f8ddb3f8e883f7579e0a53f8dd59ecf3f8ddb3f8e883f757942
EUC-JP 爭?祭槃?際?肢?uy爭?祭槃?際?肢?uyB 111000001010011100111111101110101101011111011100110100010011111110111010110111010011111110111011111010000011111101110101011110011110000010100111001111111011101011010111110111001101000100111111101110101101110100111111101110111110100000111111011101010111100101000010 e0a73fbad7dcd13fbadd3fbbe83f7579e0a73fbad7dcd13fbadd3fbbe83f757942
UTF-8 爭렦祭槃커際렑肢렖uy爭렦祭槃커際렑肢렖uyB 1110011110001000101011011110101110100000101001101110011110100101101011011110011010100111100000111110110010111011101001001110100110011010100110111110101110100000100100011110100010000010101000101110101110100000100101100111010101111001111001111000100010101101111010111010000010100110111001111010010110101101111001101010011110000011111011001011101110100100111010011001101010011011111010111010000010010001111010001000001010100010111010111010000010010110011101010111100101000010 e788adeba0a6e7a5ade6a783ecbba4e99a9beba091e882a2eba0967579e788adeba0a6e7a5ade6a783ecbba4e99a9beba091e882a2eba096757942
UHC 爭렦祭槃커際렑肢렖uy爭렦祭槃커際렑肢렖uyB 1110111010110011100011101011010111110000101011101101101011101001110001001011111111110000101101111000111010100110111100101011011010001110101010110111010101111001111011101011001110001110101101011111000010101110110110101110100111000100101111111111000010110111100011101010011011110010101101101000111010101011011101010111100101000010 eeb38eb5f0aedae9c4bff0b78ea6f2b68eab7579eeb38eb5f0aedae9c4bff0b78ea6f2b68eab757942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)