What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???厭????。????ョ?擬??烏l? 001111110011111100111111100010010111110100111111001111110011111100111111100000010100001000111111001111110011111100111111100000111000011100111111100010110101101100111111001111111000100101000111100000101000110000111111 3f3f3f897d3f3f3f3f81423f3f3f3f83873f8b5b3f3f8947828c3f
EUC-JP ???厭????。彛???ョ?擬??烏l? 0011111100111111001111111011000111011110001111110011111100111111001111111010000110100011100011111011110011111010001111110011111100111111101001011110011100111111101101011011110000111111001111111011000110101000101000111110110000111111 3f3f3fb1de3f3f3f3fa1a38fbcfa3f3f3fa5e73fb5bc3f3fb1a8a3ec3f
UTF-8 玲곷젷厭묒뼏杻듣。彛몃젶溜ョ뼇擬쀫뼲烏l쥍 111011111010011010101101111010101011001110110111111011001010000010110111111001011000111010101101111010111010110010010010111010111011110010001111111011111010011110001000111010111001001110100011111000111000000010000010111001011011110110011011111010111010101010000011111011001010000010110110111011111010011110001011111000111000001110100111111010111011110010000111111001101001001110101100111011001000000010101011111010111011110010110010111001111000001110001111111011111011110110001100111011001010010110001101 efa6adeab3b7eca0b7e58eadebac92ebbc8fefa788eb93a3e38082e5bd9bebaa83eca0b6efa78be383a7ebbc87e693acec80abebbcb2e7838fefbd8ceca58d
UHC 玲곷젷厭묒뼏杻듣。彛몃젶溜ョ뼇擬쀫뼲烏l쥍 111001111011111110000001111010111010000010101011111001101111010010010001111011001001011010010111111010101111010010110101111010001010000110100011111011001010110110111000111010111010000010101010111010101111111010101011111001111001011010010001111010111111010010010111111010111001011010110101111010001010000110100011111011001010001010000110 e7bf81eba0abe6f491ec9697eaf4b5e8a1a3ecadb8eba0aaeafeabe79691ebf497eb96b5e8a1a3eca286

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
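
The conversion in the table can be approximated offline with a short Python sketch. This is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to each charset, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes seen in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (byte 0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # U+3002 (ideographic full stop) also appears in the table above.
    for name, (bits, hex_string) in convert("\u3002").items():
        print(name, bits, hex_string)
```

For U+3002 this should give 8142 in SJIS-WIN, a1a3 in EUC-JP, and e38082 in UTF-8, which matches the byte sequences for "。" in the table rows above.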