To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 腋?????蹂〓?壓??違??蟻??腰 1110001111111100001111110011111100111111001111110011111111100110111110001000000110101100001111111001101011011000001111110011111110001000111000010011111100111111100010110110000100111111001111111000110110011000 e3fc3f3f3f3f3fe6f881ac3f9ad83f3f88e13f3f8b613f3f8d98
EUC-JP 腋??堉??蹂〓?壓??違??蟻??腰 11100110111111100011111100111111100011111011011111111101001111110011111111101100111110101010001010101110001111111101010011011010001111110011111110110000111000110011111100111111101101011100001000111111001111111011100111111000 e6fe3f3f8fb7fd3f3fecfaa2ae3fd4da3f3fb0e33f3fb5c23f3fb9f8
UTF-8 腋뉗럩堉랃㎕蹂〓룆壓믩랭違뺡굲蟻녿젙腰 111010001000010110001011111010111000100110010111111010111001111110101001111001011010000010001001111010111001111010000011111000111000111010010101111010001011100110000010111000111000000010010011111010111010001110000110111001011010001110010011111010111010111110101001111010111001111010101101111010011000000110010101111010111011101010100001111010101011010110110010111010001001111110111011111010111000010110111111111011001010000010011001111010001000010110110000 e8858beb8997eb9fa9e5a089eb9e83e38e95e8b982e38093eba386e5a393ebafa9eb9eade98195ebbaa1eab5b2e89fbbeb85bfeca099e885b0
UHC 腋뉗럩堉랃㎕蹂〓룆壓믩랭違뺡굲蟻녿젙腰 1110010011111101100001111110110010001110100011001110101110111100100011011110111110100111101000011110101110110011101000011110101110001111100001011110010011100010100100101110101110110111101010011110101011011110100101011110100110000010100101011110101111111100100001101110101110100000100101011110100110100110 e4fd87ec8e8cebbc8defa7a1ebb3a1eb8f85e4e292ebb7a9eade95e98295ebfc86eba095e9a6

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
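
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_in_charsets` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for `text`."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the codec cannot encode.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_in_charsets("あ").items():
        print(label, bits, hexed)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.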