To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????U}??????U{^ 0011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f557d3f3f3f3f3f3f557b5e
SJIS-WIN 席腺褻錫?繕U}席腺褻錫?繕U{^ 100100001100100010010001010000101110010111110110100011101110000000111111100100010101010101010101011111011001000011001000100100010100001011100101111101101000111011100000001111111001000101010101010101010111101101011110 90c89142e5f68ee03f9155557d90c89142e5f68ee03f9155557b5e
EUC-JP 席腺褻錫?繕U}席腺褻錫?繕U{^ 110000001100101011000001101000111110101011111000101111001110001000111111110000011011011001010101011111011100000011001010110000011010001111101010111110001011110011100010001111111100000110110110010101010111101101011110 c0cac1a3eaf8bce23fc1b6557dc0cac1a3eaf8bce23fc1b6557b5e
UTF-8 席腺褻錫卨繕U}席腺褻錫卨繕U{^ 1110010110111000101011011110100010000101101110101110100010100100101110111110100110001100101010111110010110001101101010001110011110111001100101010101010101111101111001011011100010101101111010001000010110111010111010001010010010111011111010011000110010101011111001011000110110101000111001111011100110010101010101010111101101011110 e5b8ade885bae8a4bbe98cabe58da8e7b995557de5b8ade885bae8a4bbe98cabe58da8e7b995557b5e
UHC 席腺褻錫卨繕U}席腺褻錫卨繕U{^ 1110000010101100111000001100110111100000111000011110000010111000111000001101100111100000110010110101010101111101111000001010110011100000110011011110000011100001111000001011100011100000110110011110000011001011010101010111101101011110 e0ace0cde0e1e0b8e0d9e0cb557de0ace0cde0e1e0b8e0d9e0cb557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)