To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 杖??堪?杖??堪? 1000111111110001001111110011111110001010101011000011111110001111111100010011111100111111100010101010110000111111 8ff13f3f8aac3f8ff13f3f8aac3f
EUC-JP 杖?檉堪?杖?檉堪? 101111101111001100111111100011111100010110111011101101001010111000111111101111101111001100111111100011111100010110111011101101001010111000111111 bef33f8fc5bbb4ae3fbef33f8fc5bbb4ae3f
UTF-8 杖렚檉堪갼杖렚檉堪갼 111001101001110110010110111010111010000010011010111001101010101010001001111001011010000010101010111010101011000010111100111001101001110110010110111010111010000010011010111001101010101010001001111001011010000010101010111010101011000010111100 e69d96eba09ae6aa89e5a0aaeab0bce69d96eba09ae6aa89e5a0aaeab0bc
UHC 杖렚檉堪갼杖렚檉堪갼 1110110111101000100011101010110111101111111000001100101011101101101100001011111011101101111010001000111010101101111011111110000011001010111011011011000010111110 ede88eadefe0caedb0beede88eadefe0caedb0be

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
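
The conversion shown in the table can be approximated with the following Python sketch. It is not the code behind this page. The codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumed equivalents, and the helper names encode_row and convert are made up for illustration. Characters that a charset cannot represent are replaced with "?" (3f), as in the table, although individual bytes may differ from the page's output for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for unrepresentable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("杖").items():
        print(name, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.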