To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 際?咀魄v際?咀魄vB 1000110111011011001111111001100111110000111010011010111001110110100011011101101100111111100110011111000011101001101011100111011001000010 8ddb3f99f0e9ae768ddb3f99f0e9ae7642
EUC-JP 際?咀魄v際?咀魄vB 1011101011011101001111111101001011110010111100101011000001110110101110101101110100111111110100101111001011110010101100000111011001000010 badd3fd2f2f2b076badd3fd2f2f2b07642
UTF-8 際렑咀魄v際렑咀魄vB 111010011001101010011011111010111010000010010001111001011001001010000000111010011010110110000100011101101110100110011010100110111110101110100000100100011110010110010010100000001110100110101101100001000111011001000010 e99a9beba091e59280e9ad8476e99a9beba091e59280e9ad847642
UHC 際렑咀魄v際렑咀魄vB 11110000101101111000111010100110111011101011101011011011110111100111011011110000101101111000111010100110111011101011101011011011110111100111011001000010 f0b78ea6eebadbde76f0b78ea6eebadbde7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)