To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 勺湘。照ラ湘。 1000111011011001100011111100001110100001100011111100011011010111100011111100001110100001 8ed98fc3a18fc6d78fc3a1
EUC-JP 勺湘。照ラ湘。 1011110011011011101111101100010110001110101000011011111011001000100011101101011110111110110001011000111010100001 bcdbbec58ea1bec88ed7bec58ea1
UTF-8 勺湘。照ラ湘。 111001011000101110111010111001101011100110011000111011111011110110100001111001111000010110100111111011111011111010010111111001101011100110011000111011111011110110100001 e58bbae6b998efbda1e785a7efbe97e6b998efbda1
UHC 勺湘?照?湘? 1110110111000011110111111100111100111111111100001100111000111111110111111100111100111111 edc3dfcf3ff0ce3fdfcf3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)