What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鄒旃榊ョ優鄒旃榊ョ優^ 11100111101111101001110111010001100011011110010110101110100101110100010011100111101111101001110111010001100011011110010110101110100101110100010001011110 e7be9dd18de5ae9744e7be9dd18de5ae97445e
EUC-JP 鄒旃榊ョ優鄒旃榊ョ優^ 111011101100000011011010110100111011101011100111100011101010111011001101101001011110111011000000110110101101001110111010111001111000111010101110110011011010010101011110 eec0dad3bae78eaecda5eec0dad3bae78eaecda55e
UTF-8 鄒旃榊ョ優鄒旃榊ョ優^ 11101001100001001001001011100110100101111000001111100110101001101000101011101111101111011010111011100101100001001010101011101001100001001001001011100110100101111000001111100110101001101000101011101111101111011010111011100101100001001010101001011110 e98492e69783e6a68aefbdaee584aae98492e69783e6a68aefbdaee584aa5e
UHC 鄒???優鄒???優^ 111101011101101100111111001111110011111111101001110100001111010111011011001111110011111100111111111010011101000001011110 f5db3f3f3fe9d0f5db3f3f3fe9d05e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
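
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own converter (which is not shown here). The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical. The mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption and may differ from the page for a few characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hex bit string.
    for name, (bits, hexstr) in encode_table("あ^").items():
        print(name, bits, hexstr)
```

Because every byte is formatted as exactly eight bits, the binary column is always eight times as long as the byte count, and the hexadecimal column is twice as long.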