What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????^ | 001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f5e
SJIS-WIN | ??倖魄??倖白^ | 00111111001111111000110011110100111010011010111000111111001111111000110011110100100101001001001001011110 | 3f3f8cf4e9ae3f3f8cf494925e
EUC-JP | 澔?倖魄澔?倖白^ | 1000111111001000111011000011111110111000111101101111001010110000100011111100100011101100001111111011100011110110110001111111001001011110 | 8fc8ec3fb8f6f2b08fc8ec3fb8f6c7f25e
UTF-8 | 澔렜倖魄澔렜倖白^ | 11100110101111101001010011101011101000001001110011100101100000001001011011101001101011011000010011100110101111101001010011101011101000001001110011100101100000001001011011100111100110011011110101011110 | e6be94eba09ce58096e9ad84e6be94eba09ce58096e799bd5e
UHC | 澔렜倖魄澔렜倖白^ | 1111101111001011100011101010111011111010101110001101101111011110111110111100101110001110101011101111101010111000110110111101110001011110 | fbcb8eaefab8dbdefbcb8eaefab8dbdc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
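
The conversion can be approximated offline with a short Python sketch. It is not this page's implementation: the function name `encode_table` and the `CHARSETS` mapping are hypothetical, and the Python codec names are assumptions chosen to match the labels above (`cp932` for SJIS-WIN, `cp949` for UHC). Characters a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row. Codec tables differ between implementations, so some rows may not match this page byte for byte.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_table(text):
    """Return (charset, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("白^"):
        print(label, shown, bits, hexstr)
```

For the input `白^`, the UTF-8 row comes out as `e799bd5e`, which matches the last four bytes of the UTF-8 row above.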