To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????^ 0011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f5e
SJIS-WIN ?撤庭??逕濤^ 001111111001001101010000100100101110101100111111001111111110011110010100100111111011011101011110 3f935092eb3f3fe7949fb75e
EUC-JP ?撤庭??逕濤^ 001111111100010110110001110001001110110100111111001111111110110111110100110111101011100101011110 3fc5b1c4ed3f3fedf4deb95e
UTF-8 뤋撤庭쫸샅逕濤^ 11101011101001001000101111100110100100101010010011100101101110101010110111101100101010111011100011101100100000111000010111101001100000001001010111100110101111111010010001011110 eba48be692a4e5baadecabb8ec8385e98095e6bfa45e
UHC 뤋撤庭쫸샅逕濤^ 100011111011101111110100110011001110111111010100101001101000111110111011111101001100110011101111110101001010011001011110 8fbbf4ccefd4a68fbbf4ccefd4a65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)