To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 蠍「逞榊カ「逞洪 11100101101101101010001011100111100101111000110111100101101101101010001011100111100101111000110101011110 e5b6a2e7978de5b6a2e7978d5e
EUC-JP 蠍「逞榊カ「逞洪 11101010101110001000111010100010111011011111011110111010111001111000111010110110100011101010001011101101111101111011100110111111 eab88ea2edf7bae78eb68ea2edf7b9bf
UTF-8 蠍「逞榊カ「逞洪 111010001010000010001101111011111011110110100010111010011000000010011110111001101010011010001010111011111011110110110110111011111011110110100010111010011000000010011110111001101011010010101010 e8a08defbda2e9809ee6a68aefbdb6efbda2e9809ee6b4aa
UHC ??逞???逞洪 0011111100111111110101101100000100111111001111110011111111010110110000011111101111110011 3f3fd6c13f3f3fd6c1fbf3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)