To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 讓ケ讀咲懸莉ー 111001101010100010111001111001101010010010001101111001111000110010011100111001001011101110110000 e6a8b9e6a48de78c9ce4bbb0
EUC-JP 讓ケ讀咲懸莉ー 1110110010101010100011101011100111101100101001101011101011101001101101111111110011101000101111011000111010110000 ecaa8eb9eca6bae9b7fce8bd8eb0
UTF-8 讓ケ讀咲懸莉ー 111010001010111010010011111011111011110110111001111010001010111010000000111001011001001010110010111001101000011110111000111010001000111010001001111011111011110110110000 e8ae93efbdb9e8ae80e592b2e687b8e88e89efbdb0
UHC 讓?讀?懸莉? 1110010111010011001111111101010011000001001111111111101011011000110101111110100100111111 e5d33fd4c13ffad8d7e93f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)