To which bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????鎖??歪????┠釉??倭?? 0011111100111111001111110011111100111111001111111000110110111101001111110011111110011000011000110011111100111111001111110011111110000100101101011110011111010110001111110011111110011000011000000011111100111111 3f3f3f3f3f3f8dbd3f3f98633f3f3f3f84b5e7d63f3f98603f3f
EUC-JP 艅??堉??鎖??歪????┠釉??倭?? 100011111101011011111101001111110011111110001111101101111111110100111111001111111011101010111111001111110011111111001111110001000011111100111111001111110011111110101000101101111110111011011000001111110011111111001111110000010011111100111111 8fd6fd3f3f8fb7fd3f3fbabf3f3fcfc43f3f3f3fa8b7eed83f3fcfc13f3f
UTF-8 艅덈낌堉싨만鎖듦섭歪묆굥琉뗰┠釉띿뒩倭얠닪 111010001000100110000101111010111000110110001000111010111000001010001100111001011010000010001001111011001000101110101000111010111010011110001100111010011000111010010110111010111001001110100110111011001000010010101101111001101010110110101010111010111010110010000110111010101011010110100101111011111010011110001100111010111001011110110000111000101001010010100000111010011000011110001001111010111001110110111111111010111001001010101001111001011000000010101101111011001001011010100000111010111000101110101010 e88985eb8d88eb828ce5a089ec8ba8eba78ce98e96eb93a6ec84ade6adaaebac86eab5a5efa78ceb97b0e294a0e98789eb9dbfeb92a9e580adec96a0eb8baa
UHC 艅덈낌堉싨만鎖듦섭歪묆굥琉뗰┠釉띿뒩倭얠닪 111001101010100110001000111010111011001110100110111010111011110010011010111001101011100010111000111000011111000010110101111010101011110010110111111010001110000010010001111000111000001010001011111010111010010010001011111011111010011010110111111010111011100010001101111011001000101010100011111010001101111010111110111011001000100010100101 e6a988ebb3a6ebbc9ae6b8b8e1f0b5eabcb7e8e091e3828beba48befa6b7ebb88dec8aa3e8debeec88a5

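SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (hexadecimal 3f) in that charset's row.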
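
The same kind of conversion can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page: the helper names (encode_row, encode_table) are hypothetical, the codec names are Python's own (cp932 for SJIS-Win, cp949 for UHC), and substituting "?" for unencodable characters via errors="replace" is an assumption chosen to match the 3f bytes seen in the table.

```python
# Map the charset labels used in the table to Python codec names.
# The codec choices are assumptions; cp932 follows the SJIS-Win = CP932 note.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("鎖").items():
        print(label, bits, hexstr)
```

For the single character 鎖, this sketch yields 8dbd for SJIS-WIN, babf for EUC-JP, and e98e96 for UTF-8, which agrees with the bytes shown for that character in the table, while ISO-8859-1 falls back to 3f.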