To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???????ユ?[???????ユ?[^ 0011111100111111001111110011111100111111001111110011111110000011100001100011111101011011001111110011111100111111001111110011111100111111001111111000001110000110001111110101101101011110 3f3f3f3f3f3f3f83863f5b3f3f3f3f3f3f3f83863f5b5e
EUC-JP ???轝???ユ?[???轝???ユ?[^ 001111110011111100111111100011111110000110101010001111110011111100111111101001011110011000111111010110110011111100111111001111111000111111100001101010100011111100111111001111111010010111100110001111110101101101011110 3f3f3f8fe1aa3f3f3fa5e63f5b3f3f3f8fe1aa3f3f3fa5e63f5b5e
UTF-8 溫숂겓轝믦츢練ユ춴[溫숂겓轝믦츢練ユ춴[^ 111001101011101010101011111011001000100010000010111010101011001010010011111010001011110110011101111010111010111110100110111011001011100010100010111011111010011010010110111000111000001110100110111011001011011010110100010110111110011010111010101010111110110010001000100000101110101010110010100100111110100010111101100111011110101110101111101001101110110010111000101000101110111110100110100101101110001110000011101001101110110010110110101101000101101101011110 e6baabec8882eab293e8bd9debafa6ecb8a2efa696e383a6ecb6b45be6baabec8882eab293e8bd9debafa6ecb8a2efa696e383a6ecb6b45b5e
UHC 溫숂겓轝믦츢練ユ춴[溫숂겓轝믦츢練ユ춴[^ 111010001010111010011001111001111000000110101011111001101010110010010010111010001010111010011001111001101101111110101011111001101010110110010000010110111110100010101110100110011110011110000001101010111110011010101100100100101110100010101110100110011110011011011111101010111110011010101101100100000101101101011110 e8ae99e781abe6ac92e8ae99e6dfabe6ad905be8ae99e781abe6ac92e8ae99e6dfabe6ad905b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)