To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 繹??曜??冗?????哀?????繹??仰 11100011100010000011111100111111100101110110101000111111001111111000111111100111001111110011111100111111001111110011111110001000101000110011111100111111001111110011111100111111111000111000100000111111001111111000101111000010 e3883f3f976a3f3f8fe73f3f3f3f3f88a33f3f3f3f3fe3883f3f8bc2
EUC-JP 繹??曜??冗?????哀?????繹??仰 11100101111010000011111100111111110011011100101100111111001111111011111011101001001111110011111100111111001111110011111110110000101001010011111100111111001111110011111100111111111001011110100000111111001111111011011011000100 e5e83f3fcdcb3f3fbee93f3f3f3f3fb0a53f3f3f3f3fe5e83f3fb6c4
UTF-8 繹먮젾曜쒕젡冗밴염溜⑸젾哀잙젦惡욌젳繹먮젾仰 111001111011100110111001111010111010100010101110111011001010000010111110111001101001101110011100111011001001001010010101111011001010000010100001111001011000011010010111111010111011000010110100111011001001011110111100111011111010011110001011111000101001000110111000111011001010000010111110111001011001001110000000111011001001111010011001111011001010000010100110111011111010011010111001111011001001101010001100111011001010000010110011111001111011100110111001111010111010100010101110111011001010000010111110111001001011101110110000 e7b9b9eba8aeeca0bee69b9cec9295eca0a1e58697ebb0b4ec97bcefa78be291b8eca0bee59380ec9e99eca0a6efa6b9ec9a8ceca0b3e7b9b9eba8aeeca0bee4bbb0
UHC 繹먮젾曜쒕젡冗밴염溜⑸젾哀잙젦惡욌젳繹먮젾仰 1110011010111010100100001110101110100000101100001110100011111000100111001110101110100000100110101110100110110111101110011110101010111111101100001110101011111110101010011110101110100000101100001110010011101110100111111110101110100000100111101110011111110111100111101110101110100000101001111110011010111010100100001110101110100000101100001110010011100110 e6ba90eba0b0e8f89ceba09ae9b7b9eabfb0eafea9eba0b0e4ee9feba09ee7f79eeba0a7e6ba90eba0b0e4e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)