To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 繹ル?溢?ず繹ル?溢?ず繹ル?溢?ず繹 1110001110001000100000111000101100111111100010001110110000111111100000101011100011100011100010001000001110001011001111111000100011101100001111111000001010111000111000111000100010000011100010110011111110001000111011000011111110000010101110001110001110001000 e388838b3f88ec3f82b8e388838b3f88ec3f82b8e388838b3f88ec3f82b8e388
EUC-JP 繹ル?溢?ず繹ル?溢?ず繹ル?溢?ず繹 1110010111101000101001011110101100111111101100001110111000111111101001001011101011100101111010001010010111101011001111111011000011101110001111111010010010111010111001011110100010100101111010110011111110110000111011100011111110100100101110101110010111101000 e5e8a5eb3fb0ee3fa4bae5e8a5eb3fb0ee3fa4bae5e8a5eb3fb0ee3fa4bae5e8
UTF-8 繹ル슜溢띄ず繹ル슜溢띄ず繹ル슜溢띄ず繹 111001111011100110111001111000111000001110101011111011001000101010011100111001101011101010100010111010111001110110000100111000111000000110011010111001111011100110111001111000111000001110101011111011001000101010011100111001101011101010100010111010111001110110000100111000111000000110011010111001111011100110111001111000111000001110101011111011001000101010011100111001101011101010100010111010111001110110000100111000111000000110011010111001111011100110111001 e7b9b9e383abec8a9ce6baa2eb9d84e3819ae7b9b9e383abec8a9ce6baa2eb9d84e3819ae7b9b9e383abec8a9ce6baa2eb9d84e3819ae7b9b9
UHC 繹ル슜溢띄ず繹ル슜溢띄ず繹ル슜溢띄ず繹 1110011010111010101010111110101110011010101010011110110011101110101101101110011110101010101110101110011010111010101010111110101110011010101010011110110011101110101101101110011110101010101110101110011010111010101010111110101110011010101010011110110011101110101101101110011110101010101110101110011010111010 e6baabeb9aa9eceeb6e7aabae6baabeb9aa9eceeb6e7aabae6baabeb9aa9eceeb6e7aabae6ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)