What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???萸??喩?????誼??循??馭??? 001111110011111100111111111001001100111000111111001111111001101001100111001111110011111100111111001111110011111110001011011000100011111100111111100011110111101000111111001111111110100101100110001111110011111100111111 3f3f3fe4ce3f3f9a673f3f3f3f3f8b623f3f8f7a3f3fe9663f3f3f
EUC-JP ???萸??喩?????誼??循??馭??? 001111110011111100111111111010001101000000111111001111111101001111001000001111110011111100111111001111110011111110110101110000110011111100111111101111011101101100111111001111111111000111000111001111110011111100111111 3f3f3fe8d03f3fd3c83f3f3f3f3fb5c33f3fbddb3f3ff1c73f3f3f
UTF-8 列룸쓷萸먲쫨喩띾똽列룸똻誼뤹뵓循놁뵣馭궽삼폇 111011111010011010011100111010111010001110111000111011001001001110110111111010001001000010111000111010111010100010110010111011001010101110101000111001011001011010101001111010111001110110111110111010111001100010111101111011111010011010011100111010111010001110111000111010111001100010111011111010001010101010111100111010111010010010111001111010111011010110010011111001011011111010101010111010111000011010000001111010111011010110100011111010011010011010101101111010101011011010111101111011001000001010111100111011011000111110000111 efa69ceba3b8ec93b7e890b8eba8b2ecaba8e596a9eb9dbeeb98bdefa69ceba3b8eb98bbe8aabceba4b9ebb593e5beaaeb8681ebb5a3e9a6adeab6bdec82bced8f87
UHC 列룸쓷萸먲쫨喩띾똽列룸똻誼뤹뵓循놁뵣馭궽삼폇 1110011011101010101101111110101110011101100101001110101110101101100100001110111110100110100000011110101011100111100011011110101110001100100000111110011011101010101101111110101110001100100000011110101111111110100011111110011110010100100101011110001011100000100001101110110010010100101000111110010111011111100000101100111010111011111011111011110010010100 e6eab7eb9d94ebad90efa681eae78deb8c83e6eab7eb8c81ebfe8fe79495e2e086ec94a3e5df82cebbefbc94

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (3f) in the table.
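
The conversion shown in the table can be approximated with the Python standard library. This is only a sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with "?" (3f), which matches the behavior visible in the table, although Python's codecs may differ from this tool for some edge-case characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (byte 0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("萸").items():
        print(label, binary, hexadecimal)
```

For the single character 萸, which also appears in the table above, the hexadecimal values should come out as e4ce (SJIS-WIN), e8d0 (EUC-JP), and e890b8 (UTF-8), while ISO-8859-1 falls back to 3f.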