To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鎰??議??壤??鎰??轅??壤??? 00111111001111110011111111101000010011000011111100111111100010110110001100111111001111111001101011011111001111110011111111101000010011000011111100111111111001110111011000111111001111111001101011011111001111110011111100111111 3f3f3fe84c3f3f8b633f3f9adf3f3fe84c3f3fe7763f3f9adf3f3f3f
EUC-JP ???鎰??議??壤??鎰??轅??壤??? 00111111001111110011111111101111101011010011111100111111101101011100010000111111001111111101010011100001001111110011111111101111101011010011111100111111111011011101011100111111001111111101010011100001001111110011111100111111 3f3f3fefad3f3fb5c43f3fd4e13f3fefad3f3fedd73f3fd4e13f3f3f
UTF-8 捻뚭엽鎰쎾쉽議욧펶壤깆쥋鎰쏁솈轅깊뮎壤깆쥉利 111011111010011010100100111010111001101010101101111011001001011110111101111010011000111010110000111011001000111010111110111011001000100110111101111010001010110110110000111011001001101010100111111011011000111010110110111001011010001110100100111010101011100110000110111011001010010110001011111010011000111010110000111011001000111110000001111011001000011010001000111010001011110110000101111010101011100110001010111010111010111010001110111001011010001110100100111010101011100110000110111011001010010110001001111011111010011110011101 efa6a4eb9aadec97bde98eb0ec8ebeec89bde8adb0ec9aa7ed8eb6e5a3a4eab986eca58be98eb0ec8f81ec8688e8bd85eab98aebae8ee5a3a4eab986eca589efa79d
UHC 捻뚭엽鎰쎾쉽議욧펶壤깆쥋鎰쏁솈轅깊뮎壤깆쥉利 1110011011110111100011001110101010111111101100011110110011110000100110111110010110111101101100011110110010100001101111111110101010111100100001111110010110111101101100011110110010100010100001001110110011110000100110111110011110011001100011001110101010111111101100011110110110010010100110111110010110111101101100011110110010100010100000101110110010100110 e6f78ceabfb1ecf09be5bdb1eca1bfeabc87e5bdb1eca284ecf09be7998ceabfb1ed929be5bdb1eca282eca6
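A table like the one above can be approximated with a short Python sketch. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and encode_table; the converter behind this page may behave differently. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.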

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).