To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???誼??議?????溢??揄э?獰??? 00111111001111110011111110001011011000100011111100111111100010110110001100111111001111110011111100111111001111111000100011101100001111110011111110011101100010011000010010001111001111111110000011010110001111110011111100111111 3f3f3f8b623f3f8b633f3f3f3f3f88ec3f3f9d89848f3fe0d63f3f3f
EUC-JP ???誼??議?????溢??揄э?獰??? 00111111001111110011111110110101110000110011111100111111101101011100010000111111001111110011111100111111001111111011000011101110001111110011111111011001111010011010011111101111001111111110000011011000001111110011111100111111 3f3f3fb5c33f3fb5c43f3f3f3f3fb0ee3f3fd9e9a7ef3fe0d83f3f3f
UTF-8 列룸똻誼뤻춾議삳똽列룸씛溢닸씭揄э폋獰⒱뫒藺 1110111110100110100111001110101110100011101110001110101110011000101110111110100010101010101111001110101110100100101110111110110010110110101111101110100010101101101100001110110010000010101100111110101110011000101111011110111110100110100111001110101110100011101110001110110010010100100110111110011010111010101000101110101110001011101110001110110010010100101011011110011010001111100001001101000110001101111011011000111110001011111001111000110110110000111000101001001010110001111010111010101110010010111011111010011110110000 efa69ceba3b8eb98bbe8aabceba4bbecb6bee8adb0ec82b3eb98bdefa69ceba3b8ec949be6baa2eb8bb8ec94ade68f84d18ded8f8be78db0e292b1ebab92efa7b0
UHC 列룸똻誼뤻춾議삳똽列룸씛溢닸씭揄э폋獰⒱뫒藺 1110011011101010101101111110101110001100100000011110101111111110100011111110100110101101100110101110110010100001101110111110101110001100100000111110011011101010101101111110101110011101101100001110110011101110101101001110011010011101101111101110101011110001101011001110111110111100100101101110011110111110101010011110001010010001101101001110110011100001 e6eab7eb8c81ebfe8fe9ad9aeca1bbeb8c83e6eab7eb9db0eceeb4e69dbeeaf1acefbc96e7bea9e291b4ece1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)