To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 予??揖??儒?????異??乙??鈺??踰 1001011101011100001111110011111110010111010010110011111100111111100011101111001000111111001111110011111100111111001111111000100011011001001111110011111110001001101100110011111100111111111110111100010000111111001111111110011011111010 975c3f3f974b3f3f8ef23f3f3f3f3f88d93f3f89b33f3ffbc43f3fe6fa
EUC-JP 予??揖??儒?????異??乙??鈺??踰 110011011011110100111111001111111100110110101100001111110011111110111100111101000011111100111111001111110011111100111111101100001101101100111111001111111011001010110101001111110011111110001111111000111101010100111111001111111110110011111100 cdbd3f3fcdac3f3fbcf43f3f3f3f3fb0db3f3fb2b53f3f8fe3d53f3fecfc
UTF-8 予쀬궠揖댐ℓ儒븍퓛列용낌異룡콢乙대듌鈺곕뀿踰 111001001011101010001000111011001000000010101100111010101011011010100000111001101000111110010110111010111000110010010000111000101000010010010011111001011000010010010010111010111011100010001101111011011001001110011011111011111010011010011100111011001001101010101001111010111000001010001100111001111001010110110000111010111010001110100001111011001011110110100010111001001011100110011001111010111000110010000000111010111001001110001100111010011000100010111010111010101011001110010101111010111000000010111111111010001011100010110000 e4ba88ec80aceab6a0e68f96eb8c90e28493e58492ebb88ded939befa69cec9aa9eb828ce795b0eba3a1ecbda2e4b999eb8c80eb938ce988baeab395eb80bfe8b8b0
UHC 予쀬궠揖댐ℓ儒븍퓛列용낌異룡콢乙대듌鈺곕뀿踰 1110010111111000100101111110110010000010101100111110101111100111101101001110111110100111101001001110101011100011101110101110101110111111100001101110011011101010101111111110101110110011101001101110110010110110101101111110011010110001100110101110101111100000101101001110101110001010101111111110100010101101101100001110101110000101101101011110101110110010 e5f897ec82b3ebe7b4efa7a4eae3baebbf86e6eabfebb3a6ecb6b7e6b19aebe0b4eb8abfe8adb0eb85b5ebb2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)