To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?④????湲??以??④????湲??以?B 0011111110000111010000110011111100111111001111110011111110011111110100010011111100111111100010001100100000111111001111111000011101000011001111110011111100111111001111111001111111010001001111110011111110001000110010000011111101000010 3f87433f3f3f3f9fd13f3f88c83f3f87433f3f3f3f9fd13f3f88c83f42
EUC-JP ??????湲??以???????湲??以?B 001111110011111100111111001111110011111100111111110111101101001100111111001111111011000011001010001111110011111100111111001111110011111100111111001111111101111011010011001111110011111110110000110010100011111101000010 3f3f3f3f3f3fded33f3fb0ca3f3f3f3f3f3f3fded33f3fb0ca3f42
UTF-8 曆④엽溜싲㈀湲띺썚以뢹曆④엽溜싲㈀湲띺썚以뢹B 11101111101001101000101111100010100100011010001111101100100101111011110111101111101001111000101111101100100010111011001011100011100010001000000011100110101110011011001011101011100111011011101011101100100011011001101011100100101110111010010111101011101000101011100111101111101001101000101111100010100100011010001111101100100101111011110111101111101001111000101111101100100010111011001011100011100010001000000011100110101110011011001011101011100111011011101011101100100011011001101011100100101110111010010111101011101000101011100101000010 efa68be291a3ec97bdefa78bec8bb2e38880e6b9b2eb9dbaec8d9ae4bba5eba2b9efa68be291a3ec97bdefa78bec8bb2e38880e6b9b2eb9dbaec8d9ae4bba5eba2b942
UHC 曆④엽溜싲㈀湲띺썚以뢹曆④엽溜싲㈀湲띺썚以뢹B 111001101011011110101000111010101011111110110001111010101111111010011010111010111010100110110001111010101011100010001101111010011001101110001101111011001010010010001111011101101110011010110111101010001110101010111111101100011110101011111110100110101110101110101001101100011110101010111000100011011110100110011011100011011110110010100100100011110111011001000010 e6b7a8eabfb1eafe9aeba9b1eab88de99b8deca48f76e6b7a8eabfb1eafe9aeba9b1eab88de99b8deca48f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)