Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 額?ぉ???嚥??縊k?額?ぉ???嚥??縊k?^ 1000101001111010001111111000001010100111001111110011111100111111100110101000101100111111001111111110001101101111100000101000101100111111100010100111101000111111100000101010011100111111001111110011111110011010100010110011111100111111111000110110111110000010100010110011111101011110 8a7a3f82a73f3f3f9a8b3f3fe36f828b3f8a7a3f82a73f3f3f9a8b3f3fe36f828b3f5e
EUC-JP 額?ぉ???嚥??縊k?額?ぉ???嚥??縊k?^ 1011001111011011001111111010010010101001001111110011111100111111110100111110101100111111001111111110010111010000101000111110101100111111101100111101101100111111101001001010100100111111001111110011111111010011111010110011111100111111111001011101000010100011111010110011111101011110 b3db3fa4a93f3f3fd3eb3f3fe5d0a3eb3fb3db3fa4a93f3f3fd3eb3f3fe5d0a3eb3f5e
UTF-8 額ㅻぉ溜곕젷嚥잙젉縊k젺額ㅻぉ溜곕젷嚥잙젉縊k젶^ 11101001101000011000110111100011100001011011101111100011100000011000100111101111101001111000101111101010101100111001010111101100101000001011011111100101100110101010010111101100100111101001100111101100101000001000100111100111101110001000101011101111101111011000101111101100101000001011101011101001101000011000110111100011100001011011101111100011100000011000100111101111101001111000101111101010101100111001010111101100101000001011011111100101100110101010010111101100100111101001100111101100101000001000100111100111101110001000101011101111101111011000101111101100101000001011011001011110 e9a18de385bbe38189efa78beab395eca0b7e59aa5ec9e99eca089e7b88aefbd8beca0bae9a18de385bbe38189efa78beab395eca0b7e59aa5ec9e99eca089e7b88aefbd8beca0b65e
UHC 額ㅻぉ溜곕젷嚥잙젉縊k젺額ㅻぉ溜곕젷嚥잙젉縊k젶^ 11100100111111101010010011101011101010101010100111101010111111101011000011101011101000001010101111100110101111111001111111101011101000001000101111100100111111001010001111101011101000001010110111100100111111101010010011101011101010101010100111101010111111101011000011101011101000001010101111100110101111111001111111101011101000001000101111100100111111001010001111101011101000001010101001011110 e4fea4ebaaa9eafeb0eba0abe6bf9feba08be4fca3eba0ade4fea4ebaaa9eafeb0eba0abe6bf9feba08be4fca3eba0aa5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
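
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's actual implementation: the function name `encode_table`, the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC), and the use of `errors="replace"` to emit `?` (0x3f) for characters a charset cannot represent are all assumptions made for illustration.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as Microsoft CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as Microsoft CP949
}


def encode_table(text):
    """Return (charset, binary string, hex string) rows for the given text."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (byte 0x3f).
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("ぉ^"):
        print(label, binary, hexadecimal)
```

For the input `ぉ^`, the hexadecimal column should agree with the corresponding byte sequences in the table above: `82a7` (SJIS-WIN), `a4a9` (EUC-JP), `e38189` (UTF-8) and `aaa9` (UHC) for `ぉ`, followed by `5e` for `^`. ISO-8859-1 has no `ぉ`, so that row falls back to `3f`.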