To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥??誼ユ┏揄??嚥??誼ユ┏揄??^ 1001101010001011001111110011111110001011011000101000001110000110100001001010110010011101100010010011111100111111100110101000101100111111001111111000101101100010100000111000011010000100101011001001110110001001001111110011111101011110 9a8b3f3f8b62838684ac9d893f3f9a8b3f3f8b62838684ac9d893f3f5e
EUC-JP 嚥??誼ユ┏揄??嚥??誼ユ┏揄??^ 1101001111101011001111110011111110110101110000111010010111100110101010001010111011011001111010010011111100111111110100111110101100111111001111111011010111000011101001011110011010101000101011101101100111101001001111110011111101011110 d3eb3f3fb5c3a5e6a8aed9e93f3fd3eb3f3fb5c3a5e6a8aed9e93f3f5e
UTF-8 嚥싲갭誼ユ┏揄앹쓧嚥싲갭誼ユ┏揄앹뇗^ 11100101100110101010010111101100100010111011001011101010101100001010110111101000101010101011110011100011100000111010011011100010100101001000111111100110100011111000010011101100100101011011100111101100100100111010011111100101100110101010010111101100100010111011001011101010101100001010110111101000101010101011110011100011100000111010011011100010100101001000111111100110100011111000010011101100100101011011100111101011100001111001011101011110 e59aa5ec8bb2eab0ade8aabce383a6e2948fe68f84ec95b9ec93a7e59aa5ec8bb2eab0ade8aabce383a6e2948fe68f84ec95b9eb87975e
UHC 嚥싲갭誼ユ┏揄앹쓧嚥싲갭誼ユ┏揄앹뇗^ 11100110101111111001101011101011101100001011100011101011111111101010101111100110101001101010111011101010111100011001110111101100100111011000100011100110101111111001101011101011101100001011100011101011111111101010101111100110101001101010111011101010111100011001110111101100100001111000001001011110 e6bf9aebb0b8ebfeabe6a6aeeaf19dec9d88e6bf9aebb0b8ebfeabe6a6aeeaf19dec87825e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)