To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 荳奇セ滓万訒ア豎深荳奇セ滓万訒ア豎深^ 111001001011100010001010111011111011111010011111111001101001011010011100111110111010001110110001111001101011000110010000010110111110010010111000100010101110111110111110100111111110011010010110100111001111101110100011101100011110011010110001100100000101101101011110 e4b88aefbe9fe6969cfba3b1e6b1905be4b88aefbe9fe6969cfba3b1e6b1905b5e
EUC-JP 荳奇セ滓万訒ア豎深荳奇セ滓万訒ア豎深^ 111010001011101010110100111100011000111010111110110111101110100011001011111111001000111111011101110010001000111010110001111011001011001110111111101111001110100010111010101101001111000110001110101111101101111011101000110010111111110010001111110111011100100010001110101100011110110010110011101111111011110001011110 e8bab4f18ebedee8cbfc8fddc88eb1ecb3bfbce8bab4f18ebedee8cbfc8fddc88eb1ecb3bfbc5e
UTF-8 荳奇セ滓万訒ア豎深荳奇セ滓万訒ア豎深^ 11101000100011011011001111100101101001011000011111101111101111011011111011100110101110111001001111100100101110001000011111101000101010001001001011101111101111011011000111101000101100011000111011100110101101111011000111101000100011011011001111100101101001011000011111101111101111011011111011100110101110111001001111100100101110001000011111101000101010001001001011101111101111011011000111101000101100011000111011100110101101111011000101011110 e88db3e5a587efbdbee6bb93e4b887e8a892efbdb1e8b18ee6b7b1e88db3e5a587efbdbee6bb93e4b887e8a892efbdb1e8b18ee6b7b15e
UHC 荳奇?滓万???深荳奇?滓万???深^ 1101010011100101110100001111010000111111111011101010101111011000101100100011111100111111001111111110010010100010110101001110010111010000111101000011111111101110101010111101100010110010001111110011111100111111111001001010001001011110 d4e5d0f43feeabd8b23f3f3fe4a2d4e5d0f43feeabd8b23f3f3fe4a25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)