To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゅ????耶??譽???хオ???也よ? 1001101011001000100000101110001100111111001111110011111100111111100101101110101100111111001111111110011010100011001111110011111100111111100001001000011110000011010010010011111100111111001111111001011011100111100000101110011000111111 9ac882e33f3f3f3f96eb3f3fe6a33f3f3f848783493f3f3f96e782e63f
EUC-JP 塋ゅ?孼??耶??譽???хオ???也よ? 11010100110010101010010011100101001111111000111110111010110000110011111100111111110011001110110100111111001111111110110010100101001111110011111100111111101001111110011110100101101010100011111100111111001111111100110011101001101001001110100000111111 d4caa4e53f8fbac33f3fcced3f3feca53f3f3fa7e7a5aa3f3f3fcce9a4e83f
UTF-8 塋ゅ콪孼껇꽦耶섉릍譽긷춼歷хオ掠욄룂也よ갬 1110010110100001100010111110001110000010100001011110110010111101101010101110010110101101101111001110101010111011100001111110101010111101101001101110100010000000101101101110110010000100100010011110101110100110100011011110100010101101101111011110101010111000101101111110110010110110101111001110111110100110100011001101000110000101111000111000001010101010111011111010010110110101111011001001101010000100111010111010001110000010111001001011100110011111111000111000001010001000111010101011000010101100 e5a18be38285ecbdaae5adbceabb87eabda6e880b6ec8489eba68de8adbdeab8b7ecb6bcefa68cd185e382aaefa5b5ec9a84eba382e4b99fe38288eab0ac
UHC 塋ゅ콪孼껇꽦耶섉릍譽긷춼歷хオ掠욄룂也よ갬 111001111010101110101010111001011011000110011110111001011110110110000011111010001000010010110001111001011010110110011000111001101011100010101100111001111110001010110001111001011010110110011000111001101011100010101100111001111010101110101010111001011011000110011110111001101000111110000011111001011010010110101010111010001011000010110111 e7abaae5b19ee5ed83e884b1e5ad98e6b8ace7e2b1e5ad98e6b8ace7abaae5b19ee68f83e5a5aae8b0b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)