To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 也ゆ?娃??耶??譽???уⅹ???也 100101101110011110000010111001000011111110001000101000010011111100111111100101101110101100111111001111111110011010100011001111110011111100111111100001001000010111111010010010010011111100111111001111111001011011100111 96e782e43f88a13f3f96eb3f3fe6a33f3f3f8485fa493f3f3f96e7
EUC-JP 也ゆ?娃??耶??譽???у?孼??也 11001100111010011010010011100110001111111011000010100011001111110011111111001100111011010011111100111111111011001010010100111111001111110011111110100111111001010011111110001111101110101100001100111111001111111100110011101001 cce9a4e63fb0a33f3fcced3f3feca53f3f3fa7e53f8fbac33f3fcce9
UTF-8 也ゆ룂娃쒑꽦耶섉릍譽긷춼歷уⅹ孼껈걶也 1110010010111001100111111110001110000010100001101110101110100011100000101110010110101000100000111110110010010010100100011110101010111101101001101110100010000000101101101110110010000100100010011110101110100110100011011110100010101101101111011110101010111000101101111110110010110110101111001110111110100110100011001101000110000011111000101000010110111001111001011010110110111100111010101011101110001000111010101011000110110110111001001011100110011111 e4b99fe38286eba382e5a883ec9291eabda6e880b6ec8489eba68de8adbdeab8b7ecb6bcefa68cd183e285b9e5adbceabb88eab1b6e4b99f
UHC 也ゆ룂娃쒑꽦耶섉릍譽긷춼歷уⅹ孼껈걶也 1110010110100101101010101110011010001111100000111110100011011111100111001110100010000100101100011110010110101101100110001110011010111000101011001110011111100010101100011110010110101101100110001110011010111000101011001110010110100101101010101110010111101101100000111110100110000001100111001110010110100101 e5a5aae68f83e8df9ce884b1e5ad98e6b8ace7e2b1e5ad98e6b8ace5a5aae5ed83e9819ce5a5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)