Which bit string is a character (or a string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 臧???彎隕???}臧???彎隕???{^ 111001000110100000111111001111110011111110011100010111011110100010100110001111110011111100111111011111011110010001101000001111110011111100111111100111000101110111101000101001100011111100111111001111110111101101011110 e4683f3f3f9c5de8a63f3f3f7de4683f3f3f9c5de8a63f3f3f7b5e
EUC-JP 臧???彎隕???}臧???彎隕???{^ 111001111100100100111111001111110011111111010111101111101111000010101000001111110011111100111111011111011110011111001001001111110011111100111111110101111011111011110000101010000011111100111111001111110111101101011110 e7c93f3f3fd7bef0a83f3f3f7de7c93f3f3fd7bef0a83f3f3f7b5e
UTF-8 臧얩렪잿彎隕뀄렮렓}臧얩렪잿彎隕뀄렮렓{^ 111010001000011110100111111011001001011010101001111010111010000010101010111011001001111010111111111001011011110110001110111010011001101010010101111010111000000010000100111010111010000010101110111010111010000010010011011111011110100010000111101001111110110010010110101010011110101110100000101010101110110010011110101111111110010110111101100011101110100110011010100101011110101110000000100001001110101110100000101011101110101110100000100100110111101101011110 e887a7ec96a9eba0aaec9ebfe5bd8ee99a95eb8084eba0aeeba0937de887a7ec96a9eba0aaec9ebfe5bd8ee99a95eb8084eba0aeeba0937b5e
UHC 臧얩렪잿彎隕뀄렮렓}臧얩렪잿彎隕뀄렮렓{^ 111011011111010110111110111011011000111010111000110000001110110111011000101101101110101010100010101100101110110110001110101110111000111010101000011111011110110111110101101111101110110110001110101110001100000011101101110110001011011011101010101000101011001011101101100011101011101110001110101010000111101101011110 edf5beed8eb8c0edd8b6eaa2b2ed8ebb8ea87dedf5beed8eb8c0edd8b6eaa2b2ed8ebb8ea87b5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
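
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the function name `encode_table` is hypothetical, and it assumes that Python's `cp932` and `cp949` codecs correspond to the page's SJIS-WIN and UHC labels. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN is treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as CP949
}


def encode_table(text):
    """Return (label, decoded text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexa in encode_table("臧{^"):
        print(label, shown, bits, hexa)
```

ASCII characters such as `{` and `^` encode to the same bytes (`7b`, `5e`) in all five charsets, which is why every row of the table ends in `7b5e`.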