To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????予??吟??徇?????? 00111111001111110011111100111111001111110011111100111111001111110011111110010111010111000011111100111111100010111110000100111111001111111001110001101101001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f975c3f3f8be13f3f9c6d3f3f3f3f3f3f
EUC-JP ?????????予??吟??徇?????? 00111111001111110011111100111111001111110011111100111111001111110011111111001101101111010011111100111111101101101110001100111111001111111101011111001110001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3fcdbd3f3fb6e33f3fd7ce3f3f3f3f3f3f
UTF-8 琉뗦슫泥귥깦溜롫쨱予댁쾫吟섇뙼徇먮젽琉뗥퓚理 111011111010011110001100111010111001011110100110111011001000101010101011111011111010011110100011111010101011011110100101111010101011100110100110111011111010011110001011111010111010000110101011111011001010100010110001111001001011101010001000111010111000110010000001111011001011111010101011111001011001000010011111111011001000010010000111111010111001100110111100111001011011111010000111111010111010100010101110111011001010000010111101111011111010011110001100111010111001011110100101111011011001001110011010111011111010011110100100 efa78ceb97a6ec8aabefa7a3eab7a5eab9a6efa78beba1abeca8b1e4ba88eb8c81ecbeabe5909fec8487eb99bce5be87eba8aeeca0bdefa78ceb97a5ed939aefa7a4
UHC 琉뗦슫泥귥깦溜롫쨱予댁쾫吟섇뙼徇먮젽琉뗥퓚理 1110101110100100100010111110011010011010101101001110110010110010100000101110110010000011100110001110101011111110100011101110101110100100100010111110010111111000101101001110110010110010100000101110101111100001100110001110010110001100101111111110001011011111100100001110101110100000101011111110101110100100100010111110010110111111100001011110110010110101 eba48be69ab4ecb282ec8398eafe8eeba48be5f8b4ecb282ebe198e58cbfe2df90eba0afeba48be5bf85ecb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)