To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????¨? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111010100000111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fa83f
SJIS-WIN 誤??韋??蹂?????猿??淫??筌?¨陰 100011001110101100111111001111111110100011101000001111110011111111100110111110000011111100111111001111110011111100111111100010011000111000111111001111111000100011111010001111110011111111100010101000110011111110000001010011101000100101000001 8ceb3f3fe8e83f3fe6f83f3f3f3f3f898e3f3f88fa3f3fe2a33f814e8941
EUC-JP 誤??韋??蹂?????猿??淫??筌?¨陰 101110001110110100111111001111111111000011101010001111110011111111101100111110100011111100111111001111110011111100111111101100011110111000111111001111111011000011111100001111110011111111100100101001010011111110100001101011111011000110100010 b8ed3f3ff0ea3f3fecfa3f3f3f3f3fb1ee3f3fb0fc3f3fe4a53fa1afb1a2
UTF-8 誤곸룆韋귨ℓ蹂좊짎亮쎄퍓猿쒏쾬淫볧맊筌욌¨陰 1110100010101010101001001110101010110011101110001110101110100011100001101110100110011111100010111110101010110111101010001110001010000100100100111110100010111001100000101110110010100010100010101110110010100111100011101110111110100101101101111110110010001110100001001110110110001101100100111110011110001100101111111110110010010010100011111110110010111110101011001110011010110111101010111110101110110011101001111110101110100111100010101110011110101101100011001110110010011010100011001100001010101000111010011001100110110000 e8aaa4eab3b8eba386e99f8beab7a8e28493e8b982eca28aeca78eefa5b7ec8e84ed8d93e78cbfec928fecbeace6b7abebb3a7eba78ae7ad8cec9a8cc2a8e999b0
UHC 誤곸룆韋귨ℓ蹂좊짎亮쎄퍓猿쒏쾬淫볧맊筌욌¨陰 1110100010100110100000011110110010001111100001011110101011011111100000101110111110100111101001001110101110110011101000001110101110100011100110101110010110111001101111011110101010111011100010101110101010111011100111001110011010110010100000111110101111100010100100111110110110010000101000101110111110100111100111101110101110100001101001111110101111100100 e8a681ec8f85eadf82efa7a4ebb3a0eba39ae5b9bdeabb8aeabb9ce6b283ebe293ed90a2efa79eeba1a7ebe4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)