To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 族逗愆??腫?遠魄族逗愆??腫?遠白^ 10010001101100001001000010000000100111001011001000111111001111111000111011101110001111111000100110010011111010011010111010010001101100001001000010000000100111001011001000111111001111111000111011101110001111111000100110010011100101001001001001011110 91b090809cb23f3f8eee3f8993e9ae91b090809cb23f3f8eee3f899394925e
EUC-JP 族逗愆??腫?遠魄族逗愆??腫?遠白^ 11000010101100101011111111100000110110001011010000111111001111111011110011110000001111111011000111110011111100101011000011000010101100101011111111100000110110001011010000111111001111111011110011110000001111111011000111110011110001111111001001011110 c2b2bfe0d8b43f3fbcf03fb1f3f2b0c2b2bfe0d8b43f3fbcf03fb1f3c7f25e
UTF-8 族逗愆롈렊腫렣遠魄族逗愆롈렊腫렣遠白^ 11100110100101111000111111101001100000001001011111100110100001001000011011101011101000011000100011101011101000001000101011101000100001011010101111101011101000001010001111101001100000011010000011101001101011011000010011100110100101111000111111101001100000001001011111100110100001001000011011101011101000011000100011101011101000001000101011101000100001011010101111101011101000001010001111101001100000011010000011100111100110011011110101011110 e6978fe98097e68486eba188eba08ae885abeba0a3e981a0e9ad84e6978fe98097e68486eba188eba08ae885abeba0a3e981a0e799bd5e
UHC 族逗愆롈렊腫렣遠魄族逗愆롈렊腫렣遠白^ 11110000111010011101010011101000110010111111000010001110110011101000111010100001111100001111111010001110101101001110101011000000110110111101111011110000111010011101010011101000110010111111000010001110110011101000111010100001111100001111111010001110101101001110101011000000110110111101110001011110 f0e9d4e8cbf08ece8ea1f0fe8eb4eac0dbdef0e9d4e8cbf08ece8ea1f0fe8eb4eac0dbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)