What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 勇?????鷹???????┼???魏 1001011101000101001111110011111100111111001111110011111110010001111010010011111100111111001111110011111100111111001111110011111110000100101010010011111100111111001111111110100110110000 97453f3f3f3f3f91e93f3f3f3f3f3f3f84a93f3f3fe9b0
EUC-JP 勇??佾??鷹??饔??堉?┼洧??魏 11001101101001100011111100111111100011111011000011111011001111110011111111000010111010110011111100111111100011111110100011101111001111110011111110001111101101111111110100111111101010001010101110001111110001111011010000111111001111111111001010110010 cda63f3f8fb0fb3f3fc2eb3f3f8fe8ef3f3f8fb7fd3fa8ab8fc7b43f3ff2b2
UTF-8 勇싳뮄佾볠꼷鷹껓펿饔낃퉭堉낉┼洧쏅맩魏 111001011000101110000111111011001000101110110011111010111010111010000100111001001011110110111110111010111011001110100000111010101011110010110111111010011011011110111001111010101011101110010011111011011000111010111111111010011010010110010100111010111000001010000011111011011000100110101101111001011010000010001001111010111000001010001001111000101001010010111100111001101011010010100111111011001000111110000101111010111010011110101001111010011010110110001111 e58b87ec8bb3ebae84e4bdbeebb3a0eabcb7e9b7b9eabb93ed8ebfe9a594eb8283ed89ade5a089eb8289e294bce6b4a7ec8f85eba7a9e9ad8f
UHC 勇싳뮄佾볠꼷鷹껓펿饔낃퉭堉낉┼洧쏅맩魏 1110100110111000100110101110110010010010100100111110110011101011100100111110011010000100100011111110101111101101100000111110111110111100100011101110100010111101100001011110101010111001100001011110101110111100100001011110111110100110101010111110101011111011100110111110101110010000101100011110101011100000 e9b89aec9293eceb93e6848febed83efbc8ee8bd85eab985ebbc85efa6abeafb9beb90b1eae0

SJIS-win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-win = CP932) and UNIX (EUC-JP) respectively.
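
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The function name `encode_in_charsets` and the mapping from the table's charset labels to Python codec names are assumptions made for illustration (in particular, UHC is assumed to correspond to Python's `cp949` codec). Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",  # assumption: UHC treated as Windows code page 949
}


def encode_in_charsets(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_in_charsets("勇").items():
        print(label, binary, hexadecimal)
```

For the single character 勇, the sketch yields e58b87 in UTF-8, 9745 in SJIS-win, and cda6 in EUC-JP, which agrees with the leading bytes of the corresponding rows above.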