Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳??孺??袁с?嚴щ?悠??怨k?倭?? 10001010011110000011111100111111100110110111110100111111001111111110010111001101100001001000001100111111100110101000111010000100100010110011111110010111010010010011111100111111100010011000010110000010100010110011111110011000011000000011111100111111 8a783f3f9b7d3f3fe5cd84833f9a8e848b3f97493f3f8985828b3f98603f3f
EUC-JP 岳??孺??袁с?嚴щ?悠??怨k?倭?? 10110011110110010011111100111111110101011101111000111111001111111110101011001111101001111110001100111111110100111110111010100111111010110011111111001101101010100011111100111111101100011110010110100011111010110011111111001111110000010011111100111111 b3d93f3fd5de3f3feacfa7e33fd3eea7eb3fcdaa3f3fb1e5a3eb3fcfc13f3f
UTF-8 岳묒빘孺욤짆袁с렅嚴щㅏ悠롧넭怨k쳴倭얠쾿 11100101101100101011001111101011101011001001001011101011101110011001100011100101101011011011101011101100100110101010010011101100101001111000011011101000101000101000000111010001100000011110101110100000100001011110010110011010101101001101000110001001111000111000010110001111111001101000001010100000111010111010000110100111111010111000010010101101111001101000000010101000111011111011110110001011111011001011001110110100111001011000000010101101111011001001011010100000111011001011111010111111 e5b2b3ebac92ebb998e5adbaec9aa4eca786e8a281d181eba085e59ab4d189e3858fe682a0eba1a7eb84ade680a8efbd8becb3b4e580adec96a0ecbebf
UHC 岳묒빘孺욤짆袁с렅嚴щㅏ悠롧넭怨k쳴倭얠쾿 111001001011111110010001111011001001010110111001111010101110100010111111111010001010001110010101111010101011111010101100111000111000111010011111111001011111000110101100111010111010010010111111111010101110110110001110111001111000011010101100111010101011001110100011111010111010101110010111111010001101111010111110111011001011001010010101 e4bf91ec95b9eae8bfe8a395eabeace38e9fe5f1aceba4bfeaed8ee786aceab3a3ebab97e8debeecb295

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
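
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Python's codecs may also differ from this page's converter for some characters.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

With `errors="replace"`, a character that a charset cannot represent is encoded as `?` (byte 0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row above.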