To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 譽??泳??倭??汝??岳?ぐ岳??鳶?? 1110011010100011001111110011111110001001011010100011111100111111100110000110000000111111001111111001001111110000001111110011111110001010011110000011111110000010101011101000101001111000001111110011111110010011110011100011111100111111 e6a33f3f896a3f3f98603f3f93f03f3f8a783f82ae8a783f3f93ce3f3f
EUC-JP 譽??泳??倭??汝??岳?ぐ岳??鳶?? 1110110010100101001111110011111110110001110010110011111100111111110011111100000100111111001111111100011011110010001111110011111110110011110110010011111110100100101100001011001111011001001111110011111111000110110100000011111100111111 eca53f3fb1cb3f3fcfc13f3fc6f23f3fb3d93fa4b0b3d93f3fc6d03f3f
UTF-8 譽긴퍟泳싪갬倭좄쥤汝싪쥤岳껇ぐ岳껃끀鳶멩뿈 111010001010110110111101111010101011100010110100111011011000110110011111111001101011001110110011111011001000101110101010111010101011000010101100111001011000000010101101111011001010001010000100111011001010010110100100111001101011000110011101111011001000101110101010111011001010010110100100111001011011001010110011111010101011101110000111111000111000000110010000111001011011001010110011111010101011101110000011111010111000000110000000111010011011001110110110111010111010100110101001111010111011111110001000 e8adbdeab8b4ed8d9fe6b3b3ec8baaeab0ace580adeca284eca5a4e6b19dec8baaeca5a4e5b2b3eabb87e38190e5b2b3eabb83eb8180e9b3b6eba9a9ebbf88
UHC 譽긴퍟泳싪갬倭좄쥤汝싪쥤岳껇ぐ岳껃끀鳶멩뿈 111001111110001010110001111001001011101110010110111001111011011010011010111010001011000010110111111010001101111010100000111010001010001010010110111001101010001110011010111010001010001010010110111001001011111110000011111010001010101010110000111001001011111110000011111001011000010110110110111001101110100110111000111001101001011110001111 e7e2b1e4bb96e7b69ae8b0b7e8dea0e8a296e6a39ae8a296e4bf83e8aab0e4bf83e585b6e6e9b8e6978f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)