What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 淨?紆??絲?????淨?紆??絲?????B 1001111111000100001111111110001011111100001111110011111111100011010011100011111100111111001111110011111100111111100111111100010000111111111000101111110000111111001111111110001101001110001111110011111100111111001111110011111101000010 9fc43fe2fc3f3fe34e3f3f3f3f3f9fc43fe2fc3f3fe34e3f3f3f3f3f42
EUC-JP 淨?紆?焌絲??勖??淨?紆?焌絲??勖??B 11011110110001100011111111100100111111100011111110001111110010011110100011100101101011110011111100111111100011111011001111101101001111110011111111011110110001100011111111100100111111100011111110001111110010011110100011100101101011110011111100111111100011111011001111101101001111110011111101000010 dec63fe4fe3f8fc9e8e5af3f3f8fb3ed3f3fdec63fe4fe3f8fc9e8e5af3f3f8fb3ed3f3f42
UTF-8 淨렠紆렣焌絲렟렩勖쾅긺淨렠紆렣焌絲렟렩勖쾅긺B 11100110101101111010100011101011101000001010000011100111101101001000011011101011101000001010001111100111100001001000110011100111101101011011001011101011101000001001111111101011101000001010100111100101100010111001011011101100101111101000010111101010101110001011101011100110101101111010100011101011101000001010000011100111101101001000011011101011101000001010001111100111100001001000110011100111101101011011001011101011101000001001111111101011101000001010100111100101100010111001011011101100101111101000010111101010101110001011101001000010 e6b7a8eba0a0e7b486eba0a3e7848ce7b5b2eba09feba0a9e58b96ecbe85eab8bae6b7a8eba0a0e7b486eba0a3e7848ce7b5b2eba09feba0a9e58b96ecbe85eab8ba42
UHC 淨렠紆렣焌絲렟렩勖쾅긺淨렠紆렣焌絲렟렩勖쾅긺B 111011111110010010001110101100011110100111100001100011101011010011110001111000001101111011101010100011101011000010001110101101111110100111101101110001001110011110110001111001111110111111100100100011101011000111101001111000011000111010110100111100011110000011011110111010101000111010110000100011101011011111101001111011011100010011100111101100011110011101000010 efe48eb1e9e18eb4f1e0deea8eb08eb7e9edc4e7b1e7efe48eb1e9e18eb4f1e0deea8eb08eb7e9edc4e7b1e742

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
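
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_report` is a hypothetical helper name. Python's codecs may also map some characters differently from the converter used on this page. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("B"):
        # "B" is plain ASCII, so every charset yields 01000010 / 42.
        print(label, bits, hex_string)
```

The final `B` in every row of the table encodes to `01000010` (hex `42`) for the same reason: all five charsets are ASCII-compatible for that character.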