To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?紆??篩???絅?伊逗工霽?淨?寃?? 10011111110001000011111111100010111111000011111100111111111000101011111100111111001111110011111111100011010001000011111110001000110010011001000010000000100011010100100011101000110001110011111110011111110001000011111110011011100000110011111100111111 9fc43fe2fc3f3fe2bf3f3f3fe3443f88c990808d48e8c73f9fc43f9b833f3f
EUC-JP 淨?紆?焌篩???絅?伊逗工霽?淨?寃?? 110111101100011000111111111001001111111000111111100011111100100111101000111001001100000100111111001111110011111111100101101001010011111110110000110010111011111111100000101110011010100111110000110010010011111111011110110001100011111111010101111000110011111100111111 dec63fe4fe3f8fc9e8e4c13f3f3fe5a53fb0cbbfe0b9a9f0c93fdec63fd5e33f3f
UTF-8 淨렠紆렣焌篩렍讀렮絅볕伊逗工霽렢淨렠寃닿횅 111001101011011110101000111010111010000010100000111001111011010010000110111010111010000010100011111001111000010010001100111001111010111110101001111010111010000010001101111011111010010110011010111010111010000010101110111001111011010110000101111010111011001110010101111001001011110010001010111010011000000010010111111001011011011110100101111010011001110010111101111010111010000010100010111001101011011110101000111010111010000010100000111001011010111110000011111010111000101110111111111011011001101010000101 e6b7a8eba0a0e7b486eba0a3e7848ce7afa9eba08defa59aeba0aee7b585ebb395e4bc8ae98097e5b7a5e99cbdeba0a2e6b7a8eba0a0e5af83eb8bbfed9a85
UHC 淨렠紆렣焌篩렍讀렮絅볕伊逗工霽렢淨렠寃닿횅 111011111110010010001110101100011110100111100001100011101011010011110001111000001101111011101000100011101010001111010100111001101000111010111011110011001110011110111010101101011110110010100101110101001110100011001101111011111111000010111000100011101011001111101111111001001000111010110001111010101011001010110100111010101100100010110111 efe48eb1e9e18eb4f1e0dee88ea3d4e68ebbcce7bab5eca5d4e8cdeff0b88eb3efe48eb1eab2b4eac8b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)