To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??揖??矜猿??純??筌Q??鸚??泣 111010010110011000111111001111111001011101001011001111110011111111100001111000001000100110001110001111110011111110001111100000110011111100111111111000101010001110000010011100000011111100111111111010100101111100111111001111111000101110000011 e9663f3f974b3f3fe1e0898e3f3f8f833f3fe2a382703f3fea5f3f3f8b83
EUC-JP 馭??揖??矜猿??純??筌Q??鸚??泣 111100011100011100111111001111111100110110101100001111110011111111100010111000101011000111101110001111110011111110111101111000110011111100111111111001001010010110100011110100010011111100111111111100111100000000111111001111111011010111100011 f1c73f3fcdac3f3fe2e2b1ee3f3fbde33f3fe4a5a3d13f3ff3c03f3fb5e3
UTF-8 馭곥룊揖듸㎗矜猿뚧걖純됱돲筌Q딄퍕鸚룸엨泣 111010011010011010101101111010101011001110100101111010111010001110001010111001101000111110010110111010111001001110111000111000111000111010010111111001111001111110011100111001111000110010111111111010111001101010100111111010101011000110010110111001111011010010010100111010111001000010110001111010111000111110110010111001111010110110001100111011111011110010110001111010111001010010000100111011011000110110010101111010011011100010011010111010111010001110111000111011001001011110101000111001101011001110100011 e9a6adeab3a5eba38ae68f96eb93b8e38e97e79f9ce78cbfeb9aa7eab196e7b494eb90b1eb8fb2e7ad8cefbcb1eb9484ed8d95e9b89aeba3b8ec97a8e6b3a3
UHC 馭곥룊揖듸㎗矜猿뚧걖純됱돲筌Q딄퍕鸚룸엨泣 111001011101111110000001111000111000111110001001111010111110011110110101111011111010011110100011110100001110100011101010101110111000110011100110100000011000000111100010111011011000100111101100100010011011010111101111101001111010001111010001100010101110101010111011100011001110010110100100101101111110101110011110100000011110101111101000 e5df81e38f89ebe7b5efa7a3d0e8eabb8ce68181e2ed89ec89b5efa7a3d18aeabb8ce5a4b7eb9e81ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)