What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??揖??袁??筌??援??癒??鸚?? 11101001011001100011111100111111100101110100101100111111001111111110010111001101001111110011111111100010101000110011111100111111100010011000011100111111001111111001011011111100001111110011111111101010010111110011111100111111 e9663f3f974b3f3fe5cd3f3fe2a33f3f89873f3f96fc3f3fea5f3f3f
EUC-JP 馭??揖??袁??筌??援??癒??鸚?? 11110001110001110011111100111111110011011010110000111111001111111110101011001111001111110011111111100100101001010011111100111111101100011110011100111111001111111100110011111110001111110011111111110011110000000011111100111111 f1c73f3fcdac3f3feacf3f3fe4a53f3fb1e73f3fccfe3f3ff3c03f3f
UTF-8 馭곥룊揖놂㎖袁⑹숯筌듬쵐援섇㎤癒곗궒鸚룸띁 111010011010011010101101111010101011001110100101111010111010001110001010111001101000111110010110111010111000011010000010111000111000111010010110111010001010001010000001111000101001000110111001111011001000100010101111111001111010110110001100111010111001001110101100111011001011010110010000111001101000111110110100111011001000010010000111111000111000111010100100111001111001100110010010111010101011001110010111111010101011011010010010111010011011100010011010111010111010001110111000111010111001110110000001 e9a6adeab3a5eba38ae68f96eb8682e38e96e8a281e291b9ec88afe7ad8ceb93acecb590e68fb4ec8487e38ea4e79992eab397eab692e9b89aeba3b8eb9d81
UHC 馭곥룊揖놂㎖袁⑹숯筌듬쵐援섇㎤癒곗궒鸚룸띁 111001011101111110000001111000111000111110001001111010111110011110110011111011111010011110100010111010101011111010101001111011001011110110100001111011111010011110110101111010111010110010010010111010101011010110011000111001011010011110101000111010111010100010110000111011001000001010100111111001011010010010110111111010111000110110111100 e5df81e38f89ebe7b3efa7a2eabea9ecbda1efa7b5ebac92eab598e5a7a8eba8b0ec82a7e5a4b7eb8dbc

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
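
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes Python's codec names stand in for the charsets listed above (`cp932` for SJIS-Win, `cp949` for UHC), and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.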