To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN ?ゆ??よ?亦??D?ゆ??よ?亦??D^ 001111111000001011100100001111110011111110000010111001100011111110010110100100100011111100111111010001000011111110000010111001000011111100111111100000101110011000111111100101101001001000111111001111110100010001011110 3f82e43f3f82e63f96923f3f443f82e43f3f82e63f96923f3f445e
EUC-JP ?ゆ??よ?亦??D?ゆ??よ?亦??D^ 001111111010010011100110001111110011111110100100111010000011111111001011111100100011111100111111010001000011111110100100111001100011111100111111101001001110100000111111110010111111001000111111001111110100010001011110 3fa4e63f3fa4e83fcbf23f3f443fa4e63f3fa4e83fcbf23f3f445e
UTF-8 銳ゆ꼇銳よ뮅亦사뿏D銳ゆ꼇銳よ뮅亦사뿏D^ 111010011000101010110011111000111000001010000110111010101011110010000111111010011000101010110011111000111000001010001000111010111010111010000101111001001011101010100110111011001000001010101100111010111011111110001111010001001110100110001010101100111110001110000010100001101110101010111100100001111110100110001010101100111110001110000010100010001110101110101110100001011110010010111010101001101110110010000010101011001110101110111111100011110100010001011110 e98ab3e38286eabc87e98ab3e38288ebae85e4baa6ec82acebbf8f44e98ab3e38286eabc87e98ab3e38288ebae85e4baa6ec82acebbf8f445e
UHC 銳ゆ꼇銳よ뮅亦사뿏D銳ゆ꼇銳よ뮅亦사뿏D^ 111001111110010110101010111001101011001010111011111001111110010110101010111010001001001010010100111001101011001010111011111001111001011110010100010001001110011111100101101010101110011010110010101110111110011111100101101010101110100010010010100101001110011010110010101110111110011110010111100101000100010001011110 e7e5aae6b2bbe7e5aae89294e6b2bbe7979444e7e5aae6b2bbe7e5aae89294e6b2bbe79794445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)