To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 冶??????よ?}冶??????よ?{^ 10010110111010000011111100111111001111110011111100111111001111111000001011100110001111110111110110010110111010000011111100111111001111110011111100111111001111111000001011100110001111110111101101011110 96e83f3f3f3f3f3f82e63f7d96e83f3f3f3f3f3f82e63f7b5e
EUC-JP 冶??????よ?}冶??????よ?{^ 11001100111010100011111100111111001111110011111100111111001111111010010011101000001111110111110111001100111010100011111100111111001111110011111100111111001111111010010011101000001111110111101101011110 ccea3f3f3f3f3f3fa4e83f7dccea3f3f3f3f3f3fa4e83f7b5e
UTF-8 冶먮젷娛뤺꼫溜よ뒫}冶먮젷娛뤺꼫溜よ뒫{^ 111001011000011010110110111010111010100010101110111011001010000010110111111001011010100010011011111010111010010010111010111010101011110010101011111011111010011110001011111000111000001010001000111010111001001010101011011111011110010110000110101101101110101110101000101011101110110010100000101101111110010110101000100110111110101110100100101110101110101010111100101010111110111110100111100010111110001110000010100010001110101110010010101010110111101101011110 e586b6eba8aeeca0b7e5a89beba4baeabcabefa78be38288eb92ab7de586b6eba8aeeca0b7e5a89beba4baeabcabefa78be38288eb92ab7b5e
UHC 冶먮젷娛뤺꼫溜よ뒫}冶먮젷娛뤺꼫溜よ뒫{^ 111001011010011110010000111010111010000010101011111001111111010010001111111010001000010010001000111010101111111010101010111010001000101010100101011111011110010110100111100100001110101110100000101010111110011111110100100011111110100010000100100010001110101011111110101010101110100010001010101001010111101101011110 e5a790eba0abe7f48fe88488eafeaae88aa57de5a790eba0abe7f48fe88488eafeaae88aa57b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
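
The conversion shown in the table can be approximated with a short Python sketch. It is not the converter behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions chosen for illustration, and the page's own conversion tables may differ in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` runs in the table.

```python
# Charset labels from the table mapped to Python codec names.
# Assumption: these stdlib codecs correspond to the page's charsets.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("よ").items():
        print(label, bits, hex_string)
```

For the single character よ, this yields `e38288` in UTF-8, `82e6` in CP932, and `a4e8` in EUC-JP, the same byte sequences that appear for よ in the table rows above.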