To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ??????伎烏?}??????伎烏?{^ 00111111001111110011111100111111001111110011111110001010111010101000100101000111001111110111110100111111001111110011111100111111001111110011111110001010111010101000100101000111001111110111101101011110 3f3f3f3f3f3f8aea89473f7d3f3f3f3f3f3f8aea89473f7b5e
EUC-JP ??????伎烏?}??????伎烏?{^ 00111111001111110011111100111111001111110011111110110100111011001011000110101000001111110111110100111111001111110011111100111111001111110011111110110100111011001011000110101000001111110111101101011110 3f3f3f3f3f3fb4ecb1a83f7d3f3f3f3f3f3fb4ecb1a83f7b5e
UTF-8 娛붵꺈隸잒몘伎烏냣}娛붵꺈隸잒몘伎烏냣{^ 111001011010100010011011111010111011011010110101111010101011101010001000111011111010011010111000111011001001111010010010111010111010101010011000111001001011110010001110111001111000001110001111111010111000001110100011011111011110010110101000100110111110101110110110101101011110101010111010100010001110111110100110101110001110110010011110100100101110101110101010100110001110010010111100100011101110011110000011100011111110101110000011101000110111101101011110 e5a89bebb6b5eaba88efa6b8ec9e92ebaa98e4bc8ee7838feb83a37de5a89bebb6b5eaba88efa6b8ec9e92ebaa98e4bc8ee7838feb83a37b5e
UHC 娛붵꺈隸잒몘伎烏냣}娛붵꺈隸잒몘伎烏냣{^ 111001111111010010010100111000111000001110101111111001111110011010011111111010001001000110000110110100001110101111101000101000011000011001101110011111011110011111110100100101001110001110000011101011111110011111100110100111111110100010010001100001101101000011101011111010001010000110000110011011100111101101011110 e7f494e383afe7e69fe89186d0ebe8a1866e7de7f494e383afe7e69fe89186d0ebe8a1866e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)