To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????N}??????????N{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111010011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN ???宋????宋?N}???宋????宋?N{^ 0011111100111111001111111001000101110110001111110011111100111111001111111001000101110110001111110100111001111101001111110011111100111111100100010111011000111111001111110011111100111111100100010111011000111111010011100111101101011110 3f3f3f91763f3f3f3f91763f4e7d3f3f3f91763f3f3f3f91763f4e7b5e
EUC-JP ???宋????宋?N}???宋????宋?N{^ 0011111100111111001111111100000111010111001111110011111100111111001111111100000111010111001111110100111001111101001111110011111100111111110000011101011100111111001111110011111100111111110000011101011100111111010011100111101101011110 3f3f3fc1d73f3f3f3fc1d73f4e7d3f3f3fc1d73f3f3f3fc1d73f4e7b5e
UTF-8 樂롦슢宋릈樂롦슢宋릃N}樂롦슢宋릈樂롦슢宋릃N{^ 1110111110100110101111111110101110100001101001101110110010001010101000101110010110101110100010111110101110100110100010001110111110100110101111111110101110100001101001101110110010001010101000101110010110101110100010111110101110100110100000110100111001111101111011111010011010111111111010111010000110100110111011001000101010100010111001011010111010001011111010111010011010001000111011111010011010111111111010111010000110100110111011001000101010100010111001011010111010001011111010111010011010000011010011100111101101011110 efa6bfeba1a6ec8aa2e5ae8beba688efa6bfeba1a6ec8aa2e5ae8beba6834e7defa6bfeba1a6ec8aa2e5ae8beba688efa6bfeba1a6ec8aa2e5ae8beba6834e7b5e
UHC 樂롦슢宋릈樂롦슢宋릃N}樂롦슢宋릈樂롦슢宋릃N{^ 111010001111100110001110111001101001101010101110111000011110010010010000011010001110100011111001100011101110011010011010101011101110000111100100100100000110011001001110011111011110100011111001100011101110011010011010101011101110000111100100100100000110100011101000111110011000111011100110100110101010111011100001111001001001000001100110010011100111101101011110 e8f98ee69aaee1e49068e8f98ee69aaee1e490664e7de8f98ee69aaee1e49068e8f98ee69aaee1e490664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)