To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 褻??酬褻??遂N}褻??酬褻??遂N{^ 1110010111110110001111110011111110001111010101101110010111110110001111110011111110010000100010110100111001111101111001011111011000111111001111111000111101010110111001011111011000111111001111111001000010001011010011100111101101011110 e5f63f3f8f56e5f63f3f908b4e7de5f63f3f8f56e5f63f3f908b4e7b5e
EUC-JP 褻?芎酬褻?芎遂N}褻?芎酬褻?芎遂N{^ 11101010111110000011111110001111110101111011011010111101101101111110101011111000001111111000111111010111101101101011111111101011010011100111110111101010111110000011111110001111110101111011011010111101101101111110101011111000001111111000111111010111101101101011111111101011010011100111101101011110 eaf83f8fd7b6bdb7eaf83f8fd7b6bfeb4e7deaf83f8fd7b6bdb7eaf83f8fd7b6bfeb4e7b5e
UTF-8 褻뤝芎酬褻뤝芎遂N}褻뤝芎酬褻뤝芎遂N{^ 1110100010100100101110111110101110100100100111011110100010001010100011101110100110000101101011001110100010100100101110111110101110100100100111011110100010001010100011101110100110000001100000100100111001111101111010001010010010111011111010111010010010011101111010001000101010001110111010011000010110101100111010001010010010111011111010111010010010011101111010001000101010001110111010011000000110000010010011100111101101011110 e8a4bbeba49de88a8ee985ace8a4bbeba49de88a8ee981824e7de8a4bbeba49de88a8ee985ace8a4bbeba49de88a8ee981824e7b5e
UHC 褻뤝芎酬褻뤝芎遂N}褻뤝芎酬褻뤝芎遂N{^ 11100000111000011000111111001100110011111110010011100010110001101110000011100001100011111100110011001111111001001110001011000100010011100111110111100000111000011000111111001100110011111110010011100010110001101110000011100001100011111100110011001111111001001110001011000100010011100111101101011110 e0e18fcccfe4e2c6e0e18fcccfe4e2c44e7de0e18fcccfe4e2c6e0e18fcccfe4e2c44e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)