Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN シ眄ウ猜シ眄ウ猗N}シ眄ウ猜シ眄ウ猗N{^ 1011110011100001101111101011001111100000110010001011110011100001101111101011001111100000110001100100111001111101101111001110000110111110101100111110000011001000101111001110000110111110101100111110000011000110010011100111101101011110 bce1beb3e0c8bce1beb3e0c64e7dbce1beb3e0c8bce1beb3e0c64e7b5e
EUC-JP シ眄ウ猜シ眄ウ猗N}シ眄ウ猜シ眄ウ猗N{^ 10001110101111001110001011000000100011101011001111100000110010101000111010111100111000101100000010001110101100111110000011001000010011100111110110001110101111001110001011000000100011101011001111100000110010101000111010111100111000101100000010001110101100111110000011001000010011100111101101011110 8ebce2c08eb3e0ca8ebce2c08eb3e0c84e7d8ebce2c08eb3e0ca8ebce2c08eb3e0c84e7b5e
UTF-8 シ眄ウ猜シ眄ウ猗N}シ眄ウ猜シ眄ウ猗N{^ 1110111110111101101111001110011110011100100001001110111110111101101100111110011110001100100111001110111110111101101111001110011110011100100001001110111110111101101100111110011110001100100101110100111001111101111011111011110110111100111001111001110010000100111011111011110110110011111001111000110010011100111011111011110110111100111001111001110010000100111011111011110110110011111001111000110010010111010011100111101101011110 efbdbce79c84efbdb3e78c9cefbdbce79c84efbdb3e78c974e7defbdbce79c84efbdb3e78c9cefbdbce79c84efbdb3e78c974e7b5e
UHC ?眄?猜?眄??N}?眄?猜?眄??N{^ 001111111101100011111000001111111110001111000100001111111101100011111000001111110011111101001110011111010011111111011000111110000011111111100011110001000011111111011000111110000011111100111111010011100111101101011110 3fd8f83fe3c43fd8f83f3f4e7d3fd8f83fe3c43fd8f83f3f4e7b5e
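
The conversion behind a table like this can be sketched in a few lines of Python. This is only an illustration, not the converter's actual implementation: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (for example, SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions, and Python's codecs may differ from the converter in edge cases.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CODECS.items():
        # Characters the charset cannot represent are replaced with "?" (0x3f).
        data = text.encode(codec, errors="replace")
        shown = data.decode(codec)
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


for label, shown, bits, hex_string in encode_table("あ"):
    print(label, shown, bits, hex_string)
```

With `errors="replace"`, any character missing from a charset is encoded as `?` (byte `3f`), which is the same pattern visible in the ISO-8859-1 and UHC rows.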

SJIS-WIN, EUC-JP: legacy character sets used mainly for Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).