To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 陰竭?騏陰竭?饑N}陰竭?騏陰竭?饑N{^ 100010010100000111100010100100010011111111101001011101011000100101000001111000101001000100111111111010010101111101001110011111011000100101000001111000101001000100111111111010010111010110001001010000011110001010010001001111111110100101011111010011100111101101011110 8941e2913fe9758941e2913fe95f4e7d8941e2913fe9758941e2913fe95f4e7b5e
EUC-JP 陰竭?騏陰竭?饑N}陰竭?騏陰竭?饑N{^ 101100011010001011100011111100010011111111110001110101101011000110100010111000111111000100111111111100011100000001001110011111011011000110100010111000111111000100111111111100011101011010110001101000101110001111110001001111111111000111000000010011100111101101011110 b1a2e3f13ff1d6b1a2e3f13ff1c04e7db1a2e3f13ff1d6b1a2e3f13ff1c04e7b5e
UTF-8 陰竭렮騏陰竭렮饑N}陰竭렮騏陰竭렮饑N{^ 1110100110011001101100001110011110101011101011011110101110100000101011101110100110101000100011111110100110011001101100001110011110101011101011011110101110100000101011101110100110100101100100010100111001111101111010011001100110110000111001111010101110101101111010111010000010101110111010011010100010001111111010011001100110110000111001111010101110101101111010111010000010101110111010011010010110010001010011100111101101011110 e999b0e7abadeba0aee9a88fe999b0e7abadeba0aee9a5914e7de999b0e7abadeba0aee9a88fe999b0e7abadeba0aee9a5914e7b5e
UHC 陰竭렮騏陰竭렮饑N}陰竭렮騏陰竭렮饑N{^ 11101011111001001100101011100110100011101011101111010001110010011110101111100100110010101110011010001110101110111101000111000111010011100111110111101011111001001100101011100110100011101011101111010001110010011110101111100100110010101110011010001110101110111101000111000111010011100111101101011110 ebe4cae68ebbd1c9ebe4cae68ebbd1c74e7debe4cae68ebbd1c9ebe4cae68ebbd1c74e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)