To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ???猥??秧①Чn}???猥??秧①Чn{^ 00111111001111110011111111100000110011100011111100111111111000100101111010000111010000001000010001011000011011100111110100111111001111110011111111100000110011100011111100111111111000100101111010000111010000001000010001011000011011100111101101011110 3f3f3fe0ce3f3fe25e874084586e7d3f3f3fe0ce3f3fe25e874084586e7b5e
EUC-JP ???猥??秧?Чn}???猥??秧?Чn{^ 0011111100111111001111111110000011010000001111110011111111100011101111110011111110100111101110010110111001111101001111110011111100111111111000001101000000111111001111111110001110111111001111111010011110111001011011100111101101011110 3f3f3fe0d03f3fe3bf3fa7b96e7d3f3f3fe0d03f3fe3bf3fa7b96e7b5e
UTF-8 僚묌㉦猥뗩뙧秧①Чn}僚묌㉦猥뗩뙧秧①Чn{^ 111011111010011010111011111010111010110010001100111000111000100110100110111001111000110010100101111010111001011110101001111010111001100110100111111001111010011110100111111000101001000110100000110100001010011101101110011111011110111110100110101110111110101110101100100011001110001110001001101001101110011110001100101001011110101110010111101010011110101110011001101001111110011110100111101001111110001010010001101000001101000010100111011011100111101101011110 efa6bbebac8ce389a6e78ca5eb97a9eb99a7e7a7a7e291a0d0a76e7defa6bbebac8ce389a6e78ca5eb97a9eb99a7e7a7a7e291a0d0a76e7b5e
UHC 僚묌㉦猥뗩뙧秧①Чn}僚묌㉦猥뗩뙧秧①Чn{^ 1110100011101000100100011110100110101000101101111110100011100101100010111110100110001100101010111110010011101011101010001110011110101100101110010110111001111101111010001110100010010001111010011010100010110111111010001110010110001011111010011000110010101011111001001110101110101000111001111010110010111001011011100111101101011110 e8e891e9a8b7e8e58be98cabe4eba8e7acb96e7de8e891e9a8b7e8e58be98cabe4eba8e7acb96e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)