To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 也や???オ???z也や???オ???zB 100101101110011110000010111000100011111100111111001111111000001101001001001111110011111100111111011110101001011011100111100000101110001000111111001111110011111110000011010010010011111100111111001111110111101001000010 96e782e23f3f3f83493f3f3f7a96e782e23f3f3f83493f3f3f7a42
EUC-JP 也や???オ???z也や???オ???zB 110011001110100110100100111001000011111100111111001111111010010110101010001111110011111100111111011110101100110011101001101001001110010000111111001111110011111110100101101010100011111100111111001111110111101001000010 cce9a4e43f3f3fa5aa3f3f3f7acce9a4e43f3f3fa5aa3f3f3f7a42
UTF-8 也や퓱銳볡オ嶪뤺뵷z也や퓱銳볡オ嶪뤺뵷zB 111001001011100110011111111000111000001010000100111011011001001110110001111010011000101010110011111010111011001110100001111000111000001010101010111001011011011010101010111010111010010010111010111010111011010110110111011110101110010010111001100111111110001110000010100001001110110110010011101100011110100110001010101100111110101110110011101000011110001110000010101010101110010110110110101010101110101110100100101110101110101110110101101101110111101001000010 e4b99fe38284ed93b1e98ab3ebb3a1e382aae5b6aaeba4baebb5b77ae4b99fe38284ed93b1e98ab3ebb3a1e382aae5b6aaeba4baebb5b77a42
UHC 也や퓱銳볡オ嶪뤺뵷z也や퓱銳볡オ嶪뤺뵷zB 111001011010010110101010111001001011111110010111111001111110010110010011111001111010101110101010111001011111010110001111111010001001010010110101011110101110010110100101101010101110010010111111100101111110011111100101100100111110011110101011101010101110010111110101100011111110100010010100101101010111101001000010 e5a5aae4bf97e7e593e7abaae5f58fe894b57ae5a5aae4bf97e7e593e7abaae5f58fe894b57a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)