Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN ??㎝夷????㎏N}??㎝夷????㎏N{^ 0011111100111111100001110111000010001000110011100011111100111111001111110011111110000111011100110100111001111101001111110011111110000111011100001000100011001110001111110011111100111111001111111000011101110011010011100111101101011110 3f3f877088ce3f3f3f3f87734e7d3f3f877088ce3f3f3f3f87734e7b5e
EUC-JP ???夷?????N}???夷?????N{^ 00111111001111110011111110110000110100000011111100111111001111110011111100111111010011100111110100111111001111110011111110110000110100000011111100111111001111110011111100111111010011100111101101011110 3f3f3fb0d03f3f3f3f3f4e7d3f3f3fb0d03f3f3f3f3f4e7b5e
UTF-8 梨꾩㎝夷띿콡吏뺤㎏N}梨꾩㎝夷띿콡吏뺤㎏N{^ 1110111110100111101000101110101010111110101010011110001110001110100111011110010110100100101101111110101110011101101111111110110010111101101000011110111110100111100111101110101110111010101001001110001110001110100011110100111001111101111011111010011110100010111010101011111010101001111000111000111010011101111001011010010010110111111010111001110110111111111011001011110110100001111011111010011110011110111010111011101010100100111000111000111010001111010011100111101101011110 efa7a2eabea9e38e9de5a4b7eb9dbfecbda1efa79eebbaa4e38e8f4e7defa7a2eabea9e38e9de5a4b7eb9dbfecbda1efa79eebbaa4e38e8f4e7b5e
UHC 梨꾩㎝夷띿콡吏뺤㎏N}梨꾩㎝夷띿콡吏뺤㎏N{^ 1110110010110001100001001110110010100111101011111110110010101000100011011110110010110001100110011110110010100111100101011110110010100111101110000100111001111101111011001011000110000100111011001010011110101111111011001010100010001101111011001011000110011001111011001010011110010101111011001010011110111000010011100111101101011110 ecb184eca7afeca88decb199eca795eca7b84e7decb184eca7afeca88decb199eca795eca7b84e7b5e

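SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a character set cannot represent appear as "?" (0x3f) in the table.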
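
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page: the function name encode_report and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration, and Python's codecs may differ from the converter in edge cases. Unencodable characters are replaced with "?" (0x3f), which matches what the table shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# cp932 stands in for SJIS-WIN and cp949 for UHC; this is an approximation.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    report = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for characters
        # that the target charset cannot represent.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        report[label] = (bits, data.hex())
    return report


if __name__ == "__main__":
    # Print one row per charset for a short sample string.
    for label, (bits, hexed) in encode_report("N{^").items():
        print(label, bits, hexed)
```

For the ASCII tail of the sample input, "N{^", every charset in this sketch yields the same bytes, 4e7b5e, which agrees with the last three bytes of each row in the table.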