What bit string is a character (or a short run of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??筍?????輿??逸??矣??繞??夷? 100010010110100100111111001111111110001010100001001111110011111100111111001111110011111110010111011000000011111100111111100010001110110100111111001111111110000111100001001111110011111111100011100001010011111100111111100010001100111000111111 89693f3fe2a13f3f3f3f3f97603f3f88ed3f3fe1e13f3fe3853f3f88ce3f
EUC-JP 永??筍??洹??輿??逸??矣??繞??夷? 1011000111001010001111110011111111100100101000110011111100111111100011111100011110111010001111110011111111001101110000010011111100111111101100001110111100111111001111111110001011100011001111110011111111100101111001010011111100111111101100001101000000111111 b1ca3f3fe4a33f3f8fc7ba3f3fcdc13f3fb0ef3f3fe2e33f3fe5e53f3fb0d03f
UTF-8 永띔래筍곁독洹싲틦輿삳쵓逸댐쭆矣명뫒繞볤퀋夷퀲 111001101011000010111000111010111001110110010100111010111001111010011000111001111010110110001101111010101011001110000001111010111000111110000101111001101011010010111001111011001000101110110010111011011000101110100110111010001011110010111111111011001000001010110011111011001011010110010011111010011000000010111000111010111000110010010000111011001010110110000110111001111001111110100011111010111010101010000101111010111010101110010010111001111011100110011110111010111011001110100100111011011000000010001011111001011010010010110111111011011000000010110010 e6b0b8eb9d94eb9e98e7ad8deab381eb8f85e6b4b9ec8bb2ed8ba6e8bcbfec82b3ecb593e980b8eb8c90ecad86e79fa3ebaa85ebab92e7b99eebb3a4ed808be5a4b7ed80b2
UHC 永띔래筍곁독洹싲틦輿삳쵓逸댐쭆矣명뫒繞볤퀋夷퀲 11100111101101011011011011101010101101111010000111100010111011001011000011100111101101011011011011101010101101111001101011101011101110101001000011100110101010111011101111101011101011001001010111101100111011111011010011101111101001111000001011101011111110001011100011101101100100011011010011101001101001001001001111101010101100111000000111101100101010001011010001000101 e7b5b6eab7a1e2ecb0e7b5b6eab79aebba90e6abbbebac95ecefb4efa782ebf8b8ed91b4e9a493eab381eca8b445

SJIS-Win, EUC-JP: Legacy charsets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
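
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the helper name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table, although the page may handle such characters differently.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters become "?" (byte 0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "永"  # a single character present in the table's input
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

For the single character 永 this yields e6b0b8 in UTF-8, 8969 in CP932, and b1ca in EUC-JP, matching the leading bytes of the corresponding rows in the table; in ISO-8859-1 the character is not representable and comes out as 3f.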