To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 蘊??巍?????W}蘊??巍?????W{^ 111001010101110100111111001111111001101111011001001111110011111100111111001111110011111101010111011111011110010101011101001111110011111110011011110110010011111100111111001111110011111100111111010101110111101101011110 e55d3f3f9bd93f3f3f3f3f577de55d3f3f9bd93f3f3f3f3f577b5e
EUC-JP 蘊??巍?????W}蘊??巍?????W{^ 111010011011111000111111001111111101011011011011001111110011111100111111001111110011111101010111011111011110100110111110001111110011111111010110110110110011111100111111001111110011111100111111010101110111101101011110 e9be3f3fd6db3f3f3f3f3f577de9be3f3fd6db3f3f3f3f3f577b5e
UTF-8 蘊딅젨巍띾떯溜㏓젪W}蘊딅젨巍띾떯溜㏓젪W{^ 1110100010011000100010101110101110010100100001011110110010100000101010001110010110110111100011011110101110011101101111101110101110010110101011111110111110100111100010111110001110001111100100111110110010100000101010100101011101111101111010001001100010001010111010111001010010000101111011001010000010101000111001011011011110001101111010111001110110111110111010111001011010101111111011111010011110001011111000111000111110010011111011001010000010101010010101110111101101011110 e8988aeb9485eca0a8e5b78deb9dbeeb96afefa78be38f93eca0aa577de8988aeb9485eca0a8e5b78deb9dbeeb96afefa78be38f93eca0aa577b5e
UHC 蘊딅젨巍띾떯溜㏓젪W}蘊딅젨巍띾떯溜㏓젪W{^ 1110100010110011100010101110101110100000101000001110100011100100100011011110101110001011101111111110101011111110101001111110101110100000101000100101011101111101111010001011001110001010111010111010000010100000111010001110010010001101111010111000101110111111111010101111111010100111111010111010000010100010010101110111101101011110 e8b38aeba0a0e8e48deb8bbfeafea7eba0a2577de8b38aeba0a0e8e48deb8bbfeafea7eba0a2577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)