To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 厭?????巍??}厭?????巍??{^ 10001001011111010011111100111111001111110011111100111111100110111101100100111111001111110111110110001001011111010011111100111111001111110011111100111111100110111101100100111111001111110111101101011110 897d3f3f3f3f3f9bd93f3f7d897d3f3f3f3f3f9bd93f3f7b5e
EUC-JP 厭?????巍??}厭?????巍??{^ 10110001110111100011111100111111001111110011111100111111110101101101101100111111001111110111110110110001110111100011111100111111001111110011111100111111110101101101101100111111001111110111101101011110 b1de3f3f3f3f3fd6db3f3f7db1de3f3f3f3f3fd6db3f3f7b5e
UTF-8 厭묒뼚溜뀀젧巍랁꽩}厭묒뼚溜뀀젧巍랁꽩{^ 111001011000111010101101111010111010110010010010111010111011110010011010111011111010011110001011111010111000000010000000111011001010000010100111111001011011011110001101111010111001111010000001111010101011110110101001011111011110010110001110101011011110101110101100100100101110101110111100100110101110111110100111100010111110101110000000100000001110110010100000101001111110010110110111100011011110101110011110100000011110101010111101101010010111101101011110 e58eadebac92ebbc9aefa78beb8080eca0a7e5b78deb9e81eabda97de58eadebac92ebbc9aefa78beb8080eca0a7e5b78deb9e81eabda97b5e
UHC 厭묒뼚溜뀀젧巍랁꽩}厭묒뼚溜뀀젧巍랁꽩{^ 111001101111010010010001111011001001011010100000111010101111111010110010111010111010000010011111111010001110010010001101111011011000010010110100011111011110011011110100100100011110110010010110101000001110101011111110101100101110101110100000100111111110100011100100100011011110110110000100101101000111101101011110 e6f491ec96a0eafeb2eba09fe8e48ded84b47de6f491ec96a0eafeb2eba09fe8e48ded84b47b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)