What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ??????醫??}??????醫??{^ 0011111100111111001111110011111100111111001111111110011111001110001111110011111101111101001111110011111100111111001111110011111100111111111001111100111000111111001111110111101101011110 3f3f3f3f3f3fe7ce3f3f7d3f3f3f3f3f3fe7ce3f3f7b5e
EUC-JP 倻?????醫??}倻?????醫??{^ 100011111011000111110110001111110011111100111111001111110011111111101110110100000011111100111111011111011000111110110001111101100011111100111111001111110011111100111111111011101101000000111111001111110111101101011110 8fb1f63f3f3f3f3feed03f3f7d8fb1f63f3f3f3f3feed03f3f7b5e
UTF-8 倻귣떵行싷쭓醫덇국}倻귣떵行싷쭓醫덇국{^ 111001011000000010111011111010101011011110100011111010111001011010110101111011111010100010001000111011001000101110110111111011001010110110010011111010011000011010101011111010111000110110000111111010101011010110101101011111011110010110000000101110111110101010110111101000111110101110010110101101011110111110101000100010001110110010001011101101111110110010101101100100111110100110000110101010111110101110001101100001111110101010110101101011010111101101011110 e580bbeab7a3eb96b5efa888ec8bb7ecad93e986abeb8d87eab5ad7de580bbeab7a3eb96b5efa888ec8bb7ecad93e986abeb8d87eab5ad7b5e
UHC 倻귣떵行싷쭓醫덇국}倻귣떵行싷쭓醫덇국{^ 111001011010011010000010111010111011011010111010111110101010000110011010111011111010011110001011111011001010001010001000111010101011000110111001011111011110010110100110100000101110101110110110101110101111101010100001100110101110111110100111100010111110110010100010100010001110101010110001101110010111101101011110 e5a682ebb6bafaa19aefa78beca288eab1b97de5a682ebb6bafaa19aefa78beca288eab1b97b5e
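A table like the one above can be approximated offline. The following is a minimal sketch, not the converter's actual implementation: it assumes the Python codec names `cp932` and `cp949` correspond to the SJIS-WIN and UHC rows, and that characters a charset cannot represent are replaced with `?` (byte 3f), as the ISO-8859-1 row suggests. The name `encode_rows` is hypothetical.

```python
# Assumed mapping from the table's row labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(format(byte, "08b") for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("{^"):
        print(label, bits, hex_string)
```

For ASCII input such as `{^`, every row yields the same bytes (7b5e), which matches the tail of each row in the table.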

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
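
The difference between SJIS-Win (CP932) and plain Shift_JIS can be seen with a vendor-extension character. This sketch assumes Python's `cp932` and `shift_jis` codecs stand in for the two charsets; `try_encode` is a hypothetical helper that returns `None` when a character cannot be encoded.

```python
def try_encode(text, codec):
    """Return the hex bytes of text in codec, or None if it cannot be encoded."""
    try:
        return text.encode(codec).hex()
    except UnicodeEncodeError:
        return None


if __name__ == "__main__":
    # U+2460 (circled digit one) is a CP932 extension character.
    for codec in ("cp932", "shift_jis", "euc_jp"):
        print(codec, try_encode("\u2460", codec))
```

A `None` result here corresponds to the `?` (3f) substitutions shown in the table.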