To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?????????雲??????????雲?^ 00111111001111110011111100111111001111110011111100111111001111110011111110001001010111110011111100111111001111110011111100111111001111110011111100111111001111110011111110001001010111110011111101011110 3f3f3f3f3f3f3f3f3f895f3f3f3f3f3f3f3f3f3f3f895f3f5e
EUC-JP ?????????雲鈒?????????雲鈒^ 0011111100111111001111110011111100111111001111110011111100111111001111111011000111000000100011111110001111000010001111110011111100111111001111110011111100111111001111110011111100111111101100011100000010001111111000111100001001011110 3f3f3f3f3f3f3f3f3fb1c08fe3c23f3f3f3f3f3f3f3f3fb1c08fe3c25e
UTF-8 쒔렢쑹롊쒀렺쒔렡쒀雲鈒쒔렢쑹롊쒀렺쒔렡쒀雲鈒^ 11101100100100101001010011101011101000001010001011101100100100011011100111101011101000011000101011101100100100101000000011101011101000001011101011101100100100101001010011101011101000001010000111101100100100101000000011101001100110111011001011101001100010001001001011101100100100101001010011101011101000001010001011101100100100011011100111101011101000011000101011101100100100101000000011101011101000001011101011101100100100101001010011101011101000001010000111101100100100101000000011101001100110111011001011101001100010001001001001011110 ec9294eba0a2ec91b9eba18aec9280eba0baec9294eba0a1ec9280e99bb2e98892ec9294eba0a2ec91b9eba18aec9280eba0baec9294eba0a1ec9280e99bb2e988925e
UHC 쒔렢쑹롊쒀렺쒔렡쒀雲鈒쒔렢쑹롊쒀렺쒔렡쒀雲鈒^ 101111101010110110001110101100111011111010101011100011101101000010111110101011001000111011000010101111101010110110001110101100101011111010101100111010101010001111011111101111001011111010101101100011101011001110111110101010111000111011010000101111101010110010001110110000101011111010101101100011101011001010111110101011001110101010100011110111111011110001011110 bead8eb3beab8ed0beac8ec2bead8eb2beaceaa3dfbcbead8eb3beab8ed0beac8ec2bead8eb2beaceaa3dfbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)