To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ??ュ?泥????}??ュ?泥????{^ 00111111001111111000001110000101001111111001001101000100001111110011111100111111001111110111110100111111001111111000001110000101001111111001001101000100001111110011111100111111001111110111101101011110 3f3f83853f93443f3f3f3f7d3f3f83853f93443f3f3f3f7b5e
EUC-JP ??ュ?泥????}??ュ?泥????{^ 00111111001111111010010111100101001111111100010110100101001111110011111100111111001111110111110100111111001111111010010111100101001111111100010110100101001111110011111100111111001111110111101101011110 3f3fa5e53fc5a53f3f3f3f7d3f3fa5e53fc5a53f3f3f3f7b5e
UTF-8 룶혧ュ룵泥풉룵츕⒟}룶혧ュ룵泥풉룵츕⒟{^ 111010111010001110110110111011011001100010100111111000111000001110100101111010111010001110110101111001101011001110100101111011011001001010001001111010111010001110110101111011001011100010010101111000101001001010011111011111011110101110100011101101101110110110011000101001111110001110000011101001011110101110100011101101011110011010110011101001011110110110010010100010011110101110100011101101011110110010111000100101011110001010010010100111110111101101011110 eba3b6ed98a7e383a5eba3b5e6b3a5ed9289eba3b5ecb895e2929f7deba3b6ed98a7e383a5eba3b5e6b3a5ed9289eba3b5ecb895e2929f7b5e
UHC 룶혧ュ룵泥풉룵츕⒟}룶혧ュ룵泥풉룵츕⒟{^ 100011111010101111000010100011111010101111100101100011111010101011010010111110101100011110110001100011111010101010101110100011111010100111010000011111011000111110101011110000101000111110101011111001011000111110101010110100101111101011000111101100011000111110101010101011101000111110101001110100000111101101011110 8fabc28fabe58faad2fac7b18faaae8fa9d07d8fabc28fabe58faad2fac7b18faaae8fa9d07b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
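
The conversion shown in the table can be approximated with a short script. This is only a sketch, not the page's actual implementation: it assumes Python's built-in codecs `cp932` (for SJIS-WIN), `euc_jp`, and `cp949` (for UHC) correspond to the charsets listed, and the names `CHARSETS`, `encode_row`, and `encode_table` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is consistent with the runs of `3f` in the table.

```python
# Map the labels used in the table to Python codec names (an assumption;
# the page does not say which converter it uses internally).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("ュ}").items():
        print(label, bits, hexstr)
```

For instance, `encode_row("ュ", "utf-8")` yields the hex string `e383a5`, while the same character in ISO-8859-1 falls back to `3f`.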