To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8倚??儒??娃??日??臾??繹 111000011001111100111111100000100101011110011000110111110011111100111111100011101111001000111111001111111000100010100001001111110011111110010011111110100011111100111111111001000110101100111111001111111110001110001000 e19f3f825798df3f3f8ef23f3f88a13f3f93fa3f3fe46b3f3fe388
EUC-JP 癲?8倚??儒??娃??日??臾??繹 111000101010000100111111101000111011100011010000111000010011111100111111101111001111010000111111001111111011000010100011001111110011111111000110111111000011111100111111111001111100110000111111001111111110010111101000 e2a13fa3b8d0e13f3fbcf43f3fb0a33f3fc6fc3f3fe7cc3f3fe5e8
UTF-8 癲쒕8倚싨만儒묓뫛娃븐뼲日뗩뼮臾롪뻗繹 111001111001100110110010111011001001001010010101111011111011110010011000111001011000000010011010111011001000101110101000111010111010011110001100111001011000010010010010111010111010110010010011111010111010101110011011111001011010100010000011111010111011100010010000111010111011110010110010111001101001011110100101111010111001011110101001111010111011110010101110111010001000011110111110111010111010000110101010111010111011101110010111111001111011100110111001 e799b2ec9295efbc98e5809aec8ba8eba78ce58492ebac93ebab9be5a883ebb890ebbcb2e697a5eb97a9ebbcaee887beeba1aaebbb97e7b9b9
UHC 癲쒕8倚싨만儒묓뫛娃븐뼲日뗩뼮臾롪뻗繹 1110111110100110100111001110101110100011101110001110101111101111100110101110011010111000101110001110101011100011100100011110110110010001101110111110100011011111101110101110110010010110101101011110110011101101100010111110100110010110101100011110101110101100100011101110101010111011101110001110011010111010 efa69ceba3b8ebef9ae6b8b8eae391ed91bbe8dfbaec96b5eced8be996b1ebac8eeabbb8e6ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)