To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??矣??域??醫??域??魏?? 001111110011111100111111111010001110100000111111001111111110000111100001001111110011111110001000111001100011111100111111111001111100111000111111001111111000100011100110001111110011111111101001101100000011111100111111 3f3f3fe8e83f3fe1e13f3f88e63f3fe7ce3f3f88e63f3fe9b03f3f
EUC-JP ???韋??矣??域??醫??域??魏?? 001111110011111100111111111100001110101000111111001111111110001011100011001111110011111110110000111010000011111100111111111011101101000000111111001111111011000011101000001111110011111111110010101100100011111100111111 3f3f3ff0ea3f3fe2e33f3fb0e83f3feed03f3fb0e83f3ff2b23f3f
UTF-8 僚녹뼔韋귛푻矣낅븶域뱄섟醫귣븶域㏃뼚魏쒒뒽 111011111010011010111011111010111000010110111001111010111011110010010100111010011001111110001011111010101011011110011011111011011001000110111011111001111001111110100011111010111000001010000101111010111011100010110110111001011001111110011111111010111011000110000100111011001000010010011111111010011000011010101011111010101011011110100011111010111011100010110110111001011001111110011111111000111000111110000011111010111011110010011010111010011010110110001111111011001001001010010010111010111001001010111101 efa6bbeb85b9ebbc94e99f8beab79bed91bbe79fa3eb8285ebb8b6e59f9febb184ec849fe986abeab7a3ebb8b6e59f9fe38f83ebbc9ae9ad8fec9292eb92bd
UHC 僚녹뼔韋귛푻矣낅븶域뱄섟醫귣븶域㏃뼚魏쒒뒽 111010001110100010110011111011001001011010011100111010101101111110000010111001011011111010000111111010111111100010000101111010111001010110011111111001101011010010111001111011111011110010110000111011001010001010000010111010111001010110011111111001101011010010100111111011001001011010100000111010101110000010011100111010011000101010110011 e8e8b3ec969ceadf82e5be87ebf885eb959fe6b4b9efbcb0eca282eb959fe6b4a7ec96a0eae09ce98ab3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)