To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????鶯??譽??稔????????? 00111111001111110011111100111111001111110011111111101001111100100011111100111111111001101010001100111111001111111001011010101011001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3fe9f23f3fe6a33f3f96ab3f3f3f3f3f3f3f3f3f
EUC-JP ??????鶯??譽??稔????????? 00111111001111110011111100111111001111110011111111110010111101000011111100111111111011001010010100111111001111111100110010101101001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3ff2f43f3feca53f3fccad3f3f3f3f3f3f3f3f3f
UTF-8 溜삵썑溜뤿졎鶯쇻뀞譽싦슴稔꾨젎溜삣짍栒뺠꽊蓼 111011111010011110001011111011001000001010110101111011001000110110010001111011111010011110001011111010111010010010111111111011001010000110001110111010011011011010101111111011001000011110111011111010111000000010011110111010001010110110111101111011001000101110100110111011001000101010110100111001111010100010010100111010101011111010101000111011001010000010001110111011111010011110001011111011001000001010100011111011001010011110001101111001101010000010010010111010111011101010100000111010101011110110001010111011111010011110000010 efa78bec82b5ec8d91efa78beba4bfeca18ee9b6afec87bbeb809ee8adbdec8ba6ec8ab4e7a894eabea8eca08eefa78bec82a3eca78de6a092ebbaa0eabd8aefa782
UHC 溜삵썑溜뤿졎鶯쇻뀞譽싦슴稔꾨젎溜삣짍栒뺠꽊蓼 1110101011111110101110111110110110011011100001001110101011111110100011111110101110100000101110111110010110100011100110011110001110000101100101011110011111100010100110101110010010111101101111111110110011111001100001001110101110100000100011111110101011111110101110111110010110100011100110011110001011100011100101011110100010000100100110101110100110100111 eafebbed9b84eafe8feba0bbe5a399e38595e7e29ae4bdbfecf984eba08feafebbe5a399e2e395e8849ae9a7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)