To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 脱揃村奪造足巽則造脱揃村奪造足巽則造B 10010010010001011001000110110101100100011011101010010010010001001001000110100010100100011010101110010010010001101001000110100101100100011010001010010010010001011001000110110101100100011011101010010010010001001001000110100010100100011010101110010010010001101001000110100101100100011010001001000010 924591b591ba924491a291ab924691a591a2924591b591ba924491a291ab924691a591a242
EUC-JP 脱揃村奪造足巽則造脱揃村奪造足巽則造B 11000011101001101100001010110111110000101011110011000011101001011100001010100100110000101010110111000011101001111100001010100111110000101010010011000011101001101100001010110111110000101011110011000011101001011100001010100100110000101010110111000011101001111100001010100111110000101010010001000010 c3a6c2b7c2bcc3a5c2a4c2adc3a7c2a7c2a4c3a6c2b7c2bcc3a5c2a4c2adc3a7c2a7c2a442
UTF-8 脱揃村奪造足巽則造脱揃村奪造足巽則造B 11101000100001001011000111100110100011111000001111100110100111011001000111100101101001011010101011101001100000001010000011101000101101101011001111100101101101111011110111100101100010011000011111101001100000001010000011101000100001001011000111100110100011111000001111100110100111011001000111100101101001011010101011101001100000001010000011101000101101101011001111100101101101111011110111100101100010011000011111101001100000001010000001000010 e884b1e68f83e69d91e5a5aae980a0e8b6b3e5b7bde58987e980a0e884b1e68f83e69d91e5a5aae980a0e8b6b3e5b7bde58987e980a042
UHC ??村奪造足巽則造??村奪造足巽則造B 001111110011111111110101101111011111011110101100111100001110001111110000111010111110000111011110111101101100111011110000111000110011111100111111111101011011110111110111101011001111000011100011111100001110101111100001110111101111011011001110111100001110001101000010 3f3ff5bdf7acf0e3f0ebe1def6cef0e33f3ff5bdf7acf0e3f0ebe1def6cef0e342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)