To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^z????????^zB 001111110011111100111111001111110011111100111111001111110011111101011110011110100011111100111111001111110011111100111111001111110011111100111111010111100111101001000010 3f3f3f3f3f3f3f3f5e7a3f3f3f3f3f3f3f3f5e7a42
SJIS-WIN 旦属坦袖単端丹樽^z旦属坦袖単端丹樽^zB 10010010010101011001000110101110100100100101001010010001101100111001001001010000100100100101101110010010010011111001001001001101010111100111101010010010010101011001000110101110100100100101001010010001101100111001001001010000100100100101101110010010010011111001001001001101010111100111101001000010 925591ae925291b39250925b924f924d5e7a925591ae925291b39250925b924f924d5e7a42
EUC-JP 旦属坦袖単端丹樽^z旦属坦袖単端丹樽^zB 11000011101101101100001010110000110000111011001111000010101101011100001110110001110000111011110011000011101100001100001110101110010111100111101011000011101101101100001010110000110000111011001111000010101101011100001110110001110000111011110011000011101100001100001110101110010111100111101001000010 c3b6c2b0c3b3c2b5c3b1c3bcc3b0c3ae5e7ac3b6c2b0c3b3c2b5c3b1c3bcc3b0c3ae5e7a42
UTF-8 旦属坦袖単端丹樽^z旦属坦袖単端丹樽^zB 1110011010010111101001101110010110110001100111101110010110011101101001101110100010100010100101101110010110001101100110001110011110101011101011111110010010111000101110011110011010101000101111010101111001111010111001101001011110100110111001011011000110011110111001011001110110100110111010001010001010010110111001011000110110011000111001111010101110101111111001001011100010111001111001101010100010111101010111100111101001000010 e697a6e5b19ee59da6e8a296e58d98e7abafe4b8b9e6a8bd5e7ae697a6e5b19ee59da6e8a296e58d98e7abafe4b8b9e6a8bd5e7a42
UHC 旦?坦袖?端丹樽^z旦?坦袖?端丹樽^zB 110100111010100100111111111101111010010011100010110000000011111111010011101011101101001110100001111100011101110001011110011110101101001110101001001111111111011110100100111000101100000000111111110100111010111011010011101000011111000111011100010111100111101001000010 d3a93ff7a4e2c03fd3aed3a1f1dc5e7ad3a93ff7a4e2c03fd3aed3a1f1dc5e7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)