To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 躁???種?義???穗鏃??盒?泌頑策 11100111010011100011111100111111001111111000111011101101001111111000101101100000001111110011111100111111111000100110111011101000010101100011111100111111111000011011010000111111100101001110010110001010111001101000110111110100 e74e3f3f3f8eed3f8b603f3f3fe26ee8563f3fe1b43f94e58ae68df4
EUC-JP 躁???種?義???穗鏃??盒?泌頑策 11101101101011110011111100111111001111111011110011101111001111111011010111000001001111110011111100111111111000111100111111101111101101110011111100111111111000101011011000111111110010001110011110110100111010001011101011110110 edaf3f3f3fbcef3fb5c13f3f3fe3cfefb73f3fe2b63fc8e7b4e8baf6
UTF-8 躁댓렰렎種렟義꿰뤅尿穗鏃퐥ㆁ盒곌泌頑策 111010001011101010000001111010111000110010010011111010111010000010110000111010111010000010001110111001111010100010101110111010111010000010011111111001111011111010101001111010101011111110110000111010111010010010000101111011111010011010111101111001111010100110010111111010011000111110000011111011011001000010100101111000111000011010000001111001111001101110010010111010101011001110001100111001101011001110001100111010011010000010010001111001111010110110010110 e8ba81eb8c93eba0b0eba08ee7a8aeeba09fe7bea9eabfb0eba485efa6bde7a997e98f83ed90a5e38681e79b92eab38ce6b38ce9a091e7ad96
UHC 躁댓렰렎種렟義꿰뤅尿穗鏃퐥ㆁ盒곌泌頑策 1111000011100010101101001111000110001110101111011000111010100100111100001111101010001110101100001110101111111001101100101110011110001111101101011110100011110001111000101011010011110000111011001011110110001110101001001111000111111001111011001011000011101010111110011011001011101000110101111111001111111110 f0e2b4f18ebd8ea4f0fa8eb0ebf9b2e78fb5e8f1e2b4f0ecbd8ea4f1f9ecb0eaf9b2e8d7f3fe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)