To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 形?紆?調絲??妖??賊?紆?調絲??妖??B 100011000110000000111111111000101111110000111111100100101011001011100011010011100011111100111111100101110110010000111111001111111001000110101111001111111110001011111100001111111001001010110010111000110100111000111111001111111001011101100100001111110011111101000010 8c603fe2fc3f92b2e34e3f3f97643f3f91af3fe2fc3f92b2e34e3f3f97643f3f42
EUC-JP 形?紆?調絲??妖??賊?紆?調絲??妖??B 101101111100000100111111111001001111111000111111110001001011010011100101101011110011111100111111110011011100010100111111001111111100001010110001001111111110010011111110001111111100010010110100111001011010111100111111001111111100110111000101001111110011111101000010 b7c13fe4fe3fc4b4e5af3f3fcdc53f3fc2b13fe4fe3fc4b4e5af3f3fcdc53f3f42
UTF-8 形렠紆렣調絲렟렩妖쾅긺賊렠紆렣調絲렟렩妖쾅긺B 11100101101111011010001011101011101000001010000011100111101101001000011011101011101000001010001111101000101010101011111111100111101101011011001011101011101000001001111111101011101000001010100111100101101001101001011011101100101111101000010111101010101110001011101011101000101100111000101011101011101000001010000011100111101101001000011011101011101000001010001111101000101010101011111111100111101101011011001011101011101000001001111111101011101000001010100111100101101001101001011011101100101111101000010111101010101110001011101001000010 e5bda2eba0a0e7b486eba0a3e8aabfe7b5b2eba09feba0a9e5a696ecbe85eab8bae8b38aeba0a0e7b486eba0a3e8aabfe7b5b2eba09feba0a9e5a696ecbe85eab8ba42
UHC 形렠紆렣調絲렟렩妖쾅긺賊렠紆렣調絲렟렩妖쾅긺B 111110111010000110001110101100011110100111100001100011101011010011110000111000001101111011101010100011101011000010001110101101111110100011101101110001001110011110110001111001111110111011100100100011101011000111101001111000011000111010110100111100001110000011011110111010101000111010110000100011101011011111101000111011011100010011100111101100011110011101000010 fba18eb1e9e18eb4f0e0deea8eb08eb7e8edc4e7b1e7eee48eb1e9e18eb4f0e0deea8eb08eb7e8edc4e7b1e742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)