To what bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??乳??濡λ?榮??油??筌??萸?? 1001010010101000001111110011111110010011111110110011111100111111100101000100011110000011110010010011111110011110110001000011111100111111100101101111101100111111001111111110001010100011001111110011111111100100110011100011111100111111 94a83f3f93fb3f3f944783c93f9ec43f3f96fb3f3fe2a33f3fe4ce3f3f
EUC-JP 畑??乳??濡λ?榮??油??筌??萸?? 1100100010101010001111110011111111000110111111010011111100111111110001111010100010100110110010110011111111011100110001100011111100111111110011001111110100111111001111111110010010100101001111110011111111101000110100000011111100111111 c8aa3f3fc6fd3f3fc7a8a6cb3fdcc63f3fccfd3f3fe4a53f3fe8d03f3f
UTF-8 畑밴퉭乳들렟濡λ윹榮싩빊油삳껜筌믩끉萸욜뇖 1110011110010101100100011110101110110000101101001110110110001001101011011110010010111001101100111110101110010011101001001110101110100000100111111110011010111111101000011100111010111011111011001001110010111001111001101010011010101110111011001000101110101001111010111011100110001010111001101011001010111001111011001000001010110011111010101011101110011100111001111010110110001100111010111010111110101001111010111000000110001001111010001001000010111000111011001001101010011100111010111000011110010110 e79591ebb0b4ed89ade4b9b3eb93a4eba09fe6bfa1cebbec9cb9e6a6aeec8ba9ebb98ae6b2b9ec82b3eabb9ce7ad8cebafa9eb8189e890b8ec9a9ceb8796
UHC 畑밴퉭乳들렟濡λ윹榮싩빊油삳껜筌믩끉萸욜뇖 111011111010010110111001111010101011100110000101111010101110000110110101111010011000111010110000111010111010000110100101111010111001111110110011111001111011010010011010111001111001010110110000111010101111101010111011111010111011001010110100111011111010011110010010111010111000010110111100111010111010110110111111111001111000011110000001 efa5b9eab985eae1b5e98eb0eba1a5eb9fb3e7b49ae795b0eafabbebb2b4efa792eb85bcebadbfe78781

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.
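
A conversion like the one tabulated above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The helper name `encode_row` and the `CHARSETS` mapping are hypothetical. The mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. So is the use of `errors="replace"`, chosen because the `3f` bytes in the table suggest that characters a charset cannot represent are replaced with "?".

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Assumption: unencodable characters become "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Render every byte as exactly 8 bits so byte boundaries stay aligned.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a single sample character.
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row("畑", codec)
        print(label, bits, hexed)
```

For the single character 畑, this sketch gives `e79591` in UTF-8, `94a8` in CP932, and `c8aa` in EUC-JP, matching the leading bytes of the corresponding rows above. ISO-8859-1 gives `3f` because the character has no Latin-1 representation.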