To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 艶?押??弱??艶?押?????B 10001001100100000011111110001001100111110011111100111111100011101110001100111111001111111000100110010000001111111000100110011111001111110011111100111111001111110011111101000010 89903f899f3f3f8ee33f3f89903f899f3f3f3f3f3f42
EUC-JP 艶?押??弱??艶?押?????B 10110001111100000011111110110010101000010011111100111111101111001110010100111111001111111011000111110000001111111011001010100001001111110011111100111111001111110011111101000010 b1f03fb2a13f3fbce53f3fb1f03fb2a13f3f3f3f3f42
UTF-8 艶쵩押띈씩弱듿컟艶쵩押띈씩呂묋쥤B 11101000100010011011011011101100101101011010100111100110100010101011110011101011100111011000100011101100100101001010100111100101101111001011000111101011100100111011111111101100101110111001111111101000100010011011011011101100101101011010100111100110100010101011110011101011100111011000100011101100100101001010100111101111101001101000000011101011101011001000101111101100101001011010010001000010 e889b6ecb5a9e68abceb9d88ec94a9e5bcb1eb93bfecbb9fe889b6ecb5a9e68abceb9d88ec94a9efa680ebac8beca5a442
UHC 艶쵩押띈씩弱듿컟艶쵩押띈씩呂묋쥤B 111001101111110110101101010010001110010011100011101101101110100010111110101111111110010110110000100010101110010110110000100010101110011011111101101011010100100011100100111000111011011011101000101111101011111111100101111110111001000111101000101000101001011001000010 e6fdad48e4e3b6e8bebfe5b08ae5b08ae6fdad48e4e3b6e8bebfe5fb91e8a29642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)