To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 烏?┷W^烏?┷\}v烏?┷W^烏?┷\}vB 10001001010001110011111110000100101110000101011101011110100010010100011100111111100001001011100001011100011111010111011010001001010001110011111110000100101110000101011101011110100010010100011100111111100001001011100001011100011111010111011001000010 89473f84b8575e89473f84b85c7d7689473f84b8575e89473f84b85c7d7642
EUC-JP 烏?┷W^烏?┷\}v烏?┷W^烏?┷\}vB 10110001101010000011111110101000101110100101011101011110101100011010100000111111101010001011101001011100011111010111011010110001101010000011111110101000101110100101011101011110101100011010100000111111101010001011101001011100011111010111011001000010 b1a83fa8ba575eb1a83fa8ba5c7d76b1a83fa8ba575eb1a83fa8ba5c7d7642
UTF-8 烏랃┷W^烏랃┷\}v烏랃┷W^烏랃┷\}vB 1110011110000011100011111110101110011110100000111110001010010100101101110101011101011110111001111000001110001111111010111001111010000011111000101001010010110111010111000111110101110110111001111000001110001111111010111001111010000011111000101001010010110111010101110101111011100111100000111000111111101011100111101000001111100010100101001011011101011100011111010111011001000010 e7838feb9e83e294b7575ee7838feb9e83e294b75c7d76e7838feb9e83e294b7575ee7838feb9e83e294b75c7d7642
UHC 烏랃┷W^烏랃┷\}v烏랃┷W^烏랃┷\}vB 1110100010100001100011011110111110100110101110100101011101011110111010001010000110001101111011111010011010111010010111000111110101110110111010001010000110001101111011111010011010111010010101110101111011101000101000011000110111101111101001101011101001011100011111010111011001000010 e8a18defa6ba575ee8a18defa6ba5c7d76e8a18defa6ba575ee8a18defa6ba5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)