What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???節???????節????B 00111111001111110011111110010000110111110011111100111111001111110011111100111111001111110011111110010000110111110011111100111111001111110011111101000010 3f3f3f90df3f3f3f3f3f3f3f90df3f3f3f3f42
EUC-JP ???節???????節????B 00111111001111110011111111000000111000010011111100111111001111110011111100111111001111110011111111000000111000010011111100111111001111110011111101000010 3f3f3fc0e13f3f3f3f3f3f3fc0e13f3f3f3f42
UTF-8 女앭랫節뗦틦溜쥃女앭랫節뗦틦溜쥃B 11101111101001101000000111101100100101011010110111101011100111101010101111100111101011111000000011101011100101111010011011101101100010111010011011101111101001111000101111101100101001011000001111101111101001101000000111101100100101011010110111101011100111101010101111100111101011111000000011101011100101111010011011101101100010111010011011101111101001111000101111101100101001011000001101000010 efa681ec95adeb9eabe7af80eb97a6ed8ba6efa78beca583efa681ec95adeb9eabe7af80eb97a6ed8ba6efa78beca58342
UHC 女앭랫節뗦틦溜쥃女앭랫節뗦틦溜쥃B 111001011111110010011101111001011011011110100111111011111011110110001011111001101011101010010000111010101111111010100010011101101110010111111100100111011110010110110111101001111110111110111101100010111110011010111010100100001110101011111110101000100111011001000010 e5fc9de5b7a7efbd8be6ba90eafea276e5fc9de5b7a7efbd8be6ba90eafea27642

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
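
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name encode_row and the CHARSETS mapping are hypothetical, and the codec names are assumptions about which Python codecs correspond to the labels used here (SJIS-WIN as cp932, UHC as cp949). Characters that a charset cannot represent are replaced with "?" (0x3f), which is the pattern the ISO-8859-1 row exhibits.

```python
def encode_row(text: str, charset: str) -> tuple[str, str]:
    """Return (binary string, hex string) for text encoded in charset.

    Unencodable characters become b"?" (0x3f) via errors="replace".
    """
    raw = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "\u7bc0B"  # U+7BC0 followed by ASCII "B"
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row(sample, codec)
        print(label, bits, hexed)
```

For U+7BC0 this should yield 90df in CP932 and c0e1 in EUC-JP, the same byte pairs that appear in the SJIS-WIN and EUC-JP rows above. Other libraries may map a few characters differently, so results for edge cases are not guaranteed to match this page.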