To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???陰??淫??^ 001111110011111100111111100010010100000100111111001111111000100011111010001111110011111101011110 3f3f3f89413f3f88fa3f3f5e
EUC-JP 渶??陰??淫??^ 1000111111000111111011010011111100111111101100011010001000111111001111111011000011111100001111110011111101011110 8fc7ed3f3fb1a23f3fb0fc3f3f5e
UTF-8 渶⑹꽍陰곁랭淫앹넩^ 11100110101110001011011011100010100100011011100111101010101111011000110111101001100110011011000011101010101100111000000111101011100111101010110111100110101101111010101111101100100101011011100111101011100001001010100101011110 e6b8b6e291b9eabd8de999b0eab381eb9eade6b7abec95b9eb84a95e
UHC 渶⑹꽍陰곁랭淫앹넩^ 11100111101101111010100111101100100001001001110111101011111001001011000011100111101101111010100111101011111000101001110111101100100001101010100101011110 e7b7a9ec849debe4b0e7b7a9ebe29dec86a95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)