To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 筌??誼←?碎???レ?筌??誼←?碎???レ?^ 1110001010100011001111110011111110001011011000101000000110101001001111111110000111101010001111110011111100111111100000111000110000111111111000101010001100111111001111111000101101100010100000011010100100111111111000011110101000111111001111110011111110000011100011000011111101011110 e2a33f3f8b6281a93fe1ea3f3f3f838c3fe2a33f3f8b6281a93fe1ea3f3f3f838c3f5e
EUC-JP 筌??誼←?碎???レ?筌??誼←?碎???レ?^ 1110010010100101001111110011111110110101110000111010001010101011001111111110001011101100001111110011111100111111101001011110110000111111111001001010010100111111001111111011010111000011101000101010101100111111111000101110110000111111001111110011111110100101111011000011111101011110 e4a53f3fb5c3a2ab3fe2ec3f3f3fa5ec3fe4a53f3fb5c3a2ab3fe2ec3f3f3fa5ec3f5e
UTF-8 筌뗪퉭誼←뇖碎밴틢曆レ㎗筌뗪퉭誼←뇖碎밴틢曆レ㎗^ 11100111101011011000110011101011100101111010101011101101100010011010110111101000101010101011110011100010100001101001000011101011100001111001011011100111101000101000111011101011101100001011010011101101100010111010001011101111101001101000101111100011100000111010110011100011100011101001011111100111101011011000110011101011100101111010101011101101100010011010110111101000101010101011110011100010100001101001000011101011100001111001011011100111101000101000111011101011101100001011010011101101100010111010001011101111101001101000101111100011100000111010110011100011100011101001011101011110 e7ad8ceb97aaed89ade8aabce28690eb8796e7a28eebb0b4ed8ba2efa68be383ace38e97e7ad8ceb97aaed89ade8aabce28690eb8796e7a28eebb0b4ed8ba2efa68be383ace38e975e
UHC 筌뗪퉭誼←뇖碎밴틢曆レ㎗筌뗪퉭誼←뇖碎밴틢曆レ㎗^ 11101111101001111000101111101010101110011000010111101011111111101010000111100111100001111000000111100001111011111011100111101010101110101000111011100110101101111010101111101100101001111010001111101111101001111000101111101010101110011000010111101011111111101010000111100111100001111000000111100001111011111011100111101010101110101000111011100110101101111010101111101100101001111010001101011110 efa78beab985ebfea1e78781e1efb9eaba8ee6b7abeca7a3efa78beab985ebfea1e78781e1efb9eaba8ee6b7abeca7a35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)