To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 午??臆?R午??臆?^[午??臆?R午??臆?^[^ 1000110011011111001111110011111110001001101100000011111101010010100011001101111100111111001111111000100110110000001111110101111001011011100011001101111100111111001111111000100110110000001111110101001010001100110111110011111100111111100010011011000000111111010111100101101101011110 8cdf3f3f89b03f528cdf3f3f89b03f5e5b8cdf3f3f89b03f528cdf3f3f89b03f5e5b5e
EUC-JP 午??臆?R午??臆?^[午??臆?R午??臆?^[^ 1011100011100001001111110011111110110010101100100011111101010010101110001110000100111111001111111011001010110010001111110101111001011011101110001110000100111111001111111011001010110010001111110101001010111000111000010011111100111111101100101011001000111111010111100101101101011110 b8e13f3fb2b23f52b8e13f3fb2b23f5e5bb8e13f3fb2b23f52b8e13f3fb2b23f5e5b5e
UTF-8 午닻넀臆럑R午닻넀臆럑^[午닻넀臆럑R午닻넀臆럑^[^ 11100101100011011000100011101011100010111011101111101011100001001000000011101000100001111000011011101011100111111001000101010010111001011000110110001000111010111000101110111011111010111000010010000000111010001000011110000110111010111001111110010001010111100101101111100101100011011000100011101011100010111011101111101011100001001000000011101000100001111000011011101011100111111001000101010010111001011000110110001000111010111000101110111011111010111000010010000000111010001000011110000110111010111001111110010001010111100101101101011110 e58d88eb8bbbeb8480e88786eb9f9152e58d88eb8bbbeb8480e88786eb9f915e5be58d88eb8bbbeb8480e88786eb9f9152e58d88eb8bbbeb8480e88786eb9f915e5b5e
UHC 午닻넀臆럑R午닻넀臆럑^[午닻넀臆럑R午닻넀臆럑^[^ 1110011111101101101101001110100110000110100100001110010111100110100011100110111001010010111001111110110110110100111010011000011010010000111001011110011010001110011011100101111001011011111001111110110110110100111010011000011010010000111001011110011010001110011011100101001011100111111011011011010011101001100001101001000011100101111001101000111001101110010111100101101101011110 e7edb4e98690e5e68e6e52e7edb4e98690e5e68e6e5e5be7edb4e98690e5e68e6e52e7edb4e98690e5e68e6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)