To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 曄??幽?R曄??幽?^[曄??幽?R曄??幽?^[^ 1001111001000000001111110011111110010111010010000011111101010010100111100100000000111111001111111001011101001000001111110101111001011011100111100100000000111111001111111001011101001000001111110101001010011110010000000011111100111111100101110100100000111111010111100101101101011110 9e403f3f97483f529e403f3f97483f5e5b9e403f3f97483f529e403f3f97483f5e5b5e
EUC-JP 曄??幽?R曄??幽?^[曄??幽?R曄??幽?^[^ 1101101110100001001111110011111111001101101010010011111101010010110110111010000100111111001111111100110110101001001111110101111001011011110110111010000100111111001111111100110110101001001111110101001011011011101000010011111100111111110011011010100100111111010111100101101101011110 dba13f3fcda93f52dba13f3fcda93f5e5bdba13f3fcda93f52dba13f3fcda93f5e5b5e
UTF-8 曄욌빋幽톘R曄욌빋幽톘^[曄욌빋幽톘R曄욌빋幽톘^[^ 11100110100110111000010011101100100110101000110011101011101110011000101111100101101110011011110111101101100001101001100001010010111001101001101110000100111011001001101010001100111010111011100110001011111001011011100110111101111011011000011010011000010111100101101111100110100110111000010011101100100110101000110011101011101110011000101111100101101110011011110111101101100001101001100001010010111001101001101110000100111011001001101010001100111010111011100110001011111001011011100110111101111011011000011010011000010111100101101101011110 e69b84ec9a8cebb98be5b9bded869852e69b84ec9a8cebb98be5b9bded86985e5be69b84ec9a8cebb98be5b9bded869852e69b84ec9a8cebb98be5b9bded86985e5b5e
UHC 曄욌빋幽톘R曄욌빋幽톘^[曄욌빋幽톘R曄욌빋幽톘^[^ 1110011110100101100111101110101110010101101100011110101011101011101101110110111001010010111001111010010110011110111010111001010110110001111010101110101110110111011011100101111001011011111001111010010110011110111010111001010110110001111010101110101110110111011011100101001011100111101001011001111011101011100101011011000111101010111010111011011101101110010111100101101101011110 e7a59eeb95b1eaebb76e52e7a59eeb95b1eaebb76e5e5be7a59eeb95b1eaebb76e52e7a59eeb95b1eaebb76e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)