To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???OW???OJn}???OW???OJn{^ 00111111001111110011111101001111010101110011111100111111001111110100111101001010011011100111110100111111001111110011111101001111010101110011111100111111001111110100111101001010011011100111101101011110 3f3f3f4f573f3f3f4f4a6e7d3f3f3f4f573f3f3f4f4a6e7b5e
SJIS-WIN 箋頑或OW箋頑或OJn}箋頑或OW箋頑或OJn{^ 11100010101100111000101011100110100010001011110101001111010101111110001010110011100010101110011010001000101111010100111101001010011011100111110111100010101100111000101011100110100010001011110101001111010101111110001010110011100010101110011010001000101111010100111101001010011011100111101101011110 e2b38ae688bd4f57e2b38ae688bd4f4a6e7de2b38ae688bd4f57e2b38ae688bd4f4a6e7b5e
EUC-JP 箋頑或OW箋頑或OJn}箋頑或OW箋頑或OJn{^ 11100100101101011011010011101000101100001011111101001111010101111110010010110101101101001110100010110000101111110100111101001010011011100111110111100100101101011011010011101000101100001011111101001111010101111110010010110101101101001110100010110000101111110100111101001010011011100111101101011110 e4b5b4e8b0bf4f57e4b5b4e8b0bf4f4a6e7de4b5b4e8b0bf4f57e4b5b4e8b0bf4f4a6e7b5e
UTF-8 箋頑或OW箋頑或OJn}箋頑或OW箋頑或OJn{^ 11100111101011101000101111101001101000001001000111100110100010001001011001001111010101111110011110101110100010111110100110100000100100011110011010001000100101100100111101001010011011100111110111100111101011101000101111101001101000001001000111100110100010001001011001001111010101111110011110101110100010111110100110100000100100011110011010001000100101100100111101001010011011100111101101011110 e7ae8be9a091e688964f57e7ae8be9a091e688964f4a6e7de7ae8be9a091e688964f57e7ae8be9a091e688964f4a6e7b5e
UHC 箋頑或OW箋頑或OJn}箋頑或OW箋頑或OJn{^ 11101111101010001110100011010111111110111110010001001111010101111110111110101000111010001101011111111011111001000100111101001010011011100111110111101111101010001110100011010111111110111110010001001111010101111110111110101000111010001101011111111011111001000100111101001010011011100111101101011110 efa8e8d7fbe44f57efa8e8d7fbe44f4a6e7defa8e8d7fbe44f57efa8e8d7fbe44f4a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)