What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淫??陰??堊??陰⑥?蒻?????濡??^ 1000100011111010001111110011111110001001010000010011111100111111100110101011111100111111001111111000100101000001100001110100010100111111111001001110100000111111001111110011111100111111001111111001010001000111001111110011111101011110 88fa3f3f89413f3f9abf3f3f894187453fe4e83f3f3f3f3f94473f3f5e
EUC-JP 淫??陰??堊??陰??蒻?????濡??^ 10110000111111000011111100111111101100011010001000111111001111111101010011000001001111110011111110110001101000100011111100111111111010001110101000111111001111110011111100111111001111111100011110101000001111110011111101011110 b0fc3f3fb1a23f3fd4c13f3fb1a23f3fe8ea3f3f3f3f3fc7a83f3f5e
UTF-8 淫쇔떽陰먮젿堊앸죰陰⑥꽱蒻쎌넀溜긴풚濡뗭넚^ 11100110101101111010101111101100100001111001010011101011100101101011110111101001100110011011000011101011101010001010111011101100101000001011111111100101101000001000101011101100100101011011100011101100101000111011000011101001100110011011000011100010100100011010010111101010101111011011000111101000100100101011101111101100100011101000110011101011100001001000000011101111101001111000101111101010101110001011010011101101100100101001101011100110101111111010000111101011100101111010110111101011100001001001101001011110 e6b7abec8794eb96bde999b0eba8aeeca0bfe5a08aec95b8eca3b0e999b0e291a5eabdb1e892bbec8e8ceb8480efa78beab8b4ed929ae6bfa1eb97adeb849a5e
UHC 淫쇔떽陰먮젿堊앸죰陰⑥꽱蒻쎌넀溜긴풚濡뗭넚^ 11101011111000101011110011100101101101101011110111101011111001001001000011101011101000001011000111100100101111101001110111101011101000011000101111101011111001001010100011101100100001001011110011100101101101101011110111101100100001101001000011101010111111101011000111100100101111101001110111101011101000011000101111101100100001101010000101011110 ebe2bce5b6bdebe490eba0b1e4be9deba18bebe4a8ec84bce5b6bdec8690eafeb1e4be9deba18bec86a15e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
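
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper name to_bit_strings is hypothetical. Unencodable characters are replaced with "?", which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a short sample string.
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("淫^", codec)
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("^", "utf-8") returns ("01011110", "5e"), which is the final byte of every row in the table.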