What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淫??陰??陰ъ?淫??蒻??淫??淫??^ 100010001111101000111111001111111000100101000001001111110011111110001001010000011000010010001100001111111000100011111010001111110011111111100100111010000011111100111111100010001111101000111111001111111000100011111010001111110011111101011110 88fa3f3f89413f3f8941848c3f88fa3f3fe4e83f3f88fa3f3f88fa3f3f5e
EUC-JP 淫??陰??陰ъ?淫??蒻??淫??淫??^ 101100001111110000111111001111111011000110100010001111110011111110110001101000101010011111101100001111111011000011111100001111110011111111101000111010100011111100111111101100001111110000111111001111111011000011111100001111110011111101011110 b0fc3f3fb1a23f3fb1a2a7ec3fb0fc3f3fe8ea3f3fb0fc3f3fb0fc3f3f5e
UTF-8 淫쇔떽陰잛꽱陰ъ꽱淫좎꽱蒻쎌넖淫쇱넭淫쇱꽑^ 111001101011011110101011111011001000011110010100111010111001011010111101111010011001100110110000111011001001111010011011111010101011110110110001111010011001100110110000110100011000101011101010101111011011000111100110101101111010101111101100101000101000111011101010101111011011000111101000100100101011101111101100100011101000110011101011100001001001011011100110101101111010101111101100100001111011000111101011100001001010110111100110101101111010101111101100100001111011000111101010101111011001000101011110 e6b7abec8794eb96bde999b0ec9e9beabdb1e999b0d18aeabdb1e6b7abeca28eeabdb1e892bbec8e8ceb8496e6b7abec87b1eb84ade6b7abec87b1eabd915e
UHC 淫쇔떽陰잛꽱陰ъ꽱淫좎꽱蒻쎌넖淫쇱넭淫쇱꽑^ 11101011111000101011110011100101101101101011110111101011111001001001111111101100100001001011110011101011111001001010110011101100100001001011110011101011111000101010000011101100100001001011110011100101101101101011110111101100100001101001111111101011111000101011110011101100100001101010110011101011111000101011110011101100100001001010000001011110 ebe2bce5b6bdebe49fec84bcebe4acec84bcebe2a0ec84bce5b6bdec869febe2bcec86acebe2bcec84a05e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
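
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec corresponds to SJIS-Win and `cp949` to UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table rows suggest. The helper name `to_bit_strings` and the `CHARSETS` mapping are hypothetical.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with '?' (0x3f),
    which is an assumption about how the page handles them.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("^", codec)
        print(label, binary, hexadecimal)
```

For the ASCII character `^`, every charset listed here yields the single byte `5e`, which matches the final byte of each row in the table.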