What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堊??繹??曄?????吟?????怨??韋 10011010101111110011111100111111111000111000100000111111001111111001111001000000001111110011111100111111001111110011111110001011111000010011111100111111001111110011111100111111100010011000010100111111001111111110100011101000 9abf3f3fe3883f3f9e403f3f3f3f3f8be13f3f3f3f3f89853f3fe8e8
EUC-JP 堊??繹??曄?????吟?????怨??韋 11010100110000010011111100111111111001011110100000111111001111111101101110100001001111110011111100111111001111110011111110110110111000110011111100111111001111110011111100111111101100011110010100111111001111111111000011101010 d4c13f3fe5e83f3fdba13f3f3f3f3fb6e33f3f3f3f3fb1e53f3ff0ea
UTF-8 堊앸젉繹먮젾曄됯퀋溜잙떧吟닸틦溜곁쪏怨곷젧韋 111001011010000010001010111011001001010110111000111011001010000010001001111001111011100110111001111010111010100010101110111011001010000010111110111001101001101110000100111010111001000010101111111011011000000010001011111011111010011110001011111011001001111010011001111010111001011010100111111001011001000010011111111010111000101110111000111011011000101110100110111011111010011110001011111010101011001110000001111011001010101010001111111001101000000010101000111010101011001110110111111011001010000010100111111010011001111110001011 e5a08aec95b8eca089e7b9b9eba8aeeca0bee69b84eb90afed808befa78bec9e99eb96a7e5909feb8bb8ed8ba6efa78beab381ecaa8fe680a8eab3b7eca0a7e99f8b
UHC 堊앸젉繹먮젾曄됯퀋溜잙떧吟닸틦溜곁쪏怨곷젧韋 1110010010111110100111011110101110100000100010111110011010111010100100001110101110100000101100001110011110100101100010011110101010110011100000011110101011111110100111111110101110001011101110101110101111100001101101001110011010111010100100001110101011111110101100001110011110100101100010011110101010110011100000011110101110100000100111111110101011011111 e4be9deba08be6ba90eba0b0e7a589eab381eafe9feb8bbaebe1b4e6ba90eafeb0e7a589eab381eba09feadf

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
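
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the function name `encode_in_charsets` are assumptions made for illustration, and Python's codecs may differ from this page in a few edge-case mappings. Characters that a charset cannot represent are replaced with "?" (3f), mirroring the behavior seen in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Each byte becomes 8 binary digits, concatenated without separators.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in encode_in_charsets("あ").items():
        print(label, binary, hexa)
```

For example, the hiragana "あ" encodes to `82a0` in CP932, `a4a2` in EUC-JP, and `e38182` in UTF-8, while ISO-8859-1 cannot represent it and yields `3f`.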