What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 止?障?乳?牽魏?v止?障?乳?牽魏?vB 10001110011111100011111110001111111000010011111110010011111110110011111110001100101000011110100110110000001111110111011010001110011111100011111110001111111000010011111110010011111110110011111110001100101000011110100110110000001111110111011001000010 8e7e3f8fe13f93fb3f8ca1e9b03f768e7e3f8fe13f93fb3f8ca1e9b03f7642
EUC-JP 止?障?乳?牽魏?v止?障?乳?牽魏?vB 10111011110111110011111110111110111000110011111111000110111111010011111110111000101000111111001010110010001111110111011010111011110111110011111110111110111000110011111111000110111111010011111110111000101000111111001010110010001111110111011001000010 bbdf3fbee33fc6fd3fb8a3f2b23f76bbdf3fbee33fc6fd3fb8a3f2b23f7642
UTF-8 止렮障렔乳縷牽魏렏v止렮障렔乳縷牽魏렏vB 111001101010110110100010111010111010000010101110111010011001101010011100111010111010000010010100111001001011100110110011111011111010010110010000111001111000100110111101111010011010110110001111111010111010000010001111011101101110011010101101101000101110101110100000101011101110100110011010100111001110101110100000100101001110010010111001101100111110111110100101100100001110011110001001101111011110100110101101100011111110101110100000100011110111011001000010 e6ada2eba0aee99a9ceba094e4b9b3efa590e789bde9ad8feba08f76e6ada2eba0aee99a9ceba094e4b9b3efa590e789bde9ad8feba08f7642
UHC 止렮障렔乳縷牽魏렏v止렮障렔乳縷牽魏렏vB 111100101010110110001110101110111110111010100001100011101010100111101010111000011101001011101010110011001011001011101010111000001000111010100101011101101111001010101101100011101011101111101110101000011000111010101001111010101110000111010010111010101100110010110010111010101110000010001110101001010111011001000010 f2ad8ebbeea18ea9eae1d2eaccb2eae08ea576f2ad8ebbeea18ea9eae1d2eaccb2eae08ea57642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
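
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation: the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and encode_table. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the 3f bytes visible in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("止vB").items():
        print(label, bits, hex_string)
```

For instance, encode_row("止", "utf-8") yields the hex string e6ada2, the same three bytes that open the UTF-8 row above.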