Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????????㎞??????????????? 00111111001111110011111100111111001111110011111100111111001111111000011101110001001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f87713f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 梨붿㎚吏좎콡吏몄㎞梨꾩콌吏몄콠吏싴삟梨쀬갹吏쒖콝 111011111010011110100010111010111011011010111111111000111000111010011010111011111010011110011110111011001010001010001110111011001011110110100001111011111010011110011110111010111010101010000100111000111000111010011110111011111010011110100010111010101011111010101001111011001011110110001100111011111010011110011110111010111010101010000100111011001011110110100000111011111010011110011110111011001000101110110100111011001000001010011111111011111010011110100010111011001000000010101100111010101011000010111001111011111010011110011110111011001001001010010110111011001011110110011101 efa7a2ebb6bfe38e9aefa79eeca28eecbda1efa79eebaa84e38e9eefa7a2eabea9ecbd8cefa79eebaa84ecbda0efa79eec8bb4ec829fefa7a2ec80aceab0b9efa79eec9296ecbd9d
UHC 梨붿㎚吏좎콡吏몄㎞梨꾩콌吏몄콠吏싴삟梨쀬갹吏쒖콝 111011001011000110010100111011001010011110101100111011001010011110100000111011001011000110011001111011001010011110111000111011001010011110110000111011001011000110000100111011001011000110001000111011001010011110111000111011001011000110011000111011001010011110011010111011011001100010100010111011001011000110010111111011001011000010111101111011001010011110011100111011001011000110010101 ecb194eca7aceca7a0ecb199eca7b8eca7b0ecb184ecb188eca7b8ecb198eca79aed98a2ecb197ecb0bdeca79cecb195

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
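
The conversion this page performs can be approximated with a short Python sketch. It is not the code behind this tool. The codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are assumed equivalents chosen from Python's standard codecs, and `encode_report` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("㎞"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.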